PLOS & The Hacker Within: mining scientific articles tutorial & hackathon
Have you ever wanted to learn how to mine the text and data from scientific articles? Come join us at The Hacker Within for a tutorial and mini-hackathon!
First will be a brief tutorial on the basic structure of XML documents, the JATS XML structure used by PLOS and other scientific publishers, as well as the XML parsing tools in allofplos, a Python library that downloads and parses PLOS articles. Then we'll have some time to mine the corpus, contribute to the allofplos codebase, or whatever else you want to do with hundreds of thousands of research articles at your fingertips!
The tutorial portion will be broadcast live and recorded on YouTube. While a working knowledge of Python is helpful, we will also have .csv documents of allofplos's metadata that can be parsed in R.
Please sign up here if you are planning to attend in person. We may not have spaces for those who don't register.
Pizza will be provided.
Location
Dates
to 29th November 2017 - 08:00 PM