nlp4arc 2018

Event Information
2 February 2018 9:00am – 5:00pm
Dey Hall, Toy Lounge, University of North Carolina, Chapel Hill, North Carolina

About the Symposium
BitCurator NLP will host “nlp4arc – Enabling New Forms of Access to Primary Sources through Natural Language Processing.” The event will focus on the application of natural language processing (NLP) to support use, access, and analysis of digital primary source materials. Click here to register.

nlp4arc 2018
February 2, 2018 – 9:00am – 5:00pm Dey Hall, Toy Lounge
University of North Carolina
Chapel Hill, North Carolina
Suggested hashtag: #nlp4arc


9:00 – 9:15 Welcome and introduction – Cal Lee
9:15-10:30 Foundations and Strategies

  • Michael Piotrowski, University of Lausanne – Historical Texts, NLP, and Formal Models
  • Daniel Pitti, University of Virginia – Name Entities, Named Entities, Facts in Contexts
  • Carl Wilson, Open Preservation Foundation – Not Just Building Tools: Strategies for Sustaining Software and Associated Communities
  • Mark Matienzo, Stanford University – Practical and Ethical Considerations of NLP Applied to Humanitarian Digital Libraries
10:30-10:45 Break
10:45-12:00 Implementation and Projects

  • Mary Elings, University of California, Berkeley – Using NLP to Support Dynamic Arrangement, Description, and Discovery of Born Digital Collections
  • Jeremy Gibson and Nitin Arora, North Carolina Department of Natural and Cultural Resources – “Honey, I Tagged the Email! Now What?”: NLP and the TOMES Project
  • Ryan Shaw, University of North Carolina at Chapel Hill – Gathering Specimens to Augment Authority Files
  • Stéfan Sinclair, McGill University – Spyral Notebooks: Some Reasons Why the World Needs Yet Another Jupyter
12:00-12:30 Panel on NLP Lessons Learned

  • Jaime Arguello, University of North Carolina at Chapel Hill
  • Stephanie Haas, University of North Carolina at Chapel Hill
12:30-1:30 Lunch
1:30-2:15 Enabling Technologies

  • Laney McGlohon, ArchiveSpace – Finding the Data: The Use of a Data
    Dictionary in Retrieving Descriptive Metadata from ArchivesSpace
  • Kam Woods and Cal Lee, University of North Carolina at Chapel Hill – BitCurator NLP Development and Plans
2:15-2:45 Generation of Breakout Topics
2:45-3:00 Break
3:00-3:45 Breakout Sessions
3:45-4:15 Reporting Back from Breakout Sessions
4:15-5:00 Wrap Up and Next Steps

nlp4arc is being administered through BitCurator NLP, a project funded by the Andrew W. Mellon Foundation. Click here for a story about the project, or visit