Registration for nlp4arc 2018 is now open

We are pleased to announce that registration for the BitCurator NLP Forum 2018 (nlp4arc) is now open. The event will focus on the application of natural language processing (NLP) to support use, access, and analysis of digital primary source materials. Click here to register.

A rapidly growing body of materials with significant cultural value are “born digital.” Information professionals must be prepared to extract digital materials from their original environments and media in ways that reflect the rich metadata and ensure the integrity of the materials. They must also support new forms of access: allowing users to make sense of materials and understand their context.

There are many types of contextual information that can be vital to making sense and meaningful use of digital objects. These can include objects, agents, occurrences, purposes, times, places, form of expressions, concepts/abstractions and relationships.

There are many existing open-source tools that libraries, archives and museums (LAMs) can use to identify, extract and expose such contextual entities from the wide diversity of born-digital materials that LAMs already hold and continue to receive. NLP tools and methods can help to both (1) facilitate curatorial decision making and description, and (2) generate access points to be presented to end users.

nlp4arc 2018
February 2, 2018 – 9:00am – 5:00pm Dey Hall, Toy Lounge
University of North Carolina
Chapel Hill, North Carolina
Suggested hashtag: #nlp4arc


Program

9:00 – 9:15 Welcome and introduction – Cal Lee
9:15-10:30 Foundations and Strategies

  • Michael Piotrowski, University of Lausanne – Historical Texts, NLP, and Formal Models
  • Daniel Pitti, University of Virginia – Name Entities, Named Entities, Facts in Contexts
  • Carl Wilson, Open Preservation Foundation – Not Just Building Tools: Strategies for Sustaining Software and Associated Communities
  • Mark Matienzo, Stanford University – Practical and Ethical Considerations of NLP Applied to Humanitarian Digital Libraries
10:30-10:45 Break
10:45-12:00 Implementation and Projects

  • Mary Elings, University of California, Berkeley – Using NLP to Support Dynamic Arrangement, Description, and Discovery of Born Digital Collections
  • Jeremy Gibson and Nitin Arora, North Carolina Department of Natural and Cultural Resources – “Honey, I Tagged the Email! Now What?”: NLP and the TOMES Project
  • Ryan Shaw, University of North Carolina at Chapel Hill – Gathering Specimens to Augment Authority Files
  • Stéfan Sinclair, McGill University – Spyral Notebooks: Some Reasons Why the World Needs Yet Another Jupyter
12:00-12:30 Panel on NLP Lessons Learned

  • Jaime Arguello, University of North Carolina at Chapel Hill
  • Stephanie Haas, University of North Carolina at Chapel Hill
12:30-1:30 Lunch
1:30-2:15 Enabling Technologies

  • Laney McGlohon, ArchiveSpace – Finding the Data: The Use of a Data
    Dictionary in Retrieving Descriptive Metadata from ArchivesSpace
  • Kam Woods and Cal Lee, University of North Carolina at Chapel Hill – BitCurator NLP Development and Plans
2:15-2:45 Generation of Breakout Topics
2:45-3:00 Break
3:00-3:45 Breakout Sessions
3:45-4:15 Reporting Back from Breakout Sessions
4:15-5:00 Wrap Up and Next Steps

Registration
General Registration – $30
Student & BitCurator Consortium Members Registration – $15
Register here.

Accommodations
Please see the list of nearby hotels below.

The Carolina Inn 211 Pittsboro Street
Chapel Hill, NC 27516
Tel 800.962.8519
(This is the closest option. It is on the UNC Campus, just a couple of blocks from the Student Union.)

Hampton Inn & Suites Chapel Hill Carrboro/Downtown
370 East Main Street, Unit 100
Carrboro, North Carolina 27510
Tel 919.969.6988
(Walkable distance)

Holiday Inn Express Chapel Hill
6119 Farrington Road
Chapel Hill, NC 27517
Tel 919.489.7555

Aloft Chapel Hill
1001 South Hamilton Road
Chapel Hill, NC 27517
Tel 866.716.8143
(Shuttle buses available)

Registration for nlp4arc is now open

We are pleased to announce that registration for the BitCurator NLP Forum 2017 (nlp4arc) is now open. The event will focus on the application of natural language processing (NLP) to support use, access, and analysis of digital primary source materials. Click here to register.

A rapidly growing body of materials with significant cultural value are “born digital.” Information professionals must be prepared to extract digital materials from their original environments and media in ways that reflect the rich metadata and ensure the integrity of the materials. They must also support new forms of access: allowing users to make sense of materials and understand their context.

There are many types of contextual information that can be vital to making sense and meaningful use of digital objects. These can include objects, agents, occurrences, purposes, times, places, form of expressions, concepts/abstractions and relationships.

There are many existing open-source tools that libraries, archives and museums (LAMs) can use to identify, extract and expose such contextual entities from the wide diversity of born-digital materials that LAMs already hold and continue to receive. NLP tools and methods can help to both (1) facilitate curatorial decision making and description, and (2) generate access points to be presented to end users.

The day will include a series of short talks by internationally-recognized experts, followed by a set of participant-driven unconference discussions. Speakers will include:

Event Information
Date: 3 February 2017 9:00am – 5:00pm
Location: Student Union rooms 3206A and 3206B, University of North Carolina, Chapel Hill, North Carolina

Registration
General Registration – $30
Student & BitCurator Consortium Members Registration – $15
Register here.

Accommodations
Please see the list of nearby hotels below.

The Carolina Inn 211 Pittsboro Street
Chapel Hill, NC 27516
Tel 800.962.8519
(This is the closest option. It is on the UNC Campus, just a couple of blocks from the Student Union.)

Hampton Inn & Suites Chapel Hill Carrboro/Downtown
370 East Main Street, Unit 100
Carrboro, North Carolina 27510
Tel 919.969.6988
(Walkable distance)

Holiday Inn Express Chapel Hill
6119 Farrington Road
Chapel Hill, NC 27517
Tel 919.489.7555

Aloft Chapel Hill
1001 South Hamilton Road
Chapel Hill, NC 27517
Tel 866.716.8143
(Shuttle buses available)

BitCurator NLP Announced!

The University of North Carolina at Chapel Hill has received a grant for $750,000 from the Andrew W. Mellon Foundation to support BitCurator NLP, a project that will develop software and protocols for the application of natural language processing (NLP) methods to born-digital library, archives and museum (LAM) collections.

See the full press release on the UNC SILS site.

CurateGear + BUF 2016 Wrap Up!

Thanks to everyone who attended and/or participated in CurateGear and the BitCurator Users Forum!

CurateGear 2016 was an interactive day-long event focused on digital curation tools and methods. Participants saw demonstrations, heard about the latest developments, and discussed application in professional contexts. The event was sponsored by the School of Information and Library Science at the University of North Carolina at Chapel Hill and the Andrew W. Mellon Foundation (through the BitCurator Access project).

Presentation slides for CurateGear are now available here.
Abridged notes from each presentation can be found here.

We’ll be posting more BUF wrap up soon – stay tuned! In the meantime, check out the twitter feed from the event.