Natural language annotation for machine learning

James Pustejovsky and Amber Stubbs on machine learning best practices.

James Pustejovsky (@jamespusto) is an O’Reilly author and professor of computer science at Brandeis. Amber Stubbs (@amber_stubbs) is an O’Reilly author and post doc at SUNY Albany.

We sat down to talk about natural language annotation as it relates to machine learning. James and Amber reviewed methods, best practices, and what they see coming in the future.

Highlights from the conversation include:

  • Learn why it is important to create your own corpus for machine learning. [Discussed 20 seconds in.]
  • Discover different methods for creating a corpus. [Discussed at the 6:15 mark.]
  • Understand the MATTER Annotation Development Process. [Discussed at the 9:58 mark.]
  • Hear what James and Amber see coming next for machine learning. [Discussed at the 15:23 mark.]

You can view the entire interview in the following video.

tags: , , , , ,