Safari Books Online is a digital library providing on-demand subscription access to thousands of learning resources.
Structured collections of annotated linguistic data are essential in most areas of NLP; however, we still face many obstacles in using them. The goal of this chapter is to answer the following questions:
How do we design a new language resource and ensure that its coverage, balance, and documentation support a wide range of uses?
When existing data is in the wrong format for some analysis tool, how can we convert it to a suitable format?
What is a good way to document the existence of a resource we have created so that others can easily find it?