This page allows University of Pennsylvania researchers to search for and download copyrighted NLP corpora for which the University has a license. At present, the collection is limited mainly to text and speech transcript corpora.


This page contains data used in experiments by Cognitive Computation Group members. Many of the datasets are linked to specific publications. Most are accessible to the general public; however, some are copyrighted and directly accessible only to University of Pennsylvania researchers.