This interface allows researchers to download copyrighted corpora for which the University has a license. At present, the collection is limited mainly to text and speech transcript corpora. If you're looking for data used in specific experiments by Cognitive Computation Group members, please visit our Data page.
This page is currently under transition as we move corpora from Illinois to Penn and work out a new licensing plan.
Thank you for your patience.
Entering values in the fields below and clicking Submit will display a list of the corpora we have that match the specified criteria.
{{item.name}} | {{item.notes}} |