Enriching Europeana Newspapers aims to expose, via EUDAT, the existing corpora of historic newspapers created as part of the Europeana Newspapers project, including enrichments of these corpora in the form of detected topics and named entities. The work with EUDAT will provide another channel for the corpus to be discoverable by the research community (beyond the existing portal at www.theeuropeanlibrary.org/tel4/newspapers and experimental text dumps via http://research.europeana.eu/blogpost/experimental-text-dumps-from-europeana-newspapers). It will be of particular interest to researchers who wish to explore the corpus from a big data perspective, i.e. those who do not only want to search and browse via the traditional portal, but wish access to the full dataset for text and data mining. The Europeana Newspapers dataset contains over 11 million full text pages drawn from over 20 libraries across Europe, and approaches 1 terabyte of plain text in total. Over 40 languages are represented. (Note: the portal also presents over 11 million scanned images of the newspapers but these are not included in this particular proposal for the pilot). Many of the libraries are National Libraries, but some research libraries are represented as well. Most of the newspapers are drawn from the nineteenth and early twentieth century’s. Nearly all the text of the newspapers is in the public domain.