The Scholastic Commentaries and Texts Archive is a metadata archive automatically extracted from distributed work on the scholastic corpus. This archive, and accompanying apis, functions as an open web service to support new and open development on the scholastic corpus.
Three main types of information are collected in this archive.
The publication information: The first aim is provide interested scholars with publication metadata to quickly assess which parts of any given commentary or text have been edited, which parts are in progress, and which parts have not been started. It further aims to provide information about whether a text has been encoded in the TEI XML standard, when it was first published, how many subsequent editions exist, and whether or not editors are looking for collaborators or reviewers.
The content information: The second aim is to provide metadata about the content of the commentaries and texts themselves. This metadata provides information about many authors or works have been mentioned, referenced, or quoted. It will also provide metadata about the use of key phrases or topics used in any given section. The use of this metadata combined with other data streams will allow for robust searches and queries. For example, a user could query every article that discusses "faith" and quotes the book of Hebrews at least once. Or again, a user could ask how many times Robert Holcot is mentioned by authors working at Paris after the year 1350.
The linking information: The third aim is to provide a clear API that will allow other applications to be built on top of the archive. The linking data stream in particular will provide references to various places that the text can be found, including HTML output, plain text (useful for large scale text indexing) the development repository, and any institutional repositories where the raw XML text has been deposited for long term preservation.
In this way, the archive aims to support the continued work on Scholastic commentary and text transcriptions in a way that paves the way for future unforeseen possibilities. This is perhaps most visible in the promise of large scale corpus analysis or topic modeling. But there are many other possibilities including close readings of similar parts or sections across commentaries, visualizations of networked relationships between authors, and analysis of the changing structures of the commentaries themselves.