A GUI for annotation in the Penn Treebank format


Annotald is a program for annotating parsed corpora in the Penn Treebank format. For more information on the format (as instantiated by the Penn Parsed Corpora of Historical English), see the documentation by Beatrice Santorini. Annotald was originally written by Anton Ingason as part of the Icelandic Parsed Historical Corpus project. It is currently being developed by him along with Jana Beck and Aaron Ecay.
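For readers unfamiliar with the format, the sketch below shows roughly what a bracketed Penn Treebank-style tree looks like and how it can be read into a nested structure. It is an illustration only and not part of Annotald: the sentence is invented, the functions `parse_tree` and `leaves` are hypothetical helpers, and the tree is simplified (real corpus files typically wrap each tree in an outer pair of parentheses together with metadata such as an ID node).

```python
# A minimal sketch of reading a bracketed tree; not Annotald's own code.
# parse_tree and leaves are hypothetical helper names used for illustration.

def parse_tree(text):
    """Parse one bracketed tree into nested (label, children) tuples."""
    # Pad the parentheses with spaces so that split() isolates them as tokens.
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def parse_node():
        nonlocal pos
        if tokens[pos] != "(":
            raise ValueError("expected '(' at token %d" % pos)
        pos += 1                      # consume "("
        label = tokens[pos]           # first token after "(" is the node label
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(parse_node())   # nested constituent
            else:
                children.append(tokens[pos])    # terminal (a word)
                pos += 1
        pos += 1                      # consume ")"
        return (label, children)

    tree = parse_node()
    if pos != len(tokens):
        raise ValueError("trailing material after tree")
    return tree


def leaves(node):
    """Return the words of a parsed tree in left-to-right order."""
    _label, children = node
    words = []
    for child in children:
        if isinstance(child, str):
            words.append(child)
        else:
            words.extend(leaves(child))
    return words


# An invented example sentence with labels in the style of the
# Penn Parsed Corpora of Historical English.
EXAMPLE = "(IP-MAT (NP-SBJ (PRO He)) (VBD saw) (NP-OB1 (D the) (N dog)) (. .))"

tree = parse_tree(EXAMPLE)
print(tree[0])        # IP-MAT
print(leaves(tree))   # ['He', 'saw', 'the', 'dog', '.']
```

Each pair of parentheses encloses a node: the first token is its label, and the remaining material is either a word or further nested nodes. Annotald presents this structure graphically so that it can be edited without writing brackets by hand.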

Obtaining Annotald

Annotald development is hosted on GitHub, where you can view or download the program’s source code. The latest release is available as a Python package; install it with the command pip install annotald. (Further information about installation is available in the user’s manual.)

Using Annotald

The Annotald user’s manual can be found online. For developers, there is also automatically generated API documentation.


License

Annotald is available under the terms of the GNU General Public License (GPL) version 3 or (at your option) any later version. Please see the LICENSE file included with the source code for more information.

Funding Sources

Annotald development has been funded by:

  • Icelandic Research Fund (RANNÍS), grant #090662011: “Viable Language Technology beyond English – Icelandic as a Test Case”
  • The research funds of Anthony Kroch at the University of Pennsylvania.