TNTC Project

This site releases the metalanguages and data developed under the JSPS Grant-in-Aid for Scientific Research (S) 19H05660: “Developing a translation process model and constructing an integrated translation environment through detailed descriptions of translation norms and competences.”

Translation Metalanguages (TML)

We release the following sets of metalanguages.

The development and use of the metalanguages are described in Miyata et al. (2022) and Yamada et al. (2020).

Translation-related Datasets

MultiEnJa

A set of 46 examples of English source documents with several types of translation-related derivatives, including professional translations and post-edited machine translation outputs.

ParaNatCom

A parallel English-Japanese corpus of abstracts from Nature Communications articles.

Staged PE Dataset

Examples of translation issues and their revisions, collected through two-stage post-editing (PE) of machine translation (MT) outputs.

Software

To be released.

References

Rei Miyata, Masaru Yamada, and Kyo Kageura (2022) Metalanguages for Dissecting Translation Processes: Theoretical Development and Practical Applications. London: Routledge. [Corrigendum]
Masaru Yamada, Mayuka Yamamoto, Nanami Onishi, Atsushi Fujita, Rei Miyata, and Kyo Kageura (2020) “Metalanguage for the Translation Process,” Book of Abstracts: Translation in Transition (TT5), pp. 46–51.