First round of transcriptions
We end 2015 on a happy note, having transcribed 522 children's scripts and 21 basal readers!
The children's scripts are classified by year (1979: 246 files; 1988: 276 files) and by communicative function (to argue/Rule task: 261 files; to narrate/story task: 261 files).
To date we have produced the TEI-XML version, with headers displaying bibliographic metadata. We are now producing the untagged .TXT version with normalised spelling, from which we will derive the tagged versions with POS tagging (CLAWS7) and semantic tagging (USAS). We expect to finish this stage in January 2016. More soon!
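As a rough illustration of the step from the TEI-XML files to an untagged text version, the sketch below pulls the running text out of a TEI document while leaving the header metadata behind. It is only a minimal sketch under stated assumptions: the sample document, its contents, and the function name `tei_body_to_text` are hypothetical, the files are assumed to use the standard TEI namespace with the transcription under the `text` element, and no spelling normalisation is attempted here.

```python
import xml.etree.ElementTree as ET

# Standard TEI namespace (assumed to be the one used in the corpus files).
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical sample document, invented for illustration only.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Hypothetical script</title></titleStmt>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <p>Once upon a time</p>
      <p>there was a   dog.</p>
    </body>
  </text>
</TEI>"""


def tei_body_to_text(xml_string):
    """Return the running text of the TEI <text> element, without the header."""
    root = ET.fromstring(xml_string)
    text_el = root.find("tei:text", TEI_NS)
    if text_el is None:
        return ""
    # Concatenate all text nodes, then collapse runs of whitespace.
    raw = "".join(text_el.itertext())
    return " ".join(raw.split())


plain = tei_body_to_text(SAMPLE)
print(plain)
```

The resulting string could then be written to a .TXT file; the bibliographic metadata stays in the TEI header and is not copied into the plain text.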