Commit

Update TECH_DOC.md
KaterinaRysova authored Nov 20, 2024
1 parent e6066b6 commit f6d5cb8
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion data_preparation/70.releasing/TECH_DOC.md
@@ -1,4 +1,4 @@
-# Technical Documentation
+# Database of Spoken Czech as a Foreign Language (Permanent Residency in the Czech Republic): Technical Documentation

The language corpus of spoken performances by non-native speakers of Czech, focused on the A2 language level (according to the CEFR), required for obtaining permanent residency in the Czech Republic, is the result of a project implemented at the Institute of Formal and Applied Linguistics of the Faculty of Mathematics and Physics, Charles University. The corpus contains recordings capturing the oral part of the [Czech Language Certificate Exam](https://ujop.cuni.cz/UJOPEN-70.html?ujopcmsid=12:czech-language-certificate-exam-cce) at the A2 level. The recordings include dialogues between the examiner (a native speaker) and the candidate (a non-native speaker). We have provided transcriptions of the recordings, enriched with extensive linguistic annotations. Some recordings are accompanied by multiple transcriptions from different annotators, allowing for comparisons of various transcriptions of the same recording and the assessment of the degree of agreement when converting spoken language into written text.
