diff --git a/data_preparation/70.releasing/html/db_residency-cs.html b/data_preparation/70.releasing/html/db_residency-cs.html
index d95f2a5..a418142 100644
--- a/data_preparation/70.releasing/html/db_residency-cs.html
+++ b/data_preparation/70.releasing/html/db_residency-cs.html
@@ -149,20 +149,51 @@
 			<div stle='margin-top: 0px; margin-bottom: 20px;'><span class=langon>EN</span> | <span class=langoff><a href='/services/teitok-live/evaldio/cs/index.php?action=db_residency-cs'>CS</a></span></div><p class='header'><a href='index.php'>Evaldio</a></p><ul style='text-align: left'><li><a href='index.php?action=databases'>Databases</a><li><a href='index.php?action=browser'>Browse</a><li><a href='index.php?action=cqp'>Search</a><li><a target="repository" href='http://hdl.handle.net/11234/1-5731'>Download</a></ul><ul style='text-align: left'><li><a href='index.php?action=login' >Login</a></ul><hr style='opacity: 0.5; margin-top: 40px;'><p id=powby style='opacity: 0.5; font-size: smaller;'><span onClick="window.open('http://www.teitok.org/index.php', 'teitok');">Powered by <span style='font-family: Courier;'>&lt;TEI:TOK&gt;</span></span><br><span onClick="window.open('http://www.teitok.org/index.php?action=credits', 'teitok');">Maarten Janssen, 2014-</a></p>
         	</div>
     <div id="main">
-		<h1>Datab&aacute;ze mluven&yacute;ch projevů v če&scaron;tině jako ciz&iacute;m jazyce (trval&yacute; pobyt v ČR)</h1>
-<p dir="auto">Jazykov&yacute; korpus byl vytvořen v &Uacute;stavu form&aacute;ln&iacute; a aplikovan&eacute; lingvistiky Matematicko-fyzik&aacute;ln&iacute; fakulty Univerzity Karlovy za &uacute;čelem podpory v&yacute;uky, v&yacute;zkumu a hodnocen&iacute; jazykov&eacute; kompetence nerodil&yacute;ch mluvč&iacute;ch če&scaron;tiny. C&iacute;lem je poskytnout strukturovan&yacute; a snadno př&iacute;stupn&yacute; zdroj autentick&yacute;ch mluven&yacute;ch dat pro lingvisty, pedagogy, studenty, veřejnost a vědeckou komunitu. Korpus se zaměřuje na jazykovou &uacute;roveň A2, kter&aacute; je potřebn&aacute; pro udělen&iacute; trval&eacute;ho pobytu v Česk&eacute; republice. Audionahr&aacute;vky pro datab&aacute;zi poskytl &Uacute;stav jazykov&eacute; a odborn&eacute; př&iacute;pravy Univerzity Karlovy (ujop.cuni.cz).</p>
-<h3><a href="index.php?action=browser&amp;class=database&amp;val=Datab&aacute;ze+mluven&yacute;ch+projevů+v+če&scaron;tině+jako+ciz&iacute;m+jazyce+%28trval&yacute;+pobyt+v+ČR%29">Vstup do korpusu &ndash; prohl&iacute;žen&iacute;</a></h3>
-<h3><a href="https://lindat.mff.cuni.cz/services/teitok-live/evaldio/cs/index.php?action=cqp">Hled&aacute;n&iacute; v korpusu</a></h3>
-<h3>Popis korpusu</h3>
-<h3>Technick&aacute; dokumentace</h3>
-<h3>Uživatelsk&aacute; př&iacute;ručka</h3>
-<h3 dir="auto"><a href="https://ufal.mff.cuni.cz/automated-speech-scoring-czech">Str&aacute;nky projektu</a></h3>
-<h3 dir="auto">Financov&aacute;n&iacute;</h3>
-<p dir="auto">Vznik datab&aacute;ze byl financov&aacute;n z prostředků Programu na podporu aplikovan&eacute;ho v&yacute;zkumu v oblasti n&aacute;rodn&iacute; a kulturn&iacute; identity na l&eacute;ta 2023 až 2030 (NAKI III) Ministerstva kultury ČR v r&aacute;mci projektu <em>Automatick&eacute; hodnocen&iacute; mluven&eacute;ho projevu v če&scaron;tině</em> (DH23P03OVV037).</p>
-<h3 dir="auto">Jak citovat</h3>
-<p>Rysov&aacute; Kateřina, Nov&aacute;k Michal, Rysov&aacute; Magdal&eacute;na, Pol&aacute;k Peter, Bojar Ondřej: <em>Datab&aacute;ze mluven&yacute;ch projevů v če&scaron;tině jako ciz&iacute;m jazyce (trval&yacute; pobyt v ČR)</em>. &Uacute;stav form&aacute;ln&iacute; a aplikovan&eacute; lingvistiky MFF UK, Praha 2024. Dostupn&aacute; z WWW&nbsp;<a href="https://lindat.mff.cuni.cz/services/teitok-live/evaldio/cs/index.php?action=db_residency" rel="nofollow">https://lindat.mff.cuni.cz/services/teitok-live/evaldio/cs/index.php?action=db_residency</a>.</p>
-<p>&nbsp;</p>
-<p>&nbsp;</p>
+		<h1 id="datab&aacute;ze-mluven&yacute;ch-projevů-v-če&scaron;tině-jako-ciz&iacute;m-jazyce-trval&yacute;-pobyt-v-čr">Datab&aacute;ze mluven&yacute;ch projevů v če&scaron;tině jako ciz&iacute;m jazyce (trval&yacute; pobyt v ČR)</h1>
+<p>Datab&aacute;ze mluven&yacute;ch projevů v če&scaron;tině jako ciz&iacute;m jazyce (trval&yacute; pobyt v ČR) je jazykov&yacute; korpus mluven&yacute;ch projevů nerodil&yacute;ch mluvč&iacute;ch če&scaron;tiny zaměřen&yacute; na jazykovou &uacute;roveň A2 (podle SERR), požadovanou pro udělen&iacute; trval&eacute;ho pobytu v Česk&eacute; republice. Obsahuje nahr&aacute;vky zaznamen&aacute;vaj&iacute;c&iacute; &uacute;stn&iacute; č&aacute;st <a href="http://ujop.cuni.cz/cce">Certifikovan&eacute; zkou&scaron;ky z če&scaron;tiny pro cizince</a>. Nahr&aacute;vky zahrnuj&iacute; dialogy mezi zkou&scaron;ej&iacute;c&iacute;m (rodil&yacute;m mluvč&iacute;m) a kandid&aacute;tem zkou&scaron;ky (nerodil&yacute;m mluvč&iacute;m). Kromě nahr&aacute;vek korpus obsahuje tak&eacute; jejich přepisy, kter&eacute; jsou opatřeny bohatou lingvistickou anotac&iacute;. K někter&yacute;m nahr&aacute;vk&aacute;m je připojeno v&iacute;ce přepisů od různ&yacute;ch anot&aacute;torů, což umožňuje srovn&aacute;n&iacute; různ&yacute;ch přepisů t&eacute;že nahr&aacute;vky a vyhodnocen&iacute; m&iacute;ry shody při převodu mluven&eacute; řeči do psan&eacute;ho textu.</p>
+<p>Korpus je zveřejněn jako specializovan&aacute; veřejn&aacute; datab&aacute;ze s c&iacute;lem poskytnout strukturovan&yacute; a snadno př&iacute;stupn&yacute; zdroj autentick&yacute;ch mluven&yacute;ch dat pro lingvisty, pedagogy, studenty, vědeckou komunitu a &scaron;irokou veřejnost.</p>
+<p>Jazykov&yacute; korpus byl vytvořen v <a href="https://ufal.mff.cuni.cz/">&Uacute;stavu form&aacute;ln&iacute; a aplikovan&eacute; lingvistiky Matematicko-fyzik&aacute;ln&iacute; fakulty Univerzity Karlovy</a> za &uacute;čelem podpory v&yacute;uky, v&yacute;zkumu a hodnocen&iacute; jazykov&eacute; kompetence nerodil&yacute;ch mluvč&iacute;ch če&scaron;tiny v r&aacute;mci projektu <a href="https://ufal.mff.cuni.cz/automated-speech-scoring-czech"><em>Automatick&eacute; hodnocen&iacute; mluven&eacute;ho projevu v če&scaron;tině</em></a>. Audionahr&aacute;vky poskytl <a href="https://ujop.cuni.cz/">&Uacute;stav jazykov&eacute; a odborn&eacute; př&iacute;pravy Univerzity Karlovy</a> (ujop.cuni.cz).</p>
+<h2 id="statistiky">Statistiky</h2>
+<p>Datab&aacute;ze obsahuje 63 nahr&aacute;vek zachycuj&iacute;c&iacute;ch stejn&yacute; počet zkou&scaron;ek a stejn&yacute; počet nerodil&yacute;ch mluvč&iacute;ch. Celkov&aacute; d&eacute;lka v&scaron;ech nahr&aacute;vek je 3h 15min 40s. Tabulka n&iacute;že ukazuje statistiky přepisů, přičemž pro každou nahr&aacute;vku byl vybr&aacute;n pr&aacute;vě jeden kanonick&yacute; přepis.</p>
+<table>
+<thead>
+<tr class="header">
+<th>&nbsp;</th>
+<th style="text-align: right;">V&scaron;echny</th>
+<th style="text-align: right;">Kanonick&eacute;</th>
+</tr>
+</thead>
+<tbody>
+<tr class="odd">
+<td>Soubory</td>
+<td style="text-align: right;">106</td>
+<td style="text-align: right;">63</td>
+</tr>
+<tr class="even">
+<td>Repliky</td>
+<td style="text-align: right;">4 773</td>
+<td style="text-align: right;">2 888</td>
+</tr>
+<tr class="odd">
+<td>Tokeny</td>
+<td style="text-align: right;">33 267</td>
+<td style="text-align: right;">20 035</td>
+</tr>
+</tbody>
+</table>
+<h2 id="dokumentace">Dokumentace</h2>
+<ul>
+<li><a href="index.php?action=db_residency_manual">Uživatelsk&aacute; př&iacute;ručka</a></li>
+<li><a href="index.php?action=db_residency_techdoc">Technick&aacute; dokumentace</a></li>
+</ul>
+<h2 id="licence">Licence</h2>
+<p>Korpus je zveřejněn pod licenc&iacute; CC BY-NC-SA 4.0.</p>
+<h2 id="financov&aacute;n&iacute;">Financov&aacute;n&iacute;</h2>
+<p>Vznik datab&aacute;ze byl financov&aacute;n z prostředků Programu na podporu aplikovan&eacute;ho v&yacute;zkumu v oblasti n&aacute;rodn&iacute; a kulturn&iacute; identity na l&eacute;ta 2023 až 2030 (NAKI III) Ministerstva kultury ČR v r&aacute;mci projektu <em>Automatick&eacute; hodnocen&iacute; mluven&eacute;ho projevu v če&scaron;tině</em> (DH23P03OVV037).</p>
+<h2 id="poděkov&aacute;n&iacute;">Poděkov&aacute;n&iacute;</h2>
+<p>Autoři datab&aacute;ze srdečně děkuj&iacute; PhDr. Pavlovi Pečen&eacute;mu, Ph.D., z &Uacute;stavu jazykov&eacute; a odborn&eacute; př&iacute;pravy Univerzity Karlovy za poskytnut&iacute; audiodat.</p>
+<h2 id="jak-citovat">Jak citovat</h2>
+<p>Rysov&aacute; Kateřina, Nov&aacute;k Michal, Rysov&aacute; Magdal&eacute;na, Pol&aacute;k Peter, Bojar Ondřej: <em>Datab&aacute;ze mluven&yacute;ch projevů v če&scaron;tině jako ciz&iacute;m jazyce (trval&yacute; pobyt v ČR)</em>. &Uacute;stav form&aacute;ln&iacute; a aplikovan&eacute; lingvistiky MFF UK, Praha 2024. Dostupn&aacute; z WWW <a href="https://lindat.mff.cuni.cz/services/teitok-live/evaldio/cs/index.php?action=db_residency">https://lindat.mff.cuni.cz/services/teitok-live/evaldio/cs/index.php?action=db_residency</a>.</p>
 	</div>
 </div>
 
diff --git a/data_preparation/70.releasing/html/db_residency.html b/data_preparation/70.releasing/html/db_residency.html
index 6efc8d4..a5cdf79 100644
--- a/data_preparation/70.releasing/html/db_residency.html
+++ b/data_preparation/70.releasing/html/db_residency.html
@@ -149,10 +149,51 @@
 			<div stle='margin-top: 0px; margin-bottom: 20px;'><span class=langon>EN</span> | <span class=langoff><a href='/services/teitok-live/evaldio/cs/index.php?action=db_residency'>CS</a></span></div><p class='header'><a href='index.php'>Evaldio</a></p><ul style='text-align: left'><li><a href='index.php?action=databases'>Databases</a><li><a href='index.php?action=browser'>Browse</a><li><a href='index.php?action=cqp'>Search</a><li><a target="repository" href='http://hdl.handle.net/11234/1-5731'>Download</a></ul><ul style='text-align: left'><li><a href='index.php?action=login' >Login</a></ul><hr style='opacity: 0.5; margin-top: 40px;'><p id=powby style='opacity: 0.5; font-size: smaller;'><span onClick="window.open('http://www.teitok.org/index.php', 'teitok');">Powered by <span style='font-family: Courier;'>&lt;TEI:TOK&gt;</span></span><br><span onClick="window.open('http://www.teitok.org/index.php?action=credits', 'teitok');">Maarten Janssen, 2014-</a></p>
         	</div>
     <div id="main">
-		<h1>Database of Spoken Czech as a Foreign Language (Permanent Residency in the Czech Republic)</h1>
-<p><span style="font-size: 11pt; font-family: 'arial', sans-serif; color: #000000; background-color: transparent; font-weight: 400; font-style: normal; font-variant: normal; text-decoration: none; vertical-align: baseline; white-space: pre-wrap;">The database was funded by the Programme to Support Applied Research in the Area of the National and Cultural Identity for the Years 2023 to 2030 (NAKI III) of the Ministry of Culture of the Czech Republic within the project <em>Automated Speech Scoring in Czech</em> (DH23P03OVV037).</span></p>
-<p>Rysov&aacute; Kateřina, Nov&aacute;k Michal, Rysov&aacute; Magdal&eacute;na, Pol&aacute;k Peter, Bojar Ondřej: <em>Database of Spoken Czech as a Foreign Language (Permanent Residency in the Czech Republic)</em>. Institute of Formal and Applied Linguistics MFF UK, Prague 2024. Available from WWW <a href="https://lindat.mff.cuni.cz/services/teitok-live/evaldio/en/index.php?action=db_residency" rel="nofollow">https://lindat.mff.cuni.cz/services/teitok-live/evaldio/en/index.php?action=db_residency</a>.</p>
-<p><a href="index.php?action=browser&amp;class=database&amp;val=Datab&aacute;ze+mluven&yacute;ch+projevů+v+če&scaron;tině+jako+ciz&iacute;m+jazyce+%28trval&yacute;+pobyt+v+ČR%29">Enter Corpus</a></p>
+		<h1 id="database-of-spoken-czech-as-a-foreign-language-permanent-residency-in-the-czech-republic">Database of Spoken Czech as a Foreign Language (Permanent Residency in the Czech Republic)</h1>
+<p>The Database of Spoken Czech as a Foreign Language (Permanent Residency in the Czech Republic) is a language corpus of spoken performances by non-native speakers of Czech, focused on the A2 level (according to the CEFR), which is required for the granting of permanent residency in the Czech Republic. It includes recordings of the oral part of the <a href="https://ujop.cuni.cz/UJOPEN-70.html?ujopcmsid=12:czech-language-certificate-exam-cce">Czech Language Certificate Exam</a>. The recordings consist of dialogues between the examiner (a native speaker) and the candidate (a non-native speaker). In addition to the recordings, the corpus contains their transcriptions, which carry rich linguistic annotation. Some recordings are accompanied by multiple transcriptions from different annotators, which makes it possible to compare transcripts of the same recording and to evaluate the degree of agreement in converting spoken language into written text.</p>
+<p>The corpus is published as a specialized public database aimed at providing a structured and easily accessible source of authentic spoken data for linguists, educators, students, the scientific community, and the general public.</p>
+<p>The corpus was created at the <a href="https://ufal.mff.cuni.cz/">Institute of Formal and Applied Linguistics at the Faculty of Mathematics and Physics, Charles University</a> to support teaching, research, and assessment of language competence among non-native speakers of Czech as part of the project <a href="https://ufal.mff.cuni.cz/automated-speech-scoring-czech"><em>Automated Speech Scoring in Czech</em></a>. Audio recordings were provided by the <a href="https://ujop.cuni.cz/UJOPEN-1.html">Institute for Language and Preparatory Studies, Charles University</a> (ujop.cuni.cz).</p>
+<h2 id="statistics">Statistics</h2>
+<p>The database contains 63 recordings, covering the same number of exams and the same number of non-native speakers. The total length of all recordings is 3 h 15 min 40 s. The table below shows transcription statistics for all transcriptions and for the canonical subset, in which exactly one canonical transcription is selected for each recording.</p>
+<table>
+<thead>
+<tr class="header">
+<th>&nbsp;</th>
+<th style="text-align: right;">All</th>
+<th style="text-align: right;">Canonical</th>
+</tr>
+</thead>
+<tbody>
+<tr class="odd">
+<td>Files</td>
+<td style="text-align: right;">106</td>
+<td style="text-align: right;">63</td>
+</tr>
+<tr class="even">
+<td>Utterances</td>
+<td style="text-align: right;">4,773</td>
+<td style="text-align: right;">2,888</td>
+</tr>
+<tr class="odd">
+<td>Tokens</td>
+<td style="text-align: right;">33,267</td>
+<td style="text-align: right;">20,035</td>
+</tr>
+</tbody>
+</table>
+<h2 id="documentation">Documentation</h2>
+<ul>
+<li><a href="index.php?action=db_residency_manual">User Manual</a></li>
+<li><a href="index.php?action=db_residency_techdoc">Technical Documentation</a></li>
+</ul>
+<h2 id="license">License</h2>
+<p>The corpus is published under the CC BY-NC-SA 4.0 license.</p>
+<h2 id="acknowledgment">Acknowledgment</h2>
+<p>The database was funded by the Programme to Support Applied Research in the Area of the National and Cultural Identity for the Years 2023 to 2030 (NAKI III) of the Ministry of Culture of the Czech Republic within the project <em>Automated Speech Scoring in Czech</em> (DH23P03OVV037).</p>
+<h2 id="special-thanks">Special Thanks</h2>
+<p>The authors of the database sincerely thank PhDr. Pavel Pečen&yacute;, Ph.D., from the Institute for Language and Preparatory Studies, Charles University, for providing the audio data.</p>
+<h2 id="how-to-cite">How to Cite</h2>
+<p>Rysov&aacute; Kateřina, Nov&aacute;k Michal, Rysov&aacute; Magdal&eacute;na, Pol&aacute;k Peter, Bojar Ondřej: <em>Database of Spoken Czech as a Foreign Language (Permanent Residency in the Czech Republic)</em>. Institute of Formal and Applied Linguistics MFF UK, Prague 2024. Available from WWW <a href="https://lindat.mff.cuni.cz/services/teitok-live/evaldio/en/index.php?action=db_residency">https://lindat.mff.cuni.cz/services/teitok-live/evaldio/en/index.php?action=db_residency</a>.</p>
 	</div>
 </div>
 
diff --git a/data_preparation/70.releasing/html/db_residency_manual-cs.html b/data_preparation/70.releasing/html/db_residency_manual-cs.html
new file mode 100644
index 0000000..388ca85
--- /dev/null
+++ b/data_preparation/70.releasing/html/db_residency_manual-cs.html
@@ -0,0 +1,360 @@
+
+<!DOCTYPE html>
+<html>
+<head>
+<title>Evaldio</title>
+<meta charset="utf-8" />
+<meta name="viewport" content="width=device-width, initial-scale=1">
+<link href='https://fonts.googleapis.com/css?family=Cousine:400|Roboto:300,400,400italic,700,700italic|Roboto+Condensed:400,700&amp;subset=latin,latin-ext' rel='stylesheet' type='text/css'>
+<link href='//lindat.mff.cuni.cz/services/teitok-live/themes/lindat/css/font-awesome.min.css' rel='stylesheet' type='text/css'>
+
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/services/teitok/css/common.css" />
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/services/teitok/css/view.css" />
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/aai/discojuice/discojuice.css" />
+
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/aai/discojuice/discojuice.css" />
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/media-dark.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/Scripts/teitok.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/xmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="Resources/xmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/htmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="Resources/htmlstyles.css" media="screen">
+
+<!-- plug-ins dependencies -->
+<script type="text/javascript" src="//code.jquery.com/jquery-3.3.1.min.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/aai/discojuice/discojuice-2.1.en.min.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/aai/aai.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/services/teitok/ufal/idp.js"></script>
+<!-- --------------------- -->
+
+</head>
+<body>
+
+
+<div class="lindat-common2 lindat-common-header">
+<header data-version="3.0.5" data-build="05eff1186f12528f221a63b021c7b7dc81301429">
+    <nav class="lindat-navbar lindat-navbar-expand-lg lindat-justify-content-between lindat-navbar-dark ">
+        <div class="lindat-block lindat-block--clariah-theme-branding">
+            <a href="https://lindat.mff.cuni.cz/" class="lindat-navbar-brand lindat-d-flex lindat-align-items-center " aria-label="">
+                <img src="https://lindat.mff.cuni.cz/sites/default/files/LINDAT-CLARIAH-cz-gray_0.svg" width="auto" height="53" style="height: 53px !important;" alt="LINDAT/CLARIAH-CZ logo" class="" />
+            </a>
+        </div>
+        <button class="lindat-navbar-toggler" type="button" data-toggle="collapse" data-target=".lindat-navbar-collapse" aria-controls="lindat-navbar-collapse" aria-expanded="false" aria-label="Toggle navigation"
+                onclick="this.parentNode.querySelector('.lindat-navbar-toggler+div.lindat-collapse.lindat-navbar-collapse').classList.toggle('lindat-show')">
+            <span class="lindat-navbar-toggler-icon"></span>
+        </button>
+        <div class="lindat-collapse lindat-navbar-collapse">
+            <div class="">
+                <div class="lindat-block lindat-block--clariah-theme-main-menu">
+                    <ul class="lindat-nav lindat-navbar-nav">
+                        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#search" class="lindat-nav-link "
+                                    
+                                    
+                                    >Search</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.mff.cuni.cz/repository/xmlui/?locale-attribute=en" class="lindat-nav-link "
+                                    
+                                    
+                                    >Catalogue</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#education" class="lindat-nav-link "
+                                    
+                                    
+                                    >Education</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#projects" class="lindat-nav-link "
+                                    
+                                    
+                                    >Projects</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#tools" class="lindat-nav-link "
+                                    
+                                    
+                                    >Tools</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/en/services" class="lindat-nav-link "
+                                    
+                                    
+                                    >Services</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item lindat-dropdown">
+              <a href="https://lindat.cz/" class="lindat-nav-link lindat-dropdown-toggle"
+                                     data-toggle="dropdown"
+                                     onclick="this.parentNode.querySelector('.lindat-dropdown-toggle+div.lindat-dropdown-menu').classList.toggle('lindat-show'); return false;"
+                                    >About</a>
+              <div class="lindat-dropdown-menu">
+               <a href="https://lindat.cz/partners" class="lindat-dropdown-item">Partners</a>
+            
+               <a href="https://lindat.cz/files/mission-en.pdf" class="lindat-dropdown-item">Mission Statement</a>
+            
+               <a href="https://www.clarin.eu/" class="lindat-dropdown-item">CLARIN</a>
+            
+               <a href="https://www.dariah.eu/" class="lindat-dropdown-item">DARIAH</a>
+            
+               <a href="https://lindat.cz/integration" class="lindat-dropdown-item">Service integrations</a>
+            
+               <a href="https://lindat.cz/partnership" class="lindat-dropdown-item">Project partnerships</a>
+            </div>
+          </li>
+        
+                    </ul>
+                </div>
+            </div>
+            <div class="lindat-block lindat-block--clariah-theme-account-menu">
+                <ul class="lindat-nav lindat-navbar-nav">
+                    <li class="lindat-nav-item" id="margin-filler"></li>
+                    <li class="lindat-nav-item  ">
+                        <a class="lindat-nav-link lindat-nav-link-dariah" href="https://www.dariah.eu/"><img src="https://lindat.mff.cuni.cz/images/dariah-eu.png" alt="DARIAH logo" /></a>
+                    </li>
+                    <li class="lindat-nav-item  ">
+                        <a class="lindat-nav-link lindat-nav-link-clarin" href="https://www.clarin.eu/"><img src="https://lindat.mff.cuni.cz/images/clarin.png" alt="CLARIN logo" /></a>
+                    </li>
+                </ul>
+            </div>
+            <slot name="languageswitcher"></slot>
+        </div>
+    </nav>
+</header>
+</div>
+    
+
+
+<div id="localization-bar"></div>
+<div id="content">
+        	<div id='menubox'>
+        				<p class='header main'><a href='http://lindat.mff.cuni.cz/services/teitok/index.php'>TEITOK</a></p>
+			<ul style='text-align: left' class='teitok'>
+			<li><a href='index.php?action=login'>Login</a></li>
+			<li><a href='http://lindat.mff.cuni.cz/services/teitok/index.php?action=corplist'>Available Corpora</a></li>
+			</ul>
+			<div style='margin-top: 0px; margin-bottom: 20px;'><span class=langon>EN</span> | <span class=langoff><a href='/services/teitok-live/evaldio/cs/index.php?action=db_residency_manual-cs'>CS</a></span></div><p class='header'><a href='index.php'>Evaldio</a></p><ul style='text-align: left'><li><a href='index.php?action=databases'>Databases</a><li><a href='index.php?action=browser'>Browse</a><li><a href='index.php?action=cqp'>Search</a><li><a target="repository" href='http://hdl.handle.net/11234/1-5731'>Download</a></ul><ul style='text-align: left'><li><a href='index.php?action=login' >Login</a></ul><hr style='opacity: 0.5; margin-top: 40px;'><p id=powby style='opacity: 0.5; font-size: smaller;'><span onClick="window.open('http://www.teitok.org/index.php', 'teitok');">Powered by <span style='font-family: Courier;'>&lt;TEI:TOK&gt;</span></span><br><span onClick="window.open('http://www.teitok.org/index.php?action=credits', 'teitok');">Maarten Janssen, 2014-</span></p>
+        	</div>
+    <div id="main">
+		<h1 id="uživatelsk&aacute;-př&iacute;ručka">Uživatelsk&aacute; př&iacute;ručka</h1>
+<p>Z&aacute;kladn&iacute; funkce datab&aacute;ze zahrnuj&iacute; prohl&iacute;žen&iacute; z&aacute;znamů s různ&yacute;mi způsoby jejich zobrazen&iacute;, filtrov&aacute;n&iacute; z&aacute;znamů podle různ&yacute;ch kategori&iacute; a komplexn&iacute; vyhled&aacute;v&aacute;n&iacute; v obsahu datab&aacute;ze. Datab&aacute;ze rovněž umožňuje st&aacute;hnout korpus jako celek nebo st&aacute;hnout vybran&eacute; z&aacute;znamy.</p>
+<h2 id="prohl&iacute;žen&iacute;-z&aacute;znamů">Prohl&iacute;žen&iacute; z&aacute;znamů</h2>
+<p>Po vstupu do korpusu se v přehledn&eacute; tabulce zobraz&iacute; v&scaron;echny z&aacute;znamy (tj. soubory transkriptů) uložen&eacute; v datab&aacute;zi. Pro každ&yacute; soubor s transkriptem tabulka kromě n&aacute;zvu souboru zobrazuje v dal&scaron;&iacute;ch sloupc&iacute;ch &uacute;roveň a identifik&aacute;tor zkou&scaron;ky, č&iacute;slo &uacute;lohy, zdroj předběžn&eacute; anotace, k&oacute;d anot&aacute;tora a informaci o tom, zda je přepis pro danou nahr&aacute;vku kanonick&yacute;. Soubory v tabulce je možn&eacute; tř&iacute;dit podle hodnot vybran&eacute;ho sloupce. Z&aacute;znamy lze tak&eacute; filtrovat na z&aacute;kladě libovoln&eacute;ho podřetězce v n&aacute;zvu souboru zad&aacute;n&iacute;m tohoto podřetězce do textov&eacute;ho pole &ldquo;Search&rdquo; um&iacute;stěn&eacute;ho vpravo nad tabulkou. Kliknut&iacute;m na konkr&eacute;tn&iacute; soubor se tento soubor zobraz&iacute;.</p>
+<h2 id="zobrazen&iacute;-souboru">Zobrazen&iacute; souboru</h2>
+<p>Datab&aacute;ze umožňuje prohl&iacute;žet přepisy jednotliv&yacute;ch replik spolu s anotacemi a metadaty a tak&eacute; poslouchat př&iacute;slu&scaron;n&eacute; zvukov&eacute; nahr&aacute;vky. Charakter zobrazen&yacute;ch informac&iacute; se li&scaron;&iacute; podle zvolen&eacute;ho režimu zobrazen&iacute;, mezi kter&yacute;mi lze přep&iacute;nat v doln&iacute; č&aacute;sti str&aacute;nky pod samotn&yacute;m přepisem.</p>
+<h3 id="režim-text-view">Režim Text View</h3>
+<p>Text View je z&aacute;kladn&iacute; režim zobrazen&iacute;, kter&yacute; se objev&iacute; po otevřen&iacute; souboru. V horn&iacute; č&aacute;sti obrazovky se nach&aacute;z&iacute; hlavička s n&aacute;zvem přepisu a vybran&yacute;mi metadaty. V doln&iacute; č&aacute;sti je zobrazen samotn&yacute; přepis, rozdělen&yacute; na repliky. Každ&aacute; replika je označena identifik&aacute;torem mluvč&iacute;ho (EXAM_1 pro zkou&scaron;ej&iacute;c&iacute;ho a CAND_1 pro kandid&aacute;ta).</p>
+<p>Tento režim rovněž umožňuje zobrazit automatickou morfologickou anotaci a lemmatizaci. Po najet&iacute; kurzorem na konkr&eacute;tn&iacute; token se zobraz&iacute; př&iacute;slu&scaron;n&aacute; anotace v kontextu. Pro zobrazen&iacute; vybran&eacute;ho atributu pro v&scaron;echny tokeny v přepisu lze využ&iacute;t ovl&aacute;dac&iacute; prvky um&iacute;stěn&eacute; pod hlavičkou, kter&eacute; obsahuj&iacute; n&aacute;sleduj&iacute;c&iacute; tlač&iacute;tka:</p><ul><li>PoS: Zobraz&iacute; slovn&iacute; druhy.</li><li>Tag: Uk&aacute;že morfologick&eacute; tagy.</li><li>Features: Poskytne podrobn&eacute; morfologick&eacute; informace.</li><li>Lemma: Zobraz&iacute; z&aacute;kladn&iacute; tvary slov.</li></ul>
+<h3 id="režim-waveform-view">Režim Waveform View</h3>
+<p>V horn&iacute; č&aacute;sti obrazovky se nach&aacute;z&iacute; roz&scaron;&iacute;řen&yacute; ovl&aacute;dac&iacute; prvek pro přehr&aacute;v&aacute;n&iacute; nahr&aacute;vky, kter&yacute; zobrazuje graf sign&aacute;lu (tzv. waveform). Pod n&iacute;m jsou zobrazeny přepisy jednotliv&yacute;ch replik. Kliknut&iacute;m na konkr&eacute;tn&iacute; repliku se tato replika přehraje.</p>
+<h3 id="režim-dependencies">Režim Dependencies</h3>
+<p>Tento režim zobrazuje syntaktickou anotaci. Po kliknut&iacute; na konkr&eacute;tn&iacute; repliku se zobraz&iacute; automaticky vygenerovan&yacute; z&aacute;vislostn&iacute; strom, u nějž je možn&eacute; zobrazit detaily pomoc&iacute; my&scaron;i. Vpravo nahoře od stromu se nach&aacute;z&iacute; tlač&iacute;tko &equiv; pro dal&scaron;&iacute; možnosti zobrazen&iacute; stromu. Je tak možn&eacute; uspoř&aacute;dat uzly podle slovosledu, zobrazit interpunkci nebo uložit obr&aacute;zek stromu ve form&aacute;tu SVG.</p>
+<h2 id="filtrov&aacute;n&iacute;-z&aacute;znamů-přes-kategorie">Filtrov&aacute;n&iacute; z&aacute;znamů přes kategorie</h2>
+<p>Po kliknut&iacute; na tlač&iacute;tko <em>Kategorie</em> v lev&eacute;m hlavn&iacute;m menu je možn&eacute; filtrovat přepisy na z&aacute;kladě hodnot jednotliv&yacute;ch kategori&iacute;. Např&iacute;klad je tak možn&eacute; zobrazit si pouze seznam kanonick&yacute;ch přepisů nebo přepisů od konkr&eacute;tn&iacute;ho anot&aacute;tora.</p>
+<h2 id="vyhled&aacute;v&aacute;n&iacute;">Vyhled&aacute;v&aacute;n&iacute;</h2>
+<p>Vyhled&aacute;v&aacute;n&iacute; v korpusu lze prov&aacute;dět na str&aacute;nce, kter&aacute; se zobraz&iacute; po kliknut&iacute; na tlač&iacute;tko <em>Hledat</em> v lev&eacute;m hlavn&iacute;m menu. Str&aacute;nka umožňuje zad&aacute;vat dotazy ve form&aacute;tu CQL (Corpus Query Language). Např.</p>
+<blockquote>
+<p><code>[upos = "NUM.*"] [lemma = "ot&aacute;zka"]</code></p>
+<p>pro nalezen&iacute; tvarů slova <em>ot&aacute;zka</em>, jimž předch&aacute;z&iacute; č&iacute;slovka.</p>
+</blockquote>
+<p>Pro usnadněn&iacute; vyhled&aacute;v&aacute;n&iacute; nab&iacute;z&iacute; rozhran&iacute; TEITOK n&aacute;stroj pro sestavov&aacute;n&iacute; dotazů. Tento n&aacute;stroj umožňuje snadno definovat jednoduch&eacute; dotazy v CQL prostřednictv&iacute;m formul&aacute;ře. Stač&iacute; kliknout na ikonu <em>Query builder</em>, definovat svůj dotaz a pot&eacute; stisknout tlač&iacute;tko <em>Create query</em>, č&iacute;mž se dotaz vlož&iacute; do textov&eacute;ho pole CQL, kde jej můžete př&iacute;padně upravit.</p>
+<p>V z&aacute;kladn&iacute;m nastaven&iacute; TEITOK prov&aacute;d&iacute; vyhled&aacute;v&aacute;n&iacute; v cel&eacute;m korpusu, kter&yacute; může obsahovat k jedn&eacute; nahr&aacute;vce v&iacute;ce přepisů. Pokud chcete vyhled&aacute;vat pouze v t&eacute; č&aacute;sti korpusu, v n&iacute;ž je ke každ&eacute; nahr&aacute;vce přiřazen&yacute; jen jedin&yacute; přepis, je nutn&eacute; omezit hled&aacute;n&iacute; na tzv. kanonick&eacute; přepisy. Např.</p>
+<blockquote>
+<p><code>[lemma = "situace"] :: match.text_canonical = "1"</code></p>
+<p>vyhled&aacute;v&aacute; lemma <em>situace</em> jenom v kanonick&yacute;ch přepisech.</p>
+</blockquote>
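+<p>Obě omezen&iacute; lze pravděpodobně spojit do jednoho dotazu. N&aacute;sleduj&iacute;c&iacute; př&iacute;klad je pouze ilustračn&iacute; n&aacute;črt složen&yacute; z obou dotazů uveden&yacute;ch v&yacute;&scaron;e; předpokl&aacute;d&aacute;, že glob&aacute;ln&iacute; podm&iacute;nku za <code>::</code> lze připojit i k dotazu o v&iacute;ce tokenech. Např.</p>
+<blockquote>
+<p><code>[upos = "NUM.*"] [lemma = "ot&aacute;zka"] :: match.text_canonical = "1"</code></p>
+<p>by měl naj&iacute;t tvary slova <em>ot&aacute;zka</em>, jimž předch&aacute;z&iacute; č&iacute;slovka, jenom v kanonick&yacute;ch přepisech.</p>
+</blockquote>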
+<h2 id="stahov&aacute;n&iacute;">Stahov&aacute;n&iacute;</h2>
+<p>Cel&yacute; korpus včetně nahr&aacute;vek a dokumentace je možn&eacute; st&aacute;hnout z hlavn&iacute;ho menu vlevo.</p>
+<p>Konkr&eacute;tn&iacute; přepis lze st&aacute;hnout v režimu <em>Text View</em> kliknut&iacute;m na tlač&iacute;tko <em>Download XML</em> um&iacute;stěn&eacute; v doln&iacute; č&aacute;sti str&aacute;nky.</p>
+	</div>
+</div>
+
+</div>
+
+
+
+<div class="lindat-common2 lindat-common-footer">
+ <footer data-version="3.0.5" data-build="05eff1186f12528f221a63b021c7b7dc81301429">
+    
+      <div id="about-lindat">
+        <h4><a href="https://lindat.cz/sites/default/files/2021-01/lindat_clariah_flyer.pdf">LINDAT/CLARIAH-CZ</a></h4>
+        <ul>
+          
+          <li><a href="https://lindat.cz/files/mission-en.pdf">Mission Statement</a></li>
+          
+          <li><a href="https://lindat.cz/ab">Advisory Board</a></li>
+          
+          <li><a href="https://lindat.cz/events">Events</a></li>
+          
+          <li><a href="https://www.clarin.eu/">CLARIN Participation</a></li>
+          
+          <li><a href="https://www.dariah.eu/">DARIAH Participation</a></li>
+          <br/>
+          <li><a href="https://lindat.cz/faq-repository">FAQ</a></li>
+          
+          <li><a href="mailto:lindat-help@ufal.mff.cuni.cz">Helpdesk</a></li>
+          
+          <li><a href="https://lindat.cz/user_feedback">User Feedback Form</a></li>
+          <br/>
+          <li><a href="https://lindat.cz/acknowledgement">Acknowledge LINDAT/CLARIAH-CZ</a></li>
+          
+        </ul>
+      </div>
+      
+      <div id="about-partners">
+        <h4><a href="https://lindat.cz/partners">Partners</a></h4>
+        <ul>
+          
+            <li>Charles University
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/mff-uk">Faculty <i>of</i> Mathematics <i>and</i> Physics</a></li>
+          
+          <li><a href="https://lindat.cz/partners/ff-uk">Faculty <i>of</i> Arts</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>Masaryk University
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/ff-mu">Faculty <i>of</i> Arts</a></li>
+          
+          <li><a href="https://lindat.cz/partners/fi-mu">Faculty  <i>of</i> Informatics</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>University of West Bohemia
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/zcu">Faculty <i>of</i> Applied Sciences</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>Czech Academy of Sciences
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/ujc">Czech Language Institute</a></li>
+          
+          <li><a href="https://lindat.cz/partners/knav">Library <i>of</i> Academy</a></li>
+          
+          <li><a href="https://lindat.cz/partners/hu">Institute <i>of</i> History</a></li>
+          
+          <li><a href="https://lindat.cz/partners/flu">Institute <i>of</i> Philosophy</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>Archives, Libraries and Galleries
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/nk">National Library <i>of the Czech Republic</i></a></li>
+          
+          <li><a href="https://lindat.cz/partners/mzk">Moravian Library <i>in Brno</i></a></li>
+          
+          <li><a href="https://lindat.cz/partners/ng">National Gallery Prague</a></li>
+          
+          <li><a href="https://lindat.cz/partners/nfa">National Film Archive</a></li>
+          
+                </ul>
+            </li>
+          
+        </ul>
+      </div>
+      
+      <div id="about-website">
+        <h4><a href="https://lindat.cz/services">Services</a></h4>
+        <ul>
+          
+          <li><a href="https://lindat.mff.cuni.cz/en/monitoring">Service Status</a></li>
+          
+          <li><a href="https://lindat.mff.cuni.cz/repository/xmlui/page/about?locale-attribute=en">About and Policies</a></li>
+          
+          <li><a href="https://lindat.mff.cuni.cz/en/terms-of-use">Terms of Use</a></li>
+          
+        </ul>
+      </div>
+      
+
+    <div id="badges-a">
+        <a href="https://www.clarin.eu/content/certified-centres"><img src="https://lindat.mff.cuni.cz/images/b-centre.png" alt="CLARIN CENTRE B" /></a>
+        <a href="https://www.clarin.eu/content/knowledge-centres"><img src="https://lindat.mff.cuni.cz/images/k-centre.png" alt="CLARIN CENTRE K" style="filter:brightness(0.88)" /></a>
+        <a href="https://www.coretrustseal.org/wp-content/uploads/2019/08/LINDAT-CLARIN.pdf"><img src="https://lindat.mff.cuni.cz/images/core-trust-seal-mono.png" alt="CoreTrustSeal Certification" /></a>
+    </div>
+
+    <div id="badges-b">
+        <a href="https://twitter.com/lindatclariahcz">Follow us on Twitter <img src="https://lindat.mff.cuni.cz/images/twitter-circular.svg" alt="Link to Profile" /></a>
+        <a href="https://lindat.cz/user/login"><img src="https://lindat.mff.cuni.cz/sites/default/files/LINDAT-CLARIAH-cz-gray_0.svg" alt="Home Page" /></a>
+    </div>
+
+    <div id="ack-msmt">
+        THE LINDAT/CLARIAH-CZ PROJECT (LM2018101; formerly LM2010013, LM2015071) IS FULLY SUPPORTED BY THE MINISTRY OF EDUCATION, SPORTS AND YOUTH OF THE CZECH REPUBLIC UNDER THE&#160;PROGRAMME LM OF "LARGE INFRASTRUCTURES"
+    </div>
+    <div id="ack-freepik">Icons ©  Smashicons and Freepik from flaticon.com licensed by <a href="https://creativecommons.org/licenses/by/3.0/">CC 3.0 BY</a></div>
+    <div id="ack-ufal">website © 2022 by <a href="https://ufal.mff.cuni.cz/">ÚFAL</a></div>
+    
+  <!-- TRACKING CODE -->
+
+  <script type="text/javascript">
+    //<![CDATA[
+    
+    (function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+        (i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+      m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+    })(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+    // main LINDAT/CLARIAH-CZ tracker
+    ga('create', 'UA-27008245-2', 'cuni.cz');
+    ga('send', 'pageview');
+      
+    //]]>
+  </script>
+
+  <!-- Piwik LINDAT/CLARIAH-CZ tracker -->
+  <script type="text/javascript">
+    //<![CDATA[
+    
+    var _paq = _paq || [];
+    _paq.push(["setDocumentTitle", document.domain + "/" + document.title]);
+    _paq.push(["setCookieDomain", "*.mff.cuni.cz"]);
+    _paq.push(["setDomains", ["*.mff.cuni.cz"]]);
+    _paq.push(['setCustomVariable', 1, "source", "common-theme", "page"]);
+    _paq.push(['trackPageView']);
+    _paq.push(['enableLinkTracking']);
+    (function() {
+      var u='//lindat.mff.cuni.cz/piwik/';
+      _paq.push(['setTrackerUrl', u+'piwik.php']);
+      _paq.push(['setSiteId', 2]);
+      var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
+      g.type='text/javascript'; g.async=true; g.defer=true; g.src=u+'piwik.js'; s.parentNode.insertBefore(g,s);
+    })();
+      
+    //]]>
+  </script>
+  <noscript><p><img src="//lindat.mff.cuni.cz/piwik/piwik.php?idsite=2" style="border:0;" alt="" /></p></noscript>
+  <!-- End Piwik Code -->
+  <!-- End TRACKING CODE -->
+      
+</footer>
+</div>
+    
+
+
+
+</body>
+</html>
diff --git a/data_preparation/70.releasing/html/residency.html b/data_preparation/70.releasing/html/db_residency_manual.html
similarity index 71%
rename from data_preparation/70.releasing/html/residency.html
rename to data_preparation/70.releasing/html/db_residency_manual.html
index 07b9af4..731d850 100644
--- a/data_preparation/70.releasing/html/residency.html
+++ b/data_preparation/70.releasing/html/db_residency_manual.html
@@ -146,13 +146,39 @@
 			<li><a href='index.php?action=login'>Login</a></li>
 			<li><a href='http://lindat.mff.cuni.cz/services/teitok/index.php?action=corplist'>Available Corpora</a></li>
 			</ul>
-			<div stle='margin-top: 0px; margin-bottom: 20px;'><span class=langon>EN</span> | <span class=langoff><a href='/services/teitok-live/evaldio/cs/index.php?action=residency'>CS</a></span></div><p class='header'><a href='index.php'>Evaldio</a></p><ul style='text-align: left'><li><a href='index.php?action=databases'>Databases</a><li><a href='index.php?action=browser'>Browse</a><li><a href='index.php?action=cqp'>Search</a><li><a target="repository" href='http://hdl.handle.net/11234/1-5731'>Download</a></ul><ul style='text-align: left'><li><a href='index.php?action=login' >Login</a></ul><hr style='opacity: 0.5; margin-top: 40px;'><p id=powby style='opacity: 0.5; font-size: smaller;'><span onClick="window.open('http://www.teitok.org/index.php', 'teitok');">Powered by <span style='font-family: Courier;'>&lt;TEI:TOK&gt;</span></span><br><span onClick="window.open('http://www.teitok.org/index.php?action=credits', 'teitok');">Maarten Janssen, 2014-</a></p>
+			<div style='margin-top: 0px; margin-bottom: 20px;'><span class=langon>EN</span> | <span class=langoff><a href='/services/teitok-live/evaldio/cs/index.php?action=db_residency_manual'>CS</a></span></div><p class='header'><a href='index.php'>Evaldio</a></p><ul style='text-align: left'><li><a href='index.php?action=databases'>Databases</a><li><a href='index.php?action=browser'>Browse</a><li><a href='index.php?action=cqp'>Search</a><li><a target="repository" href='http://hdl.handle.net/11234/1-5731'>Download</a></ul><ul style='text-align: left'><li><a href='index.php?action=login' >Login</a></ul><hr style='opacity: 0.5; margin-top: 40px;'><p id=powby style='opacity: 0.5; font-size: smaller;'><span onClick="window.open('http://www.teitok.org/index.php', 'teitok');">Powered by <span style='font-family: Courier;'>&lt;TEI:TOK&gt;</span></span><br><span onClick="window.open('http://www.teitok.org/index.php?action=credits', 'teitok');">Maarten Janssen, 2014-</span></p>
         	</div>
     <div id="main">
-		<h1>Datab&aacute;ze mluven&yacute;ch projevů v če&scaron;tině jako ciz&iacute;m jazyce (trval&yacute; pobyt v ČR)</h1>
-<p>TODO: popis</p>
-<p>&nbsp;</p>
-<p><a href="index.php?action=browser&amp;class=database&amp;val=Datab&aacute;ze+mluven&yacute;ch+projevů+v+če&scaron;tině+jako+ciz&iacute;m+jazyce+%28trval&yacute;+pobyt+v+ČR%29">List all the items</a></p>
+		<h1 id="user-manual">User Manual</h1>
+<p>The basic functions of the database include browsing records with various display options, filtering records by different categories, and performing complex searches within the database content. The database also allows users to download the entire corpus or selected records.</p>
+<h2 id="browsing-records">Browsing Records</h2>
+<p>When you enter the corpus, all records (i.e., transcript files) stored in the database are listed in a table. For each transcript file, the table shows the file name, the level and identifier of the exam, the task number, the source of the preliminary annotation, the annotator&rsquo;s code, and whether the transcript for that recording is canonical. The files in the table can be sorted by the values in a selected column. Records can also be filtered by any substring of the file name by entering the substring in the &ldquo;Search&rdquo; text box above the table on the right. Clicking on a specific file opens that file.</p>
+<h2 id="viewing-a-file">Viewing a File</h2>
+<p>The database allows users to view the transcripts of individual turns along with annotations and metadata, and to listen to the corresponding audio recordings. The nature of the displayed information varies according to the selected display mode, which can be switched at the bottom of the page below the transcript.</p>
+<h3 id="text-view-mode">Text View Mode</h3>
+<p>Text View is the basic display mode that appears upon opening a file. At the top of the screen is a header with the title of the transcript and selected metadata. The transcript itself is displayed at the bottom, divided into turns. Each turn is marked with the speaker&rsquo;s identifier (EXAM_1 for the examiner and CAND_1 for the candidate).</p>
+<p>This mode also allows users to view automatic morphological annotation and lemmatization. Hovering the cursor over a specific token displays the corresponding annotation in context. To display a selected attribute for all tokens in the transcript, use the controls located below the header, which include the following buttons:</p><ul><li>PoS: Displays parts of speech.</li><li>Tag: Shows morphological tags.</li><li>Features: Provides detailed morphological information.</li><li>Lemma: Displays base forms of words.</li></ul>
+<h3 id="waveform-view-mode">Waveform View Mode</h3>
+<p>At the top of the screen, there is an extended playback control for the recording, which displays a signal graph (i.e., waveform). Below it, the transcripts of individual turns are displayed. Clicking on a specific turn will play that turn.</p>
+<h3 id="dependencies-mode">Dependencies Mode</h3>
+<p>This mode displays syntactic annotation. Clicking on a specific turn displays an automatically generated dependency tree, with details available on mouse hover. In the upper right corner of the tree is a &equiv; button that opens additional display options. It is possible to arrange nodes by word order, display punctuation, or save an image of the tree in SVG format.</p>
+<h2 id="filtering-records-by-categories">Filtering Records by Categories</h2>
+<p>By clicking on the <em>Browse</em> button in the left main menu, users can filter transcripts based on the values of individual categories. For example, it is possible to display only a list of canonical transcripts or transcripts from a specific annotator.</p>
+<h2 id="searching">Searching</h2>
+<p>Searching within the corpus can be done on a page that appears after clicking the <em>Search</em> button in the left main menu. This page allows users to enter queries in CQL (Corpus Query Language) format. For example:</p>
+<blockquote>
+<p><code>[upos = "NUM.*"] [lemma = "ot&aacute;zka"]</code></p>
+<p>to find forms of the word <em>ot&aacute;zka</em> that are preceded by a numeral.</p>
+</blockquote>
+<p>To facilitate searching, the TEITOK interface provides a query builder tool. This tool allows users to easily define simple queries in CQL through a form. Just click the <em>Query builder</em> icon, define your query, and then press the <em>Create query</em> button, which inserts the query into the CQL text box where it can be further edited if needed.</p>
+<p>By default, TEITOK searches the entire corpus, which may contain multiple transcripts for a single recording. If you want to search only in the part of the corpus where each recording has only a single associated transcript, you must restrict the search to so-called canonical transcripts. For example:</p>
+<blockquote>
+<p><code>[lemma = "situace"] :: match.text_canonical = "1"</code></p>
+<p>searches for the lemma <em>situace</em> only in canonical transcripts.</p>
+</blockquote>
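+<p>The two examples above can be combined. As a sketch, assuming that the global constraint after <code>::</code> can follow a multi-token pattern in the same way as a single-token one, the query</p>
+<blockquote>
+<p><code>[upos = "NUM.*"] [lemma = "ot&aacute;zka"] :: match.text_canonical = "1"</code></p>
+<p>should find forms of <em>ot&aacute;zka</em> preceded by a numeral, restricted to canonical transcripts so that each recording is searched only once.</p>
+</blockquote>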
+<h2 id="downloading">Downloading</h2>
+<p>The entire corpus, including recordings and documentation, can be downloaded using the <em>Download</em> link in the main menu on the left.</p>
+<p>A specific transcript can be downloaded in <em>Text view</em> mode by clicking the <em>Download XML</em> button located at the bottom of the page.</p>
 	</div>
 </div>
 
diff --git a/data_preparation/70.releasing/html/db_residency_techdoc-cs.html b/data_preparation/70.releasing/html/db_residency_techdoc-cs.html
new file mode 100644
index 0000000..39aae98
--- /dev/null
+++ b/data_preparation/70.releasing/html/db_residency_techdoc-cs.html
@@ -0,0 +1,409 @@
+
+<!DOCTYPE html>
+<html>
+<head>
+<title>Evaldio</title>
+<meta charset="utf-8" />
+<meta name="viewport" content="width=device-width, initial-scale=1">
+<link href='https://fonts.googleapis.com/css?family=Cousine:400|Roboto:300,400,400italic,700,700italic|Roboto+Condensed:400,700&amp;subset=latin,latin-ext' rel='stylesheet' type='text/css'>
+<link href='//lindat.mff.cuni.cz/services/teitok-live/themes/lindat/css/font-awesome.min.css' rel='stylesheet' type='text/css'>
+
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/services/teitok/css/common.css" />
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/services/teitok/css/view.css" />
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/aai/discojuice/discojuice.css" />
+
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/aai/discojuice/discojuice.css" />
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/media-dark.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/Scripts/teitok.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/xmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="Resources/xmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/htmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="Resources/htmlstyles.css" media="screen">
+
+<!-- plug-ins dependencies -->
+<script type="text/javascript" src="//code.jquery.com/jquery-3.3.1.min.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/aai/discojuice/discojuice-2.1.en.min.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/aai/aai.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/services/teitok/ufal/idp.js"></script>
+<!-- --------------------- -->
+
+</head>
+<body>
+
+
+<div class="lindat-common2 lindat-common-header">
+<header data-version="3.0.5" data-build="05eff1186f12528f221a63b021c7b7dc81301429">
+    <nav class="lindat-navbar lindat-navbar-expand-lg lindat-justify-content-between lindat-navbar-dark ">
+        <div class="lindat-block lindat-block--clariah-theme-branding">
+            <a href="https://lindat.mff.cuni.cz/" class="lindat-navbar-brand lindat-d-flex lindat-align-items-center " aria-label="">
+                <img src="https://lindat.mff.cuni.cz/sites/default/files/LINDAT-CLARIAH-cz-gray_0.svg" width="auto" height="53" style="height: 53px !important;" alt="LINDAT/CLARIAH-CZ logo" class="" />
+            </a>
+        </div>
+        <button class="lindat-navbar-toggler" type="button" data-toggle="collapse" data-target=".lindat-navbar-collapse" aria-controls="lindat-navbar-collapse" aria-expanded="false" aria-label="Toggle navigation"
+                onclick="this.parentNode.querySelector('.lindat-navbar-toggler+div.lindat-collapse.lindat-navbar-collapse').classList.toggle('lindat-show')">
+            <span class="lindat-navbar-toggler-icon"></span>
+        </button>
+        <div class="lindat-collapse lindat-navbar-collapse">
+            <div class="">
+                <div class="lindat-block lindat-block--clariah-theme-main-menu">
+                    <ul class="lindat-nav lindat-navbar-nav">
+                        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#search" class="lindat-nav-link "
+                                    
+                                    
+                                    >Search</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.mff.cuni.cz/repository/xmlui/?locale-attribute=en" class="lindat-nav-link "
+                                    
+                                    
+                                    >Catalogue</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#education" class="lindat-nav-link "
+                                    
+                                    
+                                    >Education</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#projects" class="lindat-nav-link "
+                                    
+                                    
+                                    >Projects</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#tools" class="lindat-nav-link "
+                                    
+                                    
+                                    >Tools</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/en/services" class="lindat-nav-link "
+                                    
+                                    
+                                    >Services</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item lindat-dropdown">
+              <a href="https://lindat.cz/" class="lindat-nav-link lindat-dropdown-toggle"
+                                     data-toggle="dropdown"
+                                     onclick="this.parentNode.querySelector('.lindat-dropdown-toggle+div.lindat-dropdown-menu').classList.toggle('lindat-show'); return false;"
+                                    >About</a>
+              <div class="lindat-dropdown-menu">
+               <a href="https://lindat.cz/partners" class="lindat-dropdown-item">Partners</a>
+            
+               <a href="https://lindat.cz/files/mission-en.pdf" class="lindat-dropdown-item">Mission Statement</a>
+            
+               <a href="https://www.clarin.eu/" class="lindat-dropdown-item">CLARIN</a>
+            
+               <a href="https://www.dariah.eu/" class="lindat-dropdown-item">DARIAH</a>
+            
+               <a href="https://lindat.cz/integration" class="lindat-dropdown-item">Service integrations</a>
+            
+               <a href="https://lindat.cz/partnership" class="lindat-dropdown-item">Project partnerships</a>
+            </div>
+          </li>
+        
+                    </ul>
+                </div>
+            </div>
+            <div class="lindat-block lindat-block--clariah-theme-account-menu">
+                <ul class="lindat-nav lindat-navbar-nav">
+                    <li class="lindat-nav-item" id="margin-filler"></li>
+                    <li class="lindat-nav-item  ">
+                        <a class="lindat-nav-link lindat-nav-link-dariah" href="https://www.dariah.eu/"><img src="https://lindat.mff.cuni.cz/images/dariah-eu.png" alt="DARIAH logo" /></a>
+                    </li>
+                    <li class="lindat-nav-item  ">
+                        <a class="lindat-nav-link lindat-nav-link-clarin" href="https://www.clarin.eu/"><img src="https://lindat.mff.cuni.cz/images/clarin.png" alt="CLARIN logo" /></a>
+                    </li>
+                </ul>
+            </div>
+            <slot name="languageswitcher"></slot>
+        </div>
+    </nav>
+</header>
+</div>
+    
+
+
+<div id="localization-bar"></div>
+<div id="content">
+        	<div id='menubox'>
+        				<p class='header main'><a href='http://lindat.mff.cuni.cz/services/teitok/index.php'>TEITOK</a></p>
+			<ul style='text-align: left' class='teitok'>
+			<li><a href='index.php?action=login'>Login</a></li>
+			<li><a href='http://lindat.mff.cuni.cz/services/teitok/index.php?action=corplist'>Available Corpora</a></li>
+			</ul>
+			<div style='margin-top: 0px; margin-bottom: 20px;'><span class=langon>EN</span> | <span class=langoff><a href='/services/teitok-live/evaldio/cs/index.php?action=db_residency_techdoc-cs'>CS</a></span></div><p class='header'><a href='index.php'>Evaldio</a></p><ul style='text-align: left'><li><a href='index.php?action=databases'>Databases</a><li><a href='index.php?action=browser'>Browse</a><li><a href='index.php?action=cqp'>Search</a><li><a target="repository" href='http://hdl.handle.net/11234/1-5731'>Download</a></ul><ul style='text-align: left'><li><a href='index.php?action=login' >Login</a></ul><hr style='opacity: 0.5; margin-top: 40px;'><p id=powby style='opacity: 0.5; font-size: smaller;'><span onClick="window.open('http://www.teitok.org/index.php', 'teitok');">Powered by <span style='font-family: Courier;'>&lt;TEI:TOK&gt;</span></span><br><span onClick="window.open('http://www.teitok.org/index.php?action=credits', 'teitok');">Maarten Janssen, 2014-</span></p>
+        	</div>
+    <div id="main">
+		<h1 id="technick&aacute;-dokumentace">Technical Documentation</h1>
+<p>This language corpus of spoken Czech produced by non-native speakers targets language level A2 (according to the CEFR), which is required for permanent residency in the Czech Republic. It is the result of a project carried out at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University. The corpus contains recordings of the oral part of the <a href="http://ujop.cuni.cz/cce">Certifikovan&aacute; zkou&scaron;ka z če&scaron;tiny pro cizince</a> (Czech Language Certificate Exam) at level A2. The recordings consist of dialogues between an examiner (a native speaker) and an exam candidate (a non-native speaker). We supplemented the recordings with transcripts and rich linguistic annotation. Some recordings have multiple transcripts by different annotators, which makes it possible to compare different transcripts of the same recording and to measure agreement in converting speech into written text.</p>
+<p>The corpus is published as a specialized public database and is freely available to the general public, the research community, teachers, and students. The database is integrated into the TEITOK system, which is maintained on the <a href="https://lindat.cz/">LINDAT/CLARIAH-CZ</a> platform.</p>
+<h2 id="teitok">TEITOK</h2>
+<p><a href="http://teitok.corpuswiki.org/">TEITOK</a> is a framework for creating, managing, and publishing annotated corpora. Its web interface is implemented in a combination of PHP and JavaScript. For our project, which combines recordings of speech with their transcripts, the key TEITOK functionality is its support for <a href="http://www.teitok.org/index.php?action=help&amp;id=wavesurfer">creating, displaying, and editing transcripts of recordings</a>. To work with the recording itself, TEITOK uses the JavaScript library <a href="http://wavesurfer-js.org/">wavesurfer</a>.</p>
+<h3 id="uložen&iacute;-dat">Data Storage</h3>
+<p>In TEITOK, corpus data is stored primarily as files. In this case, these are recordings in MP3 format; the core of the corpus, however, consists of files in the TEITOK format, which contain all transcripts and annotations, including metadata. These files are linked to the corresponding recordings.</p>
+<h3 id="struktura-souborů-teitok">Structure of TEITOK Files</h3>
+<p>The TEITOK format is an XML format that fully conforms to the <a href="https://www.tei-c.org/">Text Encoding Initiative (TEI)</a> standard, but takes a slightly different approach to tokenization. The TEITOK files in our database have the following structure:</p>
+<h4 id="hlavička-s-metadaty-teiheader">Metadata Header <code>&lt;teiHeader&gt;</code></h4>
+<ol type="1">
+<li><strong><code>&lt;fileDesc&gt;</code></strong> &ndash; File description
+<ul>
+<li><strong><code>&lt;titleStmt&gt;</code></strong>: Contains the file title and information about the authors and annotators.</li>
+<li><strong><code>&lt;editionStmt&gt;</code></strong>: Contains the version number.</li>
+<li><strong><code>&lt;publicationStmt&gt;</code></strong>: Publication details such as the publisher, publication date, and license.</li>
+<li><strong><code>&lt;sourceDesc&gt;</code></strong>: Description of the source recording and a link to it.</li>
+</ul>
+</li>
+<li><strong><code>&lt;encodingDesc&gt;</code></strong> &ndash; Encoding description
+<ul>
+<li><strong><code>&lt;projectDesc&gt;</code></strong>: Brief description of the project in which the data was created.</li>
+<li><strong><code>&lt;annotationDecl&gt;</code></strong>: Details of the individual annotation steps (primary annotation, revision, linguistic annotation).</li>
+</ul>
+</li>
+<li><strong><code>&lt;profileDesc&gt;</code></strong> &ndash; Text profile
+<ul>
+<li><strong><code>&lt;langUsage&gt;</code></strong>: Language used (Czech).</li>
+<li><strong><code>&lt;textClass&gt;</code></strong>: Document metadata:
+<ul>
+<li><code>database</code>: Name of the database.</li>
+<li><code>exam-id</code>: Exam identifier.</li>
+<li><code>cefr-level</code>: CEFR level. This database contains only recordings of A2-level exams.</li>
+<li><code>task-number</code>: Task number.</li>
+<li><code>preannot-source</code>: Source of the preliminary annotation.</li>
+<li><code>annotator</code>: Annotator code.</li>
+<li><code>canonical</code>: The value <code>1</code> marks a canonical transcript.</li>
+</ul>
+</li>
+</ul>
+</li>
+</ol>
+<h4 id="hlavn&iacute;-obsah-text">Main Content <code>&lt;text&gt;</code></h4>
+<p>The <code>&lt;text&gt;</code> section contains the individual segments of speech, structured by the following elements:</p><ul><li><strong><code>&lt;u&gt;</code></strong>: Each <code>&lt;u&gt;</code> element represents one segment of speech (a turn) and has the following attributes:<ul><li><code>start</code> and <code>end</code>: Start and end time in seconds.</li><li><code>who</code>: The speaker (e.g. &ldquo;EXAM_1&rdquo; for the examiner and &ldquo;CAND_1&rdquo; for the candidate).</li></ul></li><li><strong><code>&lt;s&gt;</code></strong>: Each sentence is marked with an <code>&lt;s&gt;</code> element.</li><li><strong><code>&lt;tok&gt;</code></strong>: Token elements whose attributes describe the lemma, part of speech, morphological features, and syntactic relation.</li><li><strong><code>&lt;anon/&gt;</code></strong>: An anonymized segment of the recording.</li><li><strong><code>&lt;gap reason="unintelligible"/&gt;</code></strong>: An unintelligible segment of the recording.</li></ul>
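+<p>As an illustration of the structure described above, a single turn might be encoded roughly as in the following hand-written sketch. It is not taken from the corpus: the <code>id</code> values, the timestamps, the placement of <code>&lt;anon/&gt;</code> and <code>&lt;gap/&gt;</code> inside the sentence, and the omission of <code>head</code> on the root token are all assumptions, and the <code>xpos</code> and <code>feats</code> attributes are left out for brevity.</p>
+<pre><code>&lt;u id="u-1" who="CAND_1" start="12.40" end="15.85"&gt;
+  &lt;s id="s-1"&gt;
+    &lt;tok id="w-1" lemma="jmenovat" upos="VERB" deprel="root"&gt;Jmenuji&lt;/tok&gt;
+    &lt;tok id="w-2" lemma="se" upos="PRON" head="w-1" deprel="expl:pv"&gt;se&lt;/tok&gt;
+    &lt;anon/&gt;
+    &lt;gap reason="unintelligible"/&gt;
+  &lt;/s&gt;
+&lt;/u&gt;</code></pre>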
+<h3 id="př&iacute;prava-souborů-teitok">Preparation of TEITOK Files</h3>
+<p>The TEITOK files were prepared in several phases:</p>
+<ol type="1">
+<li><strong>Preliminary annotation</strong>. As part of the research connected with building the database, we compared direct manual annotation with manual post-editing of the output of automatic speech recognition systems. Manual annotation can therefore start from an automatically prepared preliminary annotation. We distinguish the source of the preliminary annotation with the <code>preannot-source</code> attribute, which can take the following values:
+<ul>
+<li><code>from_scratch</code>: Fully manual annotation, i.e. the preliminary annotation is empty.</li>
+<li><code>from_whisperX</code>: Preliminary annotation obtained with the <a href="https://github.com/m-bain/whisperX">WhisperX</a> system.</li>
+<li><code>from_mixed</code>: Preliminary annotation obtained by randomly combining the outputs of four systems at the level of turns.</li>
+</ul>
+</li>
+</ol>
+<p>When the preliminary annotation was not empty, we converted it into a basic version of the TEITOK format. At the end of this phase, it thus contained transcripts divided into turns (<code>&lt;u&gt;</code> elements), the assignment of speakers to turns (the <code>who</code> attribute), and time alignment with the recording (the <code>start</code> and <code>end</code> attributes).</p>
+<ol start="2" type="1">
+<li>
+<p><strong>Manu&aacute;ln&iacute; anotace</strong>. Po nahr&aacute;n&iacute; souborů provedly za&scaron;kolen&eacute; anot&aacute;torky manu&aacute;ln&iacute; anotaci v prostřed&iacute; TEITOK, během n&iacute;ž vytv&aacute;řely nebo opravovaly přepisy, přiřazovaly mluvč&iacute; k replik&aacute;m a pomoc&iacute; časov&yacute;ch značek zarovn&aacute;valy repliky s nahr&aacute;vkou. Nahr&aacute;vky byly anonymizov&aacute;ny v souladu s požadavky &Uacute;stavu jazykov&eacute; a odborn&eacute; př&iacute;pravy Univerzity Karlovy (&Uacute;JOP UK), kter&yacute; audionahr&aacute;vky pro korpus poskytl. Někter&eacute; anot&aacute;torky z opatrnosti anonymizovaly i &uacute;daje, kter&eacute; anonymizov&aacute;ny b&yacute;t nemusely (např. smy&scaron;len&aacute; jm&eacute;na osob).</p>
+</li>
+<li>
+<p><strong>Revize</strong>. Ručn&iacute; kontrola manu&aacute;ln&iacute;ch anotac&iacute; spoluautorkou datab&aacute;ze.</p>
+</li>
+<li>
+<p><strong>Normalizace</strong>. Automatick&aacute; &uacute;prava přepisů, kter&aacute; odstran&iacute; odchylky ve jm&eacute;nech mluvč&iacute;ch, seřad&iacute; repliky podle poč&aacute;tečn&iacute;ho času a přiděl&iacute; replik&aacute;m nov&eacute; sekvenčn&iacute; ID.</p>
+</li>
+<li>
+<p><strong>Rozdělen&iacute; na &uacute;lohy a selekce</strong>. Poskytovatel nahr&aacute;vek (&Uacute;JOP UK) povolil ke zveřejněn&iacute; pouze vybran&eacute; &uacute;lohy. Ty jsme museli z nahr&aacute;vek vystřihnout a upravit časov&eacute; značky v přepisech, aby se zachovalo zarovn&aacute;n&iacute; replik v přepisu s nahr&aacute;vkou. Pro stř&iacute;h&aacute;n&iacute; nahr&aacute;vky jsme použili n&aacute;stroj <a href="https://www.ffmpeg.org/">FFmpeg</a>.</p>
+</li>
+<li>
+<p><strong>Lingvistick&aacute; anotace</strong>. Až do t&eacute;to f&aacute;ze nebyly repliky v přepisech d&aacute;le strukturov&aacute;ny. V t&eacute;to f&aacute;zi jsme text rozdělili na věty (element <code>&lt;s&gt;</code>) a n&aacute;sledně věty na tokeny (elementy <code>&lt;tok&gt;</code>). Na &uacute;rovni tokenů jsou přepisy automaticky lingvisticky anotov&aacute;ny. Každ&eacute;mu tokenu je přiděleno lemma (atribut <code>lemma</code>), jazykově specifick&aacute; morfologick&aacute; značka (atribut <code>xpos</code>), slovn&iacute; druh a morfologick&eacute; vlastnosti dle kategorizace projektu <a href="https://universaldependencies.org/">Universal Dependencies</a> (atributy <code>upos</code> a <code>feats</code>). D&aacute;le je každ&eacute;mu tokenu přiřazen odkaz na ID rodiče podle pravidel z&aacute;vislostn&iacute; syntaxe (atribut <code>head</code>) a typ z&aacute;vislosti tokenu ve vztahu k jeho rodiči (atribut <code>deprel</code>). Pro lingvistickou anotaci, včetně tokenizace, jsme použili n&aacute;stroj <a href="https://ufal.mff.cuni.cz/udpipe/2">UDPipe 2</a>, konkr&eacute;tně model <code>czech-pdt-ud-2.12-230717</code> pro če&scaron;tinu. Ačkoli je možn&eacute; prov&aacute;dět tokenizaci a automatickou lingvistickou anotaci př&iacute;mo v prostřed&iacute; TEITOK, my jsme tento proces realizovali samostatně. Důvodem je, že metoda tokenizace v prostřed&iacute; TEITOK se li&scaron;&iacute; od t&eacute;, kter&aacute; je optimalizov&aacute;na pro UDPipe, což by mohlo způsobovat chyby při spojov&aacute;n&iacute; těchto dvou kroků.</p>
+</li>
+<li>
+<p><strong>Doplněn&iacute; hlavičky TEI</strong>. Na z&aacute;věr jsme doplnili hlavičku podle v&scaron;ech dostupn&yacute;ch metadat, aby odpov&iacute;dala standardům TEI.</p>
+</li>
+</ol>
+<p>V&scaron;echy n&aacute;stroje a skripty (přev&aacute;žně v jazyc&iacute;ch Python 3 a BASH) jsou k dispozici ve <a href="https://github.com/ufal/evaldio">veřejn&eacute;m repozit&aacute;ři projektu</a> v adres&aacute;ři <code>data_preparation</code>.</p>
+<h3 id="dotazov&aacute;n&iacute;-vyhled&aacute;v&aacute;n&iacute;-a-filtrov&aacute;n&iacute;">Dotazov&aacute;n&iacute;, vyhled&aacute;v&aacute;n&iacute; a filtrov&aacute;n&iacute;</h3>
+<p>Rychl&eacute; dotazov&aacute;n&iacute;, vyhled&aacute;v&aacute;n&iacute; a filtrace jsou umožněny integrovan&yacute;m <a href="https://cwb.sourceforge.io/files/CQP_Manual.pdf">procesorem dotazů CQP</a>, kl&iacute;čovou komponentou sady n&aacute;strojů <a href="https://cwb.sourceforge.io/">IMS Open Corpus Workbench (CWB)</a>. CQP přev&aacute;d&iacute; korpusy ve form&aacute;tu XML do bin&aacute;rn&iacute; podoby a efektivně je indexuje. Dotazov&aacute;n&iacute; v indexovan&yacute;ch korpusech prob&iacute;h&aacute; pomoc&iacute; jazyka <a href="https://www.cambridge.org/sketch/help/userguides/CQL%20Help%201.3.pdf">CQL</a>, kter&yacute; je standardem v korpusov&eacute; lingvistice. TEITOK tak&eacute; nab&iacute;z&iacute; Query builder, v němž může uživatel specifikovat dotaz vyplněn&iacute;m formul&aacute;ře. V&yacute;sledek dotazu vr&aacute;cen&yacute; z CQP je n&aacute;sledně zpracov&aacute;n pomoc&iacute; TEITOKu a zobrazen uživateli v přehledn&eacute; formě. V&yacute;sledky dotazů je možn&eacute; st&aacute;hnout ve form&aacute;tu XML.</p>
+	</div>
+</div>
+
+</div>
+
+
+
+<div class="lindat-common2 lindat-common-footer">
+ <footer data-version="3.0.5" data-build="05eff1186f12528f221a63b021c7b7dc81301429">
+    
+      <div id="about-lindat">
+        <h4><a href="https://lindat.cz/sites/default/files/2021-01/lindat_clariah_flyer.pdf">LINDAT/CLARIAH-CZ</a></h4>
+        <ul>
+          
+          <li><a href="https://lindat.cz/files/mission-en.pdf">Mission Statement</a></li>
+          
+          <li><a href="https://lindat.cz/ab">Advisory Board</a></li>
+          
+          <li><a href="https://lindat.cz/events">Events</a></li>
+          
+          <li><a href="https://www.clarin.eu/">CLARIN Participation</a></li>
+          
+          <li><a href="https://www.dariah.eu/">DARIAH Participation</a></li>
+          <br/>
+          <li><a href="https://lindat.cz/faq-repository">FAQ</a></li>
+          
+          <li><a href="mailto:lindat-help@ufal.mff.cuni.cz">Helpdesk</a></li>
+          
+          <li><a href="https://lindat.cz/user_feedback">User Feedback Form</a></li>
+          <br/>
+          <li><a href="https://lindat.cz/acknowledgement">Acknowledge LINDAT/CLARIAH-CZ</a></li>
+          
+        </ul>
+      </div>
+      
+      <div id="about-partners">
+        <h4><a href="https://lindat.cz/partners">Partners</a></h4>
+        <ul>
+          
+            <li>Charles University
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/mff-uk">Faculty <i>of</i> Mathematics <i>and</i> Physics</a></li>
+          
+          <li><a href="https://lindat.cz/partners/ff-uk">Faculty <i>of</i> Arts</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>Masaryk University
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/ff-mu">Faculty <i>of</i> Arts</a></li>
+          
+          <li><a href="https://lindat.cz/partners/fi-mu">Faculty  <i>of</i> Informatics</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>University of West Bohemia
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/zcu">Faculty <i>of</i> Applied Sciences</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>Czech Academy of Sciences
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/ujc">Czech Language Institute</a></li>
+          
+          <li><a href="https://lindat.cz/partners/knav">Library <i>of</i> Academy</a></li>
+          
+          <li><a href="https://lindat.cz/partners/hu">Institute <i>of</i> History</a></li>
+          
+          <li><a href="https://lindat.cz/partners/flu">Institute <i>of</i> Philosophy</a></li>
+          
+                </ul>
+            </li>
+          
+            <li>Archives, Libraries and Galleries
+                <ul>
+                    
+          <li><a href="https://lindat.cz/partners/nk">National Library <i>of the Czech Republic</i></a></li>
+          
+          <li><a href="https://lindat.cz/partners/mzk">Moravian Library <i>in Brno</i></a></li>
+          
+          <li><a href="https://lindat.cz/partners/ng">National Gallery Prague</a></li>
+          
+          <li><a href="https://lindat.cz/partners/nfa">National Film Archive</a></li>
+          
+                </ul>
+            </li>
+          
+        </ul>
+      </div>
+      
+      <div id="about-website">
+        <h4><a href="https://lindat.cz/services">Services</a></h4>
+        <ul>
+          
+          <li><a href="https://lindat.mff.cuni.cz/en/monitoring">Service Status</a></li>
+          
+          <li><a href="https://lindat.mff.cuni.cz/repository/xmlui/page/about?locale-attribute=en">About and Policies</a></li>
+          
+          <li><a href="https://lindat.mff.cuni.cz/en/terms-of-use">Terms of Use</a></li>
+          
+        </ul>
+      </div>
+      
+
+    <div id="badges-a">
+        <a href="https://www.clarin.eu/content/certified-centres"><img src="https://lindat.mff.cuni.cz/images/b-centre.png" alt="CLARIN CENTRE B" /></a>
+        <a href="https://www.clarin.eu/content/knowledge-centres"><img src="https://lindat.mff.cuni.cz/images/k-centre.png" alt="CLARIN CENTRE K" style="filter:brightness(0.88)" /></a>
+        <a href="https://www.coretrustseal.org/wp-content/uploads/2019/08/LINDAT-CLARIN.pdf"><img src="https://lindat.mff.cuni.cz/images/core-trust-seal-mono.png" alt="CoreTrustSeal Certification" /></a>
+    </div>
+
+    <div id="badges-b">
+        <a href="https://twitter.com/lindatclariahcz">Follow us on Twitter <img src="https://lindat.mff.cuni.cz/images/twitter-circular.svg" alt="Link to Profile" /></a>
+        <a href="https://lindat.cz/user/login"><img src="https://lindat.mff.cuni.cz/sites/default/files/LINDAT-CLARIAH-cz-gray_0.svg" alt="Home Page" /></a>
+    </div>
+
+    <div id="ack-msmt">
+        THE LINDAT/CLARIAH-CZ PROJECT (LM2018101; formerly LM2010013, LM2015071) IS FULLY SUPPORTED BY THE MINISTRY OF EDUCATION, SPORTS AND YOUTH OF THE CZECH REPUBLIC UNDER THE&#160;PROGRAMME LM OF "LARGE INFRASTRUCTURES"
+    </div>
+    <div id="ack-freepik">Icons ©  Smashicons and Freepik from flaticon.com licensed by <a href="https://creativecommons.org/licenses/by/3.0/">CC 3.0 BY</a></div>
+    <div id="ack-ufal">website © 2022 by <a href="https://ufal.mff.cuni.cz/">ÚFAL</a></div>
+    
+  <!-- TRACKING CODE -->
+
+  <script type="text/javascript">
+    //<![CDATA[
+    
+    (function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+        (i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+      m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+    })(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+    // main LINDAT/CLARIAH-CZ tracker
+    ga('create', 'UA-27008245-2', 'cuni.cz');
+    ga('send', 'pageview');
+      
+    //]]>
+  </script>
+
+  <!-- Piwik LINDAT/CLARIAH-CZ tracker -->
+  <script type="text/javascript">
+    //<![CDATA[
+    
+    var _paq = _paq || [];
+    _paq.push(["setDocumentTitle", document.domain + "/" + document.title]);
+    _paq.push(["setCookieDomain", "*.mff.cuni.cz"]);
+    _paq.push(["setDomains", ["*.mff.cuni.cz"]]);
+    _paq.push(['setCustomVariable', 1, "source", "common-theme", "page"]);
+    _paq.push(['trackPageView']);
+    _paq.push(['enableLinkTracking']);
+    (function() {
+      var u='//lindat.mff.cuni.cz/piwik/';
+      _paq.push(['setTrackerUrl', u+'piwik.php']);
+      _paq.push(['setSiteId', 2]);
+      var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
+      g.type='text/javascript'; g.async=true; g.defer=true; g.src=u+'piwik.js'; s.parentNode.insertBefore(g,s);
+    })();
+      
+    //]]>
+  </script>
+  <noscript><p><img src="//lindat.mff.cuni.cz/piwik/piwik.php?idsite=2" style="border:0;" alt="" /></p></noscript>
+  <!-- End Piwik Code -->
+  <!-- End TRACKING CODE -->
+      
+</footer>
+</div>
+    
+
+
+
+</body>
+</html>
diff --git a/data_preparation/70.releasing/html/db_residency_techdoc.html b/data_preparation/70.releasing/html/db_residency_techdoc.html
new file mode 100644
index 0000000..722c00c
--- /dev/null
+++ b/data_preparation/70.releasing/html/db_residency_techdoc.html
@@ -0,0 +1,394 @@
+
+<!DOCTYPE html>
+<html>
+<head>
+<title>Evaldio</title>
+<meta charset="utf-8" />
+<meta name="viewport" content="width=device-width, initial-scale=1">
+<link href='https://fonts.googleapis.com/css?family=Cousine:400|Roboto:300,400,400italic,700,700italic|Roboto+Condensed:400,700&amp;subset=latin,latin-ext' rel='stylesheet' type='text/css'>
+<link href='//lindat.mff.cuni.cz/services/teitok-live/themes/lindat/css/font-awesome.min.css' rel='stylesheet' type='text/css'>
+
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/services/teitok/css/common.css" />
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/services/teitok/css/view.css" />
+<link rel="stylesheet" type="text/css" href="//lindat.mff.cuni.cz/aai/discojuice/discojuice.css" />
+
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/media-dark.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/Scripts/teitok.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/xmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="Resources/xmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="/services/teitok/css/htmlstyles.css" media="screen">
+  <link rel="stylesheet" type="text/css" href="Resources/htmlstyles.css" media="screen">
+
+<!-- plug-ins dependencies -->
+<script type="text/javascript" src="//code.jquery.com/jquery-3.3.1.min.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/aai/discojuice/discojuice-2.1.en.min.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/aai/aai.js"></script>
+<script type="text/javascript" src="//lindat.mff.cuni.cz/services/teitok/ufal/idp.js"></script>
+<!-- --------------------- -->
+
+</head>
+<body>
+
+
+<div class="lindat-common2 lindat-common-header">
+<header data-version="3.0.5" data-build="05eff1186f12528f221a63b021c7b7dc81301429">
+    <nav class="lindat-navbar lindat-navbar-expand-lg lindat-justify-content-between lindat-navbar-dark ">
+        <div class="lindat-block lindat-block--clariah-theme-branding">
+            <a href="https://lindat.mff.cuni.cz/" class="lindat-navbar-brand lindat-d-flex lindat-align-items-center " aria-label="">
+                <img src="https://lindat.mff.cuni.cz/sites/default/files/LINDAT-CLARIAH-cz-gray_0.svg" width="auto" height="53" style="height: 53px !important;" alt="LINDAT/CLARIAH-CZ logo" class="" />
+            </a>
+        </div>
+        <button class="lindat-navbar-toggler" type="button" data-toggle="collapse" data-target=".lindat-navbar-collapse" aria-controls="lindat-navbar-collapse" aria-expanded="false" aria-label="Toggle navigation"
+                onclick="this.parentNode.querySelector('.lindat-navbar-toggler+div.lindat-collapse.lindat-navbar-collapse').classList.toggle('lindat-show')">
+            <span class="lindat-navbar-toggler-icon"></span>
+        </button>
+        <div class="lindat-collapse lindat-navbar-collapse">
+            <div class="">
+                <div class="lindat-block lindat-block--clariah-theme-main-menu">
+                    <ul class="lindat-nav lindat-navbar-nav">
+                        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#search" class="lindat-nav-link "
+                                    
+                                    
+                                    >Search</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.mff.cuni.cz/repository/xmlui/?locale-attribute=en" class="lindat-nav-link "
+                                    
+                                    
+                                    >Catalogue</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#education" class="lindat-nav-link "
+                                    
+                                    
+                                    >Education</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#projects" class="lindat-nav-link "
+                                    
+                                    
+                                    >Projects</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/#tools" class="lindat-nav-link "
+                                    
+                                    
+                                    >Tools</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item ">
+              <a href="https://lindat.cz/en/services" class="lindat-nav-link "
+                                    
+                                    
+                                    >Services</a>
+              
+          </li>
+        
+          <li class="lindat-nav-item lindat-dropdown">
+              <a href="https://lindat.cz/" class="lindat-nav-link lindat-dropdown-toggle"
+                                     data-toggle="dropdown"
+                                     onclick="this.parentNode.querySelector('.lindat-dropdown-toggle+div.lindat-dropdown-menu').classList.toggle('lindat-show'); return false;"
+                                    >About</a>
+              <div class="lindat-dropdown-menu">
+               <a href="https://lindat.cz/partners" class="lindat-dropdown-item">Partners</a>
+            
+               <a href="https://lindat.cz/files/mission-en.pdf" class="lindat-dropdown-item">Mission Statement</a>
+            
+               <a href="https://www.clarin.eu/" class="lindat-dropdown-item">CLARIN</a>
+            
+               <a href="https://www.dariah.eu/" class="lindat-dropdown-item">DARIAH</a>
+            
+               <a href="https://lindat.cz/integration" class="lindat-dropdown-item">Service integrations</a>
+            
+               <a href="https://lindat.cz/partnership" class="lindat-dropdown-item">Project partnerships</a>
+            </div>
+          </li>
+        
+                    </ul>
+                </div>
+            </div>
+            <div class="lindat-block lindat-block--clariah-theme-account-menu">
+                <ul class="lindat-nav lindat-navbar-nav">
+                    <li class="lindat-nav-item" id="margin-filler"></li>
+                    <li class="lindat-nav-item  ">
+                        <a class="lindat-nav-link lindat-nav-link-dariah" href="https://www.dariah.eu/"><img src="https://lindat.mff.cuni.cz/images/dariah-eu.png" alt="DARIAH logo" /></a>
+                    </li>
+                    <li class="lindat-nav-item  ">
+                        <a class="lindat-nav-link lindat-nav-link-clarin" href="https://www.clarin.eu/"><img src="https://lindat.mff.cuni.cz/images/clarin.png" alt="CLARIN logo" /></a>
+                    </li>
+                </ul>
+            </div>
+            <slot name="languageswitcher"></slot>
+        </div>
+    </nav>
+</header>
+</div>
+    
+
+
+<div id="localization-bar"></div>
+<div id="content">
+        	<div id='menubox'>
+        				<p class='header main'><a href='http://lindat.mff.cuni.cz/services/teitok/index.php'>TEITOK</a></p>
+			<ul style='text-align: left' class='teitok'>
+			<li><a href='index.php?action=login'>Login</a></li>
+			<li><a href='http://lindat.mff.cuni.cz/services/teitok/index.php?action=corplist'>Available Corpora</a></li>
+			</ul>
+			<div stle='margin-top: 0px; margin-bottom: 20px;'><span class=langon>EN</span> | <span class=langoff><a href='/services/teitok-live/evaldio/cs/index.php?action=db_residency_techdoc'>CS</a></span></div><p class='header'><a href='index.php'>Evaldio</a></p><ul style='text-align: left'><li><a href='index.php?action=databases'>Databases</a><li><a href='index.php?action=browser'>Browse</a><li><a href='index.php?action=cqp'>Search</a><li><a target="repository" href='http://hdl.handle.net/11234/1-5731'>Download</a></ul><ul style='text-align: left'><li><a href='index.php?action=login' >Login</a></ul><hr style='opacity: 0.5; margin-top: 40px;'><p id=powby style='opacity: 0.5; font-size: smaller;'><span onClick="window.open('http://www.teitok.org/index.php', 'teitok');">Powered by <span style='font-family: Courier;'>&lt;TEI:TOK&gt;</span></span><br><span onClick="window.open('http://www.teitok.org/index.php?action=credits', 'teitok');">Maarten Janssen, 2014-</a></p>
+        	</div>
+    <div id="main">
+		<h1 id="technical-documentation">Technical Documentation</h1>
+<p>The language corpus of spoken performances by non-native speakers of Czech, focused on the A2 language level (according to the CEFR), required for obtaining permanent residency in the Czech Republic, is the result of a project implemented at the Institute of Formal and Applied Linguistics of the Faculty of Mathematics and Physics, Charles University. The corpus contains recordings capturing the oral part of the <a href="https://ujop.cuni.cz/UJOPEN-70.html?ujopcmsid=12:czech-language-certificate-exam-cce">Czech Language Certificate Exam</a> at the A2 level. The recordings include dialogues between the examiner (a native speaker) and the candidate (a non-native speaker). We have provided transcriptions of the recordings, enriched with extensive linguistic annotations. Some recordings are accompanied by multiple transcriptions from different annotators, allowing for comparisons of various transcriptions of the same recording and the assessment of the degree of agreement when converting spoken language into written text.</p>
+<p>The corpus is published as a specialized public database and is freely accessible to the general public, the scientific community, educators, and students. The database is integrated into the TEITOK system, managed on the <a href="https://lindat.cz/">LINDAT/CLARIAH-CZ</a> platform.</p>
+<h2 id="teitok">TEITOK</h2>
+<p><a href="http://teitok.corpuswiki.org/">TEITOK</a> is a framework for creating, managing, and publishing annotated corpora. Its web interface is implemented using a combination of PHP and JavaScript. For our project, which combines speech recordings with their transcriptions, the key functionality of the TEITOK environment is the ability to <a href="http://www.teitok.org/index.php?action=help&amp;id=wavesurfer">create, display, and edit transcriptions of recordings</a>. To work with the recordings themselves, TEITOK uses the JavaScript library <a href="http://wavesurfer-js.org/">wavesurfer</a>.</p>
+<h3 id="data-storage">Data Storage</h3>
+<p>The corpus data is primarily stored in the TEITOK environment in the form of files. In this case, the recordings are in MP3 format, while the main components are TEITOK format files, which contain all transcriptions and annotations, including metadata. These files are interconnected with the corresponding recordings.</p>
+<h3 id="structure-of-teitok-files">Structure of TEITOK Files</h3>
+<p>The TEITOK format is an XML format that fully complies with the <a href="https://www.tei-c.org/">Text Encoding Initiative (TEI)</a> standards, but with a slightly different approach to tokenization. The structure of TEITOK files in our database is as follows:</p>
+<h4 id="header-with-metadata-teiheader">Header with Metadata <code>&lt;teiHeader&gt;</code></h4>
+<ol type="1">
+<li><strong><code>&lt;fileDesc&gt;</code></strong> &ndash; File description
+<ul>
+<li><strong><code>&lt;titleStmt&gt;</code></strong>: Contains the title of the file and information about authors and annotators.</li>
+<li><strong><code>&lt;editionStmt&gt;</code></strong>: Contains the version number.</li>
+<li><strong><code>&lt;publicationStmt&gt;</code></strong>: Publication details, such as publisher, release date, and license.</li>
+<li><strong><code>&lt;sourceDesc&gt;</code></strong>: Description of the source recording and a link to it.</li>
+</ul>
+</li>
+<li><strong><code>&lt;encodingDesc&gt;</code></strong> &ndash; Description of encoding
+<ul>
+<li><strong><code>&lt;projectDesc&gt;</code></strong>: A brief description of the project under which the data was created.</li>
+<li><strong><code>&lt;annotationDecl&gt;</code></strong>: Details of the individual annotation steps (primary, revision, linguistic annotation).</li>
+</ul>
+</li>
+<li><strong><code>&lt;profileDesc&gt;</code></strong> &ndash; Profile of the text
+<ul>
+<li><strong><code>&lt;langUsage&gt;</code></strong>: Language used (Czech).</li>
+<li><strong><code>&lt;textClass&gt;</code></strong>: Document metadata:
+<ul>
+<li><code>database</code>: Database name.</li>
+<li><code>exam-id</code>: Exam identifier.</li>
+<li><code>cefr-level</code>: CEFR level. This database contains recordings exclusively from A2 level exams.</li>
+<li><code>task-number</code>: Task number.</li>
+<li><code>preannot-source</code>: Source of preliminary annotation.</li>
+<li><code>annotator</code>: Annotator code.</li>
+<li><code>canonical</code>: A value of <code>1</code> indicates a canonical transcription.</li>
+</ul>
+</li>
+</ul>
+</li>
+</ol>
+<h4 id="main-content-text">Main Content <code>&lt;text&gt;</code></h4>
+<p>The <code>&lt;text&gt;</code> section contains individual segments of speech structured using <code>&lt;u&gt;</code> elements:</p>
+<ul>
+<li><strong><code>&lt;u&gt;</code></strong>: Each <code>&lt;u&gt;</code> element represents a segment of speech and has the following attributes:
+<ul>
+<li><code>start</code> and <code>end</code>: Start and end time in seconds.</li>
+<li><code>who</code>: Speaker (e.g., &ldquo;EXAM_1&rdquo; for the examiner and &ldquo;CAND_1&rdquo; for the candidate).</li>
+</ul>
+</li>
+<li><strong><code>&lt;s&gt;</code></strong>: Each sentence is marked with the <code>&lt;s&gt;</code> element.</li>
+<li><strong><code>&lt;tok&gt;</code></strong>: Token elements whose attributes describe lemma, part of speech, morphological features, and syntactic relations.</li>
+<li><strong><code>&lt;anon/&gt;</code></strong>: Anonymized segment of the recording.</li>
+<li><strong><code>&lt;gap reason="unintelligible"/&gt;</code></strong>: Unintelligible segment of the recording.</li>
+</ul>
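+<p>The structure described above can be read with a few lines of Python. The following is a minimal sketch, not the project's own code: the sample document is invented for illustration, it assumes that the elements carry no XML namespace, and it assumes that the word form is stored as the text content of each <code>&lt;tok&gt;</code> element.</p>

```python
import xml.etree.ElementTree as ET

# Invented sample following the element and attribute names described above.
SAMPLE = """<TEI>
  <text>
    <u who="EXAM_1" start="0.0" end="1.8">
      <s><tok lemma="dobrý" upos="ADJ">Dobrý</tok> <tok lemma="den" upos="NOUN">den</tok></s>
    </u>
    <u who="CAND_1" start="2.1" end="4.0">
      <s><tok lemma="já" upos="PRON">Já</tok> <tok lemma="být" upos="AUX">jsem</tok> <anon/></s>
    </u>
  </text>
</TEI>"""


def read_utterances(xml_text):
    """Return one dict per <u> element: speaker, times, tokens, anonymization flag."""
    root = ET.fromstring(xml_text)
    utterances = []
    for u in root.iter("u"):
        utterances.append({
            "who": u.get("who"),
            "start": float(u.get("start")),
            "end": float(u.get("end")),
            # Assumption: the word form is the text content of <tok>.
            "tokens": [tok.text or "" for tok in u.iter("tok")],
            # True if the utterance contains at least one <anon/> marker.
            "anonymized": u.find(".//anon") is not None,
        })
    return utterances


for utt in read_utterances(SAMPLE):
    print(utt["who"], utt["start"], utt["end"], " ".join(utt["tokens"]))
```

+<p>Files that declare a TEI namespace would need namespace-qualified tag names in the calls to <code>iter</code> and <code>find</code>.</p>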
+<h3 id="preparation-of-teitok-files">Preparation of TEITOK Files</h3>
+<p>The preparation of TEITOK files took place in several phases:</p>
+<ol type="1">
+<li><strong>Preliminary Annotation</strong>. In the research associated with the creation of the database, we compared direct manual annotation with manual post-editing of outputs from automatic speech recognition systems. Thus, manual annotation may be based on automatically prepared preliminary annotation. The source of the preliminary annotation is distinguished using the <code>preannot-source</code> attribute, which can have the following values:
+<ul>
+<li><code>from_scratch</code>: Completely manual annotation, i.e., the preliminary annotation is empty.</li>
+<li><code>from_whisperX</code>: Preliminary annotation obtained using the <a href="https://github.com/m-bain/whisperX">WhisperX</a> system.</li>
+<li><code>from_mixed</code>: Preliminary annotation obtained by randomly combining outputs from four systems at the level of utterances.</li>
+</ul>
+<p>When the preliminary annotation was not empty, we converted it into the basic version of the TEITOK format. At the end of this phase, it thus contained transcriptions divided into utterances (the <code>&lt;u&gt;</code> elements), the assignment of speakers to utterances (the <code>who</code> attribute), and time alignment with the recording (the <code>start</code> and <code>end</code> attributes).</p>
+</li>
+<li><strong>Manual Annotation</strong>. After uploading the files, trained annotators performed manual annotation in the TEITOK environment, during which they created or corrected transcriptions, assigned speakers to utterances, and aligned utterances with the recording using timestamps. The recordings were anonymized in accordance with the requirements of the Institute for Language and Preparatory Studies of Charles University (ILPS CU), which provided the audio recordings for the corpus. Some annotators, out of caution, anonymized even data that did not need to be anonymized (e.g., fictitious names).</li>
+<li><strong>Revision</strong>. Manual review of the manual annotations by a co-author of the database.</li>
+<li><strong>Normalization</strong>. Automatic adjustment of transcriptions that removes discrepancies in speaker names, orders utterances according to start time, and assigns new sequential IDs to utterances.</li>
+<li><strong>Segmentation by Tasks and Selection</strong>. The provider of the recordings (ILPS CU) permitted the publication of only selected tasks. We had to cut these from the recordings and adjust timestamps in the transcriptions to preserve the alignment of utterances in the transcription with the recording. We used the <a href="https://www.ffmpeg.org/">FFmpeg</a> tool for cutting the recordings.</li>
+<li><strong>Linguistic Annotation</strong>. Until this phase, the utterances in the transcriptions had not been further structured. In this phase, we divided the text into sentences (the <code>&lt;s&gt;</code> element) and then divided the sentences into tokens (the <code>&lt;tok&gt;</code> elements). At the token level, the transcriptions are automatically linguistically annotated. Each token is assigned a lemma (the <code>lemma</code> attribute), a language-specific morphological tag (the <code>xpos</code> attribute), and a part of speech and morphological features according to the categorization of the <a href="https://universaldependencies.org/">Universal Dependencies</a> project (the <code>upos</code> and <code>feats</code> attributes). Additionally, each token is assigned a reference to the ID of its parent according to dependency syntax rules (the <code>head</code> attribute) and the type of dependency between the token and its parent (the <code>deprel</code> attribute). For linguistic annotation, including tokenization, we used the <a href="https://ufal.mff.cuni.cz/udpipe/2">UDPipe 2</a> tool, specifically the model <code>czech-pdt-ud-2.12-230717</code> for Czech. Although it is possible to perform tokenization and automatic linguistic annotation directly in the TEITOK environment, we carried out this process separately. The reason is that the tokenization method in the TEITOK environment differs from the one optimized for UDPipe, which could lead to errors when combining these two steps.</li>
+<li><strong>Completion of the TEI Header</strong>. Finally, we completed the header with all available metadata so that it complies with TEI standards.</li>
+</ol>
+<p>All tools and scripts (primarily in Python 3 and BASH) are available in the <a href="https://github.com/ufal/evaldio">public repository of the project</a> in the <code>data_preparation</code> directory.</p>
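+<p>To make the normalization and task-cutting phases more concrete, the sketch below shows one way they could be implemented. It is an illustration under stated assumptions, not the code from the repository: the function names, the <code>id</code> attribute, and the <code>u-</code> ID prefix are hypothetical, the removal of discrepancies in speaker names is left out, and the exact FFmpeg invocation used in the project is not documented here, so the command only uses the standard <code>-i</code>, <code>-ss</code>, and <code>-to</code> options.</p>

```python
import xml.etree.ElementTree as ET


def normalize_utterances(text_el, id_prefix="u-"):
    """Sort <u> elements by start time and give them new sequential IDs."""
    utterances = sorted(text_el.findall("u"), key=lambda u: float(u.get("start")))
    for u in utterances:
        text_el.remove(u)
    for index, u in enumerate(utterances, start=1):
        u.set("id", f"{id_prefix}{index}")  # hypothetical ID scheme
        text_el.append(u)


def ffmpeg_cut_command(src, dst, start, end):
    """Build an FFmpeg command that keeps only the span start..end (in seconds)."""
    return ["ffmpeg", "-i", src, "-ss", str(start), "-to", str(end), dst]


def cut_transcript(text_el, start, end):
    """Keep utterances lying inside start..end and shift their times by start."""
    for u in list(text_el.findall("u")):
        u_start, u_end = float(u.get("start")), float(u.get("end"))
        if u_start < start or u_end > end:
            text_el.remove(u)
            continue
        # After cutting, the task audio begins at 0, so subtract the cut offset.
        u.set("start", f"{u_start - start:.3f}")
        u.set("end", f"{u_end - start:.3f}")


text = ET.fromstring(
    '<text>'
    '<u who="CAND_1" start="12.5" end="14.0"/>'
    '<u who="EXAM_1" start="10.0" end="12.0"/>'
    '<u who="EXAM_1" start="30.0" end="31.0"/>'
    '</text>'
)
normalize_utterances(text)
cut_transcript(text, 10.0, 20.0)
for u in text.findall("u"):
    print(u.get("id"), u.get("who"), u.get("start"), u.get("end"))
```

+<p>The command list returned by <code>ffmpeg_cut_command</code> could be passed to <code>subprocess.run</code>; it is only built here, not executed, so the sketch runs without FFmpeg installed.</p>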
+<h3 id="querying-searching-and-filtering">Querying, Searching, and Filtering</h3>
+<p>Rapid querying, searching, and filtering are enabled by the integrated <a href="https://cwb.sourceforge.io/files/CQP_Manual.pdf">CQP Query Processor</a>, a key component of the <a href="https://cwb.sourceforge.io/">IMS Open Corpus Workbench (CWB)</a> toolkit. CQP converts XML-formatted corpora into binary format and efficiently indexes them. Querying in indexed corpora is conducted using the <a href="https://www.cambridge.org/sketch/help/userguides/CQL%20Help%201.3.pdf">CQL</a> language, which is a standard in corpus linguistics. TEITOK also offers a Query Builder, in which users can specify a query by filling out a form. The results of the query returned from CQP are subsequently processed using TEITOK and presented to the user in a clear format. Query results can be downloaded in XML format.</p>
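+<p>As an illustration of what a CQL query looks like, the sketch below assembles token patterns programmatically, much as a form-based query builder might. It is only a sketch: the helper names are hypothetical, and it assumes that the attributes indexed for searching carry the same names as the token attributes in the files (<code>lemma</code>, <code>upos</code>), which should be checked against the search interface.</p>

```python
def cql_token(**constraints):
    """Build one CQL token pattern, e.g. [lemma="být" & upos="AUX"]."""
    if not constraints:
        return "[]"  # matches any token
    parts = [f'{attr}="{value}"' for attr, value in constraints.items()]
    return "[" + " & ".join(parts) + "]"


def cql_sequence(*tokens):
    """Join token patterns into a query matching them as adjacent tokens."""
    return " ".join(tokens)


# An adjective immediately followed by a noun.
query = cql_sequence(cql_token(upos="ADJ"), cql_token(upos="NOUN"))
print(query)
```

+<p>The values are inserted verbatim, so a value containing a double quote or regular-expression metacharacters would need escaping before use.</p>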
+	</div>
+</div>
+
+</div>
+
+
+
+<div class="lindat-common2 lindat-common-footer">
+ <footer data-version="3.0.5" data-build="05eff1186f12528f221a63b021c7b7dc81301429">
+    
+
+    <div id="badges-a">
+        <a href="https://www.clarin.eu/content/certified-centres"><img src="https://lindat.mff.cuni.cz/images/b-centre.png" alt="CLARIN CENTRE B" /></a>
+        <a href="https://www.clarin.eu/content/knowledge-centres"><img src="https://lindat.mff.cuni.cz/images/k-centre.png" alt="CLARIN CENTRE K" style="filter:brightness(0.88)" /></a>
+        <a href="https://www.coretrustseal.org/wp-content/uploads/2019/08/LINDAT-CLARIN.pdf"><img src="https://lindat.mff.cuni.cz/images/core-trust-seal-mono.png" alt="CoreTrustSeal Certification" /></a>
+    </div>
+
+    <div id="ack-msmt">
+        THE LINDAT/CLARIAH-CZ PROJECT (LM2018101; formerly LM2010013, LM2015071) IS FULLY SUPPORTED BY THE MINISTRY OF EDUCATION, SPORTS AND YOUTH OF THE CZECH REPUBLIC UNDER THE&#160;PROGRAMME LM OF "LARGE INFRASTRUCTURES"
+    </div>
+    <div id="ack-freepik">Icons ©  Smashicons and Freepik from flaticon.com licensed by <a href="https://creativecommons.org/licenses/by/3.0/">CC 3.0 BY</a></div>
+    <div id="ack-ufal">website © 2022 by <a href="https://ufal.mff.cuni.cz/">ÚFAL</a></div>
+    
+      
+</footer>
+</div>
+    
+
+
+
+</body>
+</html>