Skip to content

Commit 638589f

Browse files
committed
corrected annotations and added to software list
1 parent 1ccd144 commit 638589f

14 files changed

+80
-66
lines changed

data/JTEI/10_2016-19/jtei-10-haaf-source.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -212,7 +212,7 @@
212212
well as <ref target="http://www.deutschestextarchiv.de/dtaq/about">collaborative text
213213
correction and annotation</ref><note rend="inside.parenthesis">See <bibl><title
214214
level="a"><ptr type="software" xml:id="R3"
215-
target="#dtaq"/><rs type="soft.name" ref="R3">DTAQ: Kollaborative Qualitätssicherung im Deutschen Textarchiv</rs></title>
215+
target="#dtaq"/><rs type="soft.name" ref="#R3">DTAQ: Kollaborative Qualitätssicherung im Deutschen Textarchiv</rs></title>
216216
(Collaborative Quality Assurance within the DTA), accessed January 28, 2017, <rs type="soft.url" ref="#R3"><ptr
217217
target="http://www.deutschestextarchiv.de/dtaq/about"/></rs></bibl>. On the process of
218218
quality assurance in the DTA, see, for example, <ref target="#haaf13" type="bibl">Haaf,

data/JTEI/11_2019-20/jtei-cc-ra-hannessschlaeger-164-source.xml

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -548,10 +548,9 @@
548548
type="soft.name" ref="#R3">GitHub</rs>).</p>
549549
<p>But the story did not end there. The freely available and processable collection of
550550
abstracts inspired Peter Andorfer, a colleague of the editors at the Austrian Centre for
551-
Digital Humanities, to use this text collection to built an <ptr type="software" xml:id="R12"
552-
target="#existdbpoweredwebapplication"/><rs type="soft.name" ref="#R12">eXistdb-powered web
553-
application</rs> (<rs type="soft.bib.ref" ref="#R12"><ref type="bibl" target="#andorfer17">Andorfer and Hannesschläger
554-
2017</ref></rs>). In the context of licensing issues, it is important to mention that
551+
Digital Humanities, to use this text collection to built an eXistdb-powered web
552+
application (<ref type="bibl" target="#andorfer17">Andorfer and Hannesschläger
553+
2017</ref>). In the context of licensing issues, it is important to mention that
555554
Andorfer was never approached by the editors or explicitly asked to process the TEI
556555
files, and he only informed the editors about the web application that he was building
557556
when it was already available online (as a <soCalled>work in progress</soCalled>, but

data/JTEI/13_2020-22/jtei-cc-ra-parisse-182-source.xml

Lines changed: 21 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -115,7 +115,7 @@
115115
format. Backward conversion is possible in many cases, with limitations inherent in the
116116
destination target format. <ptr type="software" xml:id="R8" target="#teicorpo"/>
117117
<rs type="soft.name" ref="#R8">TEICORPO</rs> can run the <ptr type="software" xml:id="R9"
118-
target="#treetager"/>
118+
target="#treetagger"/>
119119
<rs type="soft.name" ref="#R9">Treetagger</rs> part-of-speech tagger and the <ptr
120120
type="software" xml:id="R10" target="#stanfordcorenlp"/>
121121
<rs type="soft.name" ref="#R10">Stanford CoreNLP</rs> tools on TEI files and can export
@@ -231,15 +231,15 @@
231231
<div xml:id="similarities">
232232
<head>Similarities with and Differences from Other Approaches</head>
233233
<p>Many software packages dedicated to editing spoken language transcription contain
234-
utilities that can convert many formats: for example, <ptr type="software" xml:id="15"
234+
utilities that can convert many formats: for example, <ptr type="software" xml:id="R15"
235235
target="#exmaralda"/><rs type="soft.name" ref="#R15">EXMARaLDA</rs> (<rs
236236
type="Bib.Ref" target="#R15"><ref type="bibl" target="#schmidt2004">Schmidt 2004</ref>
237237
</rs>; see <rs type="URL" target="#R15"><ptr target="https://exmaralda.org"/></rs>),
238-
<ptr type="software" xml:id="16" target="#anvil"/>
238+
<ptr type="software" xml:id="R16" target="#anvil"/>
239239
<rs type="soft.name" ref="#R16">Anvil</rs> (<rs type="Bib.Ref" target="#R16">
240240
<ref type="bibl" target="#kipp2001">Kipp 2001</ref></rs>; see <rs type="URL"
241241
target="#R16"><ptr target="https://www.anvil-software.org"/></rs>), and <ptr
242-
type="software" xml:id="17" target="#elan"/><rs type="soft.name" ref="#17">ELAN</rs>
242+
type="software" xml:id="R17" target="#elan"/><rs type="soft.name" ref="#R17">ELAN</rs>
243243
(<rs type="bib.ref" target="#R17"><ref type="bibl" target="#wittenburg2006">Wittenburg
244244
et al. 2006</ref></rs>; see <rs type="URL" target="#R17">
245245
<ptr target="https://archive.mpi.nl/tla/elan"/></rs>). However, in all cases, the
@@ -257,7 +257,7 @@
257257
<p>The list of tools that are considered in the two projects is nearly the same. The only
258258
tools missing in the <ptr type="software" xml:id="R18" target="#teicorpo"/>
259259
<rs type="soft.name" ref="#R18">TEICORPO</rs> approach are <ptr type="software"
260-
xml:id="19" target="#exmaralda"/><rs type="soft.name" ref="#R19">EXMARaLDA</rs> and
260+
xml:id="R19" target="#exmaralda"/><rs type="soft.name" ref="#R19">EXMARaLDA</rs> and
261261
<ptr type="software" xml:id="R19" target="#folker"/>FOLKER (<rs type="bib.ref"
262262
target="#R19"><ref type="bibl" target="#schmidts2010">Schmidt and Schütte
263263
2010</ref></rs>; see <rs type="URL" target="#R19"><ptr
@@ -620,7 +620,7 @@
620620
tools, a single-level annotation structure within the <gi>spanGrp</gi> elements is
621621
insufficient to represent the complex organization that can be constructed with the
622622
<ptr type="software" xml:id="R78" target="#elan"/><rs type="soft.name" ref="#R78"
623-
>ELAN</rs> and <ptr type="software" xml:id="R78" target="#praat"/>
623+
>ELAN</rs> and <ptr type="software" xml:id="R79" target="#praat"/>
624624
<rs type="soft.name" ref="#R79">Praat</rs> tools. <ptr type="software" xml:id="R80"
625625
target="#elan"/><rs type="soft.name" ref="#R80">ELAN</rs> is a tool used by many
626626
researchers to describe data of greater complexity than the data presented in the
@@ -792,7 +792,7 @@
792792
<figure xml:id="fig4">
793793
<graphic url="media/image2.PNG" width="620px" height="980px"/>
794794
<head type="legend"><ptr type="software" xml:id="R98" target="#elan"/><rs
795-
type="soft.name" ref="#98">ELAN</rs> example of a temporal division</head>
795+
type="soft.name" ref="#R98">ELAN</rs> example of a temporal division</head>
796796
</figure>
797797
<figure xml:id="example_code_4">
798798
<egXML xmlns="http://www.tei-c.org/ns/Examples">
@@ -851,7 +851,7 @@
851851
corpora to be used with other editing tools, some of which are suited to specific
852852
processing: for example, <ptr type="software" xml:id="R104" target="#praat"/>
853853
<rs type="soft.name" ref="#R104">Praat</rs> for phonetics/phonology; <ptr
854-
type="software" xml:id="#R105" target="#transcriber"/>
854+
type="software" xml:id="R105" target="#transcriber"/>
855855
<rs type="soft.name" ref="#R105">Transcriber</rs>/<ptr type="software" xml:id="R106"
856856
target="#clan"/>
857857
<rs type="soft.name" ref="#R106">CLAN</rs> for raw transcription; and <ptr
@@ -1076,7 +1076,7 @@
10761076
<rs type="soft.name" ref="#R126">CLAN</rs> , <ptr type="software" xml:id="R127"
10771077
target="#elan"/><rs type="soft.name" ref="#R127">ELAN</rs>, <ptr type="software"
10781078
xml:id="R128" target="#praat"/>
1079-
<rs type="soft.name" ref="R128">Praat</rs>, <ptr type="software" xml:id="R129"
1079+
<rs type="soft.name" ref="#R128">Praat</rs>, <ptr type="software" xml:id="R129"
10801080
target="#transcriber"/>
10811081
<rs type="soft.name" ref="#R129">Transcriber</rs>, nor of course in TEI format.</p>
10821082
<p><ptr type="software" xml:id="R130" target="#teicorpo"/>
@@ -1094,7 +1094,7 @@
10941094
<rs type="soft.name" ref="#R134">TEICORPO</rs>: <ptr type="software" xml:id="R135"
10951095
target="#treetagger"/>
10961096
<rs type="soft.name" ref="#R135">TreeTagger</rs> and <ptr type="software" xml:id="R136"
1097-
target="#corenlp"/>
1097+
target="#stanfordcorenlp"/>
10981098
<rs type="soft.name" ref="#R136">CoreNLP</rs>.</p>
10991099
<div xml:id="treetagger">
11001100
<head><ptr type="software" xml:id="R138" target="#treetagger"/>
@@ -1118,11 +1118,11 @@
11181118
<rs type="soft.name" ref="#R140">TEICORPO</rs> should be used to generate an annotated
11191119
file with lemma and POS information based on <ptr type="software" xml:id="R141"
11201120
target="#treetagger"/>
1121-
<rs type="soft.name" ref="#141">TreeTagger</rs>. <ptr type="software" xml:id="142"
1121+
<rs type="soft.name" ref="#R141">TreeTagger</rs>. <ptr type="software" xml:id="R142"
11221122
target="#treetagger"/>
1123-
<rs type="soft.name" ref="#142">TreeTagger</rs> should be installed separately. The
1124-
implementation of <ptr type="software" xml:id="143" target="#treetagger"/>
1125-
<rs type="soft.name" ref="#143">TreeTagger</rs> in <ptr type="software" xml:id="R144"
1123+
<rs type="soft.name" ref="#R142">TreeTagger</rs> should be installed separately. The
1124+
implementation of <ptr type="software" xml:id="R143" target="#treetagger"/>
1125+
<rs type="soft.name" ref="#R143">TreeTagger</rs> in <ptr type="software" xml:id="R144"
11261126
target="#teicorpo"/>
11271127
<rs type="soft.name" ref="#R144">TEICORPO</rs> includes the ability to use any
11281128
syntactic model. For French data, we used the PERCEO model (<ref type="bibl"
@@ -1150,7 +1150,7 @@
11501150
<gi>filename</gi></p></cell>
11511151
<cell><p><gi>filename</gi> is the full location of the <ptr type="software"
11521152
xml:id="R146" target="#treetagger"/>
1153-
<rs type="soft.name" ref="#146">TreeTagger</rs> program, according to the system
1153+
<rs type="soft.name" ref="#R146">TreeTagger</rs> program, according to the system
11541154
used (Windows, MacOS, or Linux).</p></cell>
11551155
</row>
11561156
<row>
@@ -1163,7 +1163,7 @@
11631163
<p>The environment variable TREE_TAGGER can be used to locate the model and the program.
11641164
If no <code>-program</code> option is used, the default name for the <ptr
11651165
type="software" xml:id="R147" target="#treetagger"/>
1166-
<rs type="soft.name" ref="#147">TreeTagger</rs> program is used.</p>
1166+
<rs type="soft.name" ref="#R147">TreeTagger</rs> program is used.</p>
11671167
<p>The <code>-model</code> parameter is mandatory.</p>
11681168
<p>The resulting filename ends with <code>.tei_corpo_ttg.tei_corpo.xml</code> or a
11691169
specific name provided by the user (option <code>-o</code>).</p>
@@ -1279,10 +1279,10 @@
12791279
</div>
12801280
<div xml:id="stanford">
12811281
<head><ptr type="software" xml:id="R148" target="#stanfordcorenlp"/>
1282-
<rs type="soft.name" ref="#148">Stanford CoreNLP</rs></head>
1282+
<rs type="soft.name" ref="#R148">Stanford CoreNLP</rs></head>
12831283
<p><ptr type="software" xml:id="R149" target="#stanfordcorenlp"/>
1284-
<rs type="soft.name" ref="#149">The Stanford Core Natural Language Processing</rs><note>
1285-
<p>Accessed March 11, 2021, <rs type="url" ref="#149"><ptr
1284+
<rs type="soft.name" ref="#R149">The Stanford Core Natural Language Processing</rs><note>
1285+
<p>Accessed March 11, 2021, <rs type="url" ref="#R149"><ptr
12861286
target="https://nlp.stanford.edu/software/"/></rs>.</p>
12871287
</note> (<ptr type="software" xml:id="R150" target="#stanfordcorenlp"/>
12881288
<rs type="soft.name" ref="#R150">CoreNLP</rs>) package is a suite of tools (<rs
@@ -1437,7 +1437,7 @@
14371437
recent developments (see <ref type="bibl" target="#badin2021">Badin et al. 2021</ref>)
14381438
made it possible to insert metadata stored in CSV files (including participant metadata)
14391439
into the TEI files. This makes it possible to achieve more powerful corpus analysis
1440-
using a tool such as <ptr type="software" xml:id="R177" target="txm"/><rs
1440+
using a tool such as <ptr type="software" xml:id="R177" target="#txm"/><rs
14411441
type="soft.name" ref="#R177">TXM</rs>.</p>
14421442
<p>Our approach is somewhat similar to what is suggested in the conclusion of Schmidt,
14431443
Hedeland, and Jettka (<ref type="bibl" target="#schmidt2017">2017</ref>), who describe a
@@ -1465,7 +1465,7 @@
14651465
<div xml:id="conclusion">
14661466
<head>Conclusion</head>
14671467
<p><ptr type="software" xml:id="R183" target="#teicorpo"/>
1468-
<rs type="soft.name" ref="R183">TEICORPO</rs> is a functional tool, created by the CORLI
1468+
<rs type="soft.name" ref="#R183">TEICORPO</rs> is a functional tool, created by the CORLI
14691469
network and ORTOLANG, that converts files created by software specializing in editing
14701470
spoken-language data into TEI format. The result is fully compatible with the most recent
14711471
developments in TEI, especially those that concern spoken-language material.</p>

data/JTEI/13_2020-22/jtei-cc-ra-wittern-189-source.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -574,7 +574,7 @@
574574
usage has been increasing slowly but steadily.</p>
575575
<div xml:id="kanripo">
576576
<head>Kanripo Project Details</head>
577-
<p>All the texts are freely available on <rs type="soft.name" ref="">GitHub</rs> in their
577+
<p>All the texts are freely available on <rs type="soft.name" ref="#github">GitHub</rs> in their
578578
source form. This repository of texts can be accessed through the <ref
579579
target="https://www.kanripo.org/">kanripo.org</ref> website, but also through a module
580580
of the Emacs editor called Mandoku. This allows users to query, access, clone, edit, and

data/JTEI/14_2021-23/jtei-cc-ra-mylonas-202-source.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -619,7 +619,7 @@
619619
target="http://nomisma.org/">Nomisma</ref>, and <ref
620620
target="http://www.cidoc-crm.org/crmtex/home-8">CRMtex</ref><note>CIDOC (International
621621
Committee for Documentation) Conceptual <ptr type="software" xml:id="Reference"
622-
target="#Reference"/><rs type="soft.name" ref="#Reference">Reference</rs> Model,
622+
target="#omekareference"/><rs type="soft.name" ref="#Reference">Reference</rs> Model,
623623
accessed July 4, 2022, <ptr target="http://www.cidoc-crm.org/"/>; Nomisma (knowledge
624624
organization system for numismatics), accessed July 4, 2022, <ptr
625625
target="http://nomisma.org/"/>; CRMtex model for the study of ancient texts (an

data/JTEI/7_2014/jtei-7-dee-source.xml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -734,9 +734,9 @@
734734
<head>Integrated Resources</head>
735735
<p>While initiatives such as TAPAS, TEICHI, and <ref
736736
target="https://sites.google.com/site/cwrcwriterhelp/"><ptr type="software"
737-
xml:id="CWRC-Writer" target="#CWRC-Writer"/><rs type="soft.name" ref="#CWRC-Writer"
737+
xml:id="CWRC-Writer" target="#cwrcwriter"/><rs type="soft.name" ref="#CWRC-Writer"
738738
>CWRC-Writer</rs></ref><note><p><title level="a">Welcome to CWRC Writer</title>,
739-
<ptr type="software" xml:id="CWRC-Writer" target="#CWRC-Writer"/><rs
739+
<ptr type="software" xml:id="CWRC-Writer" target="#cwrcwriter"/><rs
740740
type="soft.name" ref="#CWRC-Writer">CWRC-Writer</rs> Help, accessed September 7,
741741
2013, <ptr target="https://sites.google.com/site/cwrcwriterhelp/"/>.</p></note> have
742742
begun to address to different aspects of these needs (<ref type="bibl"

data/JTEI/8_2014-15/jtei-8-boschetti-source.xml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -162,7 +162,7 @@
162162
open-source general-purpose framework <ref target="http://cocoon.apache.org/"
163163
>Cocoon</ref><note><ptr target="http://cocoon.apache.org/"/>.</note> and the native
164164
XML database <ref target="http://exist-db.org/"><ptr type="software" xml:id="eXist-db"
165-
target="#eXist-db"/><rs type="soft.name" ref="#eXist-db">eXist-db</rs></ref><note><ptr
165+
target="#existdb"/><rs type="soft.name" ref="#eXist-db">eXist-db</rs></ref><note><ptr
166166
target="http://exist-db.org/"/>.</note> deserve to be mentioned. Specifically for
167167
TEI-annotated documents, <ref target="http://www.tustep.uni-tuebingen.de/tustep_eng.html"
168168
>TUSTEP</ref>,<note><ptr target="http://www.tustep.uni-tuebingen.de/tustep_eng.html"
@@ -245,7 +245,7 @@
245245
exposes methods that parse the XML file and create <ptr type="software" xml:id="Java"
246246
target="#Java"/><rs type="soft.name" ref="#Java">Java</rs> objects. The resources are
247247
stored and maintained in a native XML database management system (i.e., <ptr
248-
type="software" xml:id="eXist-db" target="#eXist-db"/><rs type="soft.name"
248+
type="software" xml:id="eXist-db" target="#existdb"/><rs type="soft.name"
249249
ref="#eXist-db">eXist-db</rs>). The APIs and services provided by Lucene, a software
250250
library developed and hosted by the Apache Foundation, have been used for indexing the
251251
textual data.</p>
@@ -646,7 +646,7 @@
646646
<p> The marshalling and unmarshalling process handles the serialization of the object
647647
representation of the TEI document, in order to store and retrieve data on the filesystem
648648
or in native XML databases, such as <ptr type="software" xml:id="eXist-db"
649-
target="#eXist-db"/><rs type="soft.name" ref="#eXist-db">eXist-db</rs>.</p>
649+
target="#existdb"/><rs type="soft.name" ref="#eXist-db">eXist-db</rs>.</p>
650650
<p>Performance measurement tools such as JMeter will help to optimize the performance of the
651651
library components.</p>
652652
<p> Software currently under development will be available on <ptr type="software"

0 commit comments

Comments
 (0)