Skip to content

Commit c47b764

Browse files
committed
2 parents 4e6e68c + 7ac0bd6 commit c47b764

21 files changed

+474
-303
lines changed

data/JTEI/12_2019-20/jtei-cc-ra-flanders-176-source.xml

Lines changed: 142 additions & 115 deletions
Large diffs are not rendered by default.

data/JTEI/13_2020-22/jtei-cc-pn-kuhry-188-source.xml

Lines changed: 95 additions & 60 deletions
Large diffs are not rendered by default.

data/JTEI/13_2020-22/jtei-cc-ra-parisse-182-source.xml

Lines changed: 26 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -234,16 +234,16 @@
234234
<p>Many software packages dedicated to editing spoken language transcription contain
235235
utilities that can convert many formats: for example, <ptr type="software" xml:id="R15"
236236
target="#exmaralda"/>
237-
<rs type="soft.name" ref="#R15">EXMARaLDA</rs> ( <rs type="soft.Bib.Ref" target="#R15"
237+
<rs type="soft.name" ref="#R15">EXMARaLDA</rs> ( <rs type="soft.bib.ref" ref="#R15"
238238
><ref type="bibl" target="#schmidt2004">Schmidt 2004</ref>
239-
</rs>; see <rs type="soft.url" target="#R15"><ptr target="https://exmaralda.org"
239+
</rs>; see <rs type="soft.url" ref="#R15"><ptr target="https://exmaralda.org"
240240
/></rs>), <ptr type="software" xml:id="R16" target="#anvil"/>
241-
<rs type="soft.name" ref="#R16">Anvil</rs> (<rs type="soft.Bib.Ref" target="#R16">
241+
<rs type="soft.name" ref="#R16">Anvil</rs> (<rs type="soft.bib.ref" ref="#R16">
242242
<ref type="bibl" target="#kipp2001">Kipp 2001</ref></rs>; see <rs type="soft.url"
243-
target="#R16"><ptr target="https://www.anvil-software.org"/></rs>), and <ptr
243+
ref="#R16"><ptr target="https://www.anvil-software.org"/></rs>), and <ptr
244244
type="software" xml:id="R17" target="#elan"/><rs type="soft.name" ref="#R17">ELAN</rs>
245-
(<rs type="soft.bib.ref" target="#R17"><ref type="bibl" target="#wittenburg2006"
246-
>Wittenburg et al. 2006</ref></rs>; see <rs type="soft.url" target="#R17">
245+
(<rs type="soft.bib.ref" ref="#R17"><ref type="bibl" target="#wittenburg2006"
246+
>Wittenburg et al. 2006</ref></rs>; see <rs type="soft.url" ref="#R17">
247247
<ptr target="https://archive.mpi.nl/tla/elan"/></rs>). However, in all cases, the
248248
conversions are limited to the features implemented in the tool itself—for example, with
249249
a limited set of metadata—and they cannot always be used to prepare data to be used by
@@ -260,9 +260,9 @@
260260
tools missing in the <ptr type="software" xml:id="R18" target="#teicorpo"/>
261261
<rs type="soft.name" ref="#R18">TEICORPO</rs> approach are <ptr type="software"
262262
xml:id="R19" target="#exmaralda"/><rs type="soft.name" ref="#R19">EXMARaLDA</rs> and
263-
<ptr type="software" xml:id="R19" target="#folker"/>FOLKER (<rs type="soft.bib.ref"
264-
target="#R19"><ref type="bibl" target="#schmidts2010">Schmidt and Schütte
265-
2010</ref></rs>; see <rs type="soft.url" target="#R19"><ptr
263+
<ptr type="software" xml:id="R241" target="#folker"/>FOLKER (<rs type="soft.bib.ref"
264+
ref="#R19"><ref type="bibl" target="#schmidts2010">Schmidt and Schütte
265+
2010</ref></rs>; see <rs type="soft.url" ref="#R241"><ptr
266266
target="https://exmaralda.org/en/folker-en/"/></rs>), but this was only because the
267267
conversion tools from and to <ptr type="software" xml:id="R20" target="#EXMARaLDA"/><rs
268268
type="soft.name" ref="#R20">EXMARaLDA</rs>, <ptr type="software" xml:id="R21"
@@ -279,22 +279,22 @@
279279
<ptr type="software" xml:id="R25" target="#folker"/>
280280
<rs type="soft.name" ref="#R25">FOLKER</rs> software fit within the process chain of
281281
<ptr type="software" xml:id="R26" target="#teicorpo"/><rs type="soft.name"
282-
target="#R26"> TEICORPO</rs>. This demonstrates the usefulness of a well-known and
282+
ref="#R26"> TEICORPO</rs>. This demonstrates the usefulness of a well-known and
283283
efficient format such as TEI.</p>
284284
<p>There are, however, differences between the two projects that make them nonredundant
285285
but complementary, each project having specificities that can be useful or damaging
286286
depending on the user’s needs. One minor difference is that the <ptr type="software"
287-
xml:id="R27" ref="#teicorpo"/>
288-
<rs type="soft.name" target="#R27">TEICORPO</rs> project is not a functionality of an
287+
xml:id="R27" target="#teicorpo"/>
288+
<rs type="soft.name" ref="#R27">TEICORPO</rs> project is not a functionality of an
289289
editing tool, but is a standalone tool for converting data between one format and
290290
another. This had certain effects on the user interface and explains some of the choices
291291
made in the development of the two tools.</p>
292292
<p>There are two major differences between <ptr type="software" xml:id="R28"
293293
target="#teicorpo"/>
294-
<rs type="soft.name" target="#R28">TEICORPO</rs> and Schmidt’s approach, which affected
294+
<rs type="soft.name" ref="#R28">TEICORPO</rs> and Schmidt’s approach, which affected
295295
both the design of the tools and how they can be used. The first difference is that in
296-
developing <ptr type="software" xml:id="R29" ref="#teicorpo"/><rs type="soft.name"
297-
target="#R29">TEICORPO</rs>, it was decided that the conversion between the original
296+
developing <ptr type="software" xml:id="R29" target="#teicorpo"/><rs type="soft.name"
297+
ref="#R29">TEICORPO</rs>, it was decided that the conversion between the original
298298
formats and TEI had to be lossless (or as lossless as possible) because we wanted to
299299
offer a means to store the research data for long-term conservation and dissemination in
300300
a standard XML format instead of in proprietary formats such as those used by <ptr
@@ -1004,7 +1004,7 @@
10041004
<rs type="soft.name" ref="#R117">TEICONVERT</rs> makes spoken language data available
10051005
for <ptr type="software" xml:id="R118" target="#txm"/><rs type="soft.name" ref="#R118"
10061006
>TXM</rs> (<rs type="soft.bib.ref" ref="#R118"><ref type="bibl" target="#heiden2010"
1007-
>Heiden 2010</ref></rs>; see <rs type="soft.turl" ref="#R118"><ptr
1007+
>Heiden 2010</ref></rs>; see <rs type="soft.url" ref="#R118"><ptr
10081008
target="http://textometrie.ens-lyon.fr"/></rs>), <ptr type="software" xml:id="R119"
10091009
target="#letrameur"/>
10101010
<rs type="soft.name" ref="#R119">Le Trameur</rs> (<rs type="soft.bib.ref" ref="#R119"
@@ -1149,11 +1149,13 @@
11491149
<rs type="soft.name" ref="#R144">TEICORPO</rs> includes the ability to use any
11501150
syntactic model. For French data, we used the PERCEO model (<ref type="bibl"
11511151
target="#benzitoun2012">Benzitoun, Fort, and Sagot 2012</ref>).</p>
1152-
<p>The command line to be used is: <code>java -cp <ptr type="software" xml:id="R208"
1152+
<p>The command line to be used is: <ptr type="software" xml:id="R240" target="#java"/><code>
1153+
<rs type="soft.name" ref="#R240">java</rs> -cp <ptr type="software" xml:id="R208"
11531154
target="#teicorpo"/>
11541155
<rs type="soft.name" ref="#R208">TEICORPO</rs>.jar fr.ortolang.<ptr type="software"
11551156
xml:id="R209" target="#teicorpo"/>
1156-
<rs type="soft.name" ref="#R209">TEICORPO</rs>.TeiTreeTagger filenames...</code>
1157+
<rs type="soft.name" ref="#R209">TEICORPO</rs>.<ptr type="software" xml:id="R239" target="#treetagger"/>
1158+
Tei <rs type="soft.name" ref="#R239">TreeTagger</rs> filenames...</code>
11571159
with additional parameters:</p>
11581160
<table xml:id="table2">
11591161
<row role="label">
@@ -1329,7 +1331,10 @@
13291331
<rs type="soft.name" ref="#R153">TreeTagger</rs> . The -model and -syntaxformat
13301332
parameters can be used in a similar way to specify the grammatical model to be used
13311333
and the output format. A command line example is:</p>
1332-
<p><code>java -cp "teicorpo.jar:directory_for_SNLP/*" fr.ortolang.teicorpo.TeiSNLP
1334+
<p><code><ptr type="software" xml:id="R236" target="#java"/>
1335+
<rs type="soft.name" ref="#R236">java</rs> -cp "<ptr type="software" xml:id="R237" target="#teicorpo"/>
1336+
<rs type="soft.name" ref="#R237">teicorpo</rs>.jar:directory_for_SNLP/*" fr.ortolang.<ptr type="software" xml:id="R238" target="#teicorpo"/>
1337+
<rs type="soft.name" ref="#R238">teicorpo</rs>.TeiSNLP
13331338
-syntaxformat svalue -model filename.tei_corpo.xml</code></p>
13341339
<p>The <term>directory_for_SNLP</term> is the name of the location on a computer where
13351340
all the <ptr type="software" xml:id="R212" target="#stanfordcorenlp"/>
@@ -1392,7 +1397,7 @@
13921397
<p>Export can be done from TEI into a format used by textometric software (see <ptr
13931398
target="#example_code_11" type="crossref"/>). This is the case for <ptr
13941399
type="software" xml:id="R160" target="#txm"/><rs type="soft.name" ref="#R160">TXM</rs>,<note>
1395-
<p>See the Textométrie website, last updated June 29, 2020, <rs type="soft.ulr" ref="#R160"
1400+
<p>See the Textométrie website, last updated June 29, 2020, <rs type="soft.url" ref="#R160"
13961401
><ptr target="http://textometrie.ens-lyon.fr/?lang=en"/></rs>.</p>
13971402
</note> a textometric software application. In this case, instead of using a partition
13981403
representation, the information from the grammatical analysis is inserted at the word
@@ -1591,7 +1596,7 @@
15911596
target="https://www.fon.hum.uva.nl/paul/papers/speakUnspeakPraat_glot2001.pdf"
15921597
/>.</bibl>
15931598
</rs>
1594-
<ptr type="software" xml:id="R226" target="#teimata"/>
1599+
<ptr type="software" xml:id="R226" target="#teimeta"/>
15951600
<rs type="soft.bib.ref" ref="#R226">
15961601
<bibl xml:id="etienne"><rs type="soft.agent" ref="#R226"><author>Etienne,
15971602
Carole</author></rs>, <rs type="soft.agent" ref="#R226"><author>Loïc

data/JTEI/13_2020-22/jtei-cc-ra-winslow-186-source.xml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -530,8 +530,8 @@
530530
favoring one period or type of document over another, a generic element seems both
531531
desirable and advisable. The proposed element, as implemented in the TEI_CEI
532532
ODD,<note>See the <ref xml:id="ref13" type="bibl" target="#winslowetal2019"
533-
>CEI2TEI <ptr type="software" xml:id="GitHub" target="#GitHub"/><rs
534-
type="soft.name" ref="#GitHub">GitHub</rs> repository, accessed June 25,
533+
>CEI2TEI <ptr type="software" xml:id="R1" target="#github"/><rs
534+
type="soft.name" ref="#R1">GitHub</rs> repository, accessed June 25,
535535
2021</ref>, <ptr
536536
target="https://github.com/GVogeler/CEI2TEI/blob/master/tei_cei.odd"/>.</note>
537537
is simple (<ident>attList</ident> items suppressed for brevity: they follow the
@@ -576,8 +576,8 @@
576576
proposed vocabulary, provided in SKOS (Simple Knowledge Organization System) format
577577
(as part of the project’s <ref
578578
target="https://github.com/GVogeler/CEI2TEI/blob/master/Authentication/authen.skos.ttl">
579-
<ptr type="software" xml:id="GitHub" target="#GitHub"/><rs type="soft.name"
580-
ref="#GitHub">GitHub</rs> repository</ref>,<note>Accessed July 13, 2021, <ptr
579+
<ptr type="software" xml:id="R2" target="#github"/><rs type="soft.name"
580+
ref="#R2">GitHub</rs> repository</ref>,<note>Accessed July 13, 2021, <ptr
581581
target="https://github.com/GVogeler/CEI2TEI/blob/master/Authentication/authen.skos.ttl"
582582
/>.</note> at <ptr
583583
target="https://github.com/GVogeler/CEI2TEI/blob/master/Authentication/authen.skos.ttl"

data/JTEI/14_2021-23/jtei-barabuccietal-196-source.xml

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -261,11 +261,11 @@
261261
text are presented in such a way that the reader is granted more informed access
262262
to them.</p>
263263
<p>The edition will be published online using a specifically tailored version of <ptr
264-
type="software" xml:id="R29" target="#evt"/><rs type="soft.name" ref="R29"
264+
type="software" xml:id="R29" target="#evt"/><rs type="soft.name" ref="#R29"
265265
>EVT</rs> (Edition Visualization Technology<note><quote source="#quote1">A
266266
light-weight, open source tool specifically designed to create digital
267267
editions from XML-encoded texts</quote>
268-
<rs type="soft.bib" ref="R20">(<ref type="bibl" target="#delturco2013"
268+
<rs type="soft.bib" ref="#R20">(<ref type="bibl" target="#delturco2013"
269269
xml:id="quote1">Rosselli Del Turco et al. 2013</ref>)</rs>.</note>) and
270270
will present, on the one hand, each witness in its continuum from facsimile to
271271
multiple levels of normalization and, on the other hand, the three main witnesses
@@ -790,7 +790,7 @@
790790
with no manual intervention on the resulting files.</item>
791791
<item>The generated editions files will conform to the TEI subset understood by
792792
<ptr type="software" xml:id="R30" target="#evt"/><rs type="soft.name"
793-
ref="R30">EVT</rs>.</item>
793+
ref="#R30">EVT</rs>.</item>
794794
</list>
795795
<p>Some of these desiderata clash with each other. For instance, the desire to
796796
directly edit the XML file makes it hard and error-prone to keep in a single
@@ -855,7 +855,7 @@
855855
target="#delturcond">Roberto Rosselli Del Turco (n.d.)</ref>: here two
856856
levels of edition are offered, a diplomatic and a more interpretative one. The
857857
user can compare the two editions visualizing them synoptically in the <ptr
858-
type="software" xml:id="R31" target="#evt"/><rs type="soft.name" ref="R31"
858+
type="software" xml:id="R31" target="#evt"/><rs type="soft.name" ref="#R31"
859859
>EVT</rs> software used for the edition.</p>
860860
</div>
861861
</div>
@@ -1474,10 +1474,10 @@
14741474
version</biblScope>. Accessed <date>October 22, 2021</date>. <ptr
14751475
target="http://vbd.humnet.unipi.it/beta2/"/>.</bibl>
14761476
<bibl xml:id="delturco2013"><ptr type="software" xml:id="R28" target="#evt"
1477-
/><author><rs type="soft.agent" ref="R28">Rosselli Del Turco,
1478-
Roberto</rs></author>, et al. <rs type="soft.bib" ref="R28">
1477+
/><author><rs type="soft.agent" ref="#R28">Rosselli Del Turco,
1478+
Roberto</rs></author>, et al. <rs type="soft.bib" ref="#R28">
14791479
<date>2013</date>. <title level="m">Edition Visualization Technology</title>.
1480-
</rs> Accessed <date>April 19, 2021</date>.<rs type="soft.url" ref="R28"><ptr
1480+
</rs> Accessed <date>April 19, 2021</date>.<rs type="soft.url" ref="#R28"><ptr
14811481
target="http://evt.labcd.unipi.it/"/></rs>.</bibl>
14821482
<bibl xml:id="stella2020"><editor>Stella, Francesco</editor>, ed. <date>2020</date>.
14831483
<title level="m">Corpus Rhythmorum Musicum.</title> Last modified <date>July

data/JTEI/14_2021-23/jtei-cc-pn-erjavec-195-source.xml

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -326,9 +326,9 @@
326326
<head>Presentation of Parla-CLARIN</head>
327327
<p>Like the TEI Guidelines, the Parla-CLARIN recommendations are available on <ref
328328
target="https://github.com/clarin-eric/parla-clarin/"><ptr type="software" xml:id="R1"
329-
target="#GitHub"/><rs type="soft.name" ref="#R1">GitHub</rs></ref>, as a
329+
target="#github"/><rs type="soft.name" ref="#R1">GitHub</rs></ref>, as a
330330
project<note>Tomaž Erjavec and Andrej Pančur, Parla-CLARIN project <ptr
331-
type="software" xml:id="R2" target="#GitHub"/><rs type="soft.name" ref="#R2"
331+
type="software" xml:id="R2" target="#github"/><rs type="soft.name" ref="#R2"
332332
>GitHub</rs> site, last updated March 17, 2021, <ptr type="software" xml:id="R9"
333333
target="#parlaclarinscripts"/><rs type="soft.url" ref="#R9"><ptr
334334
target="https://github.com/clarin-eric/parla-clarin/"/></rs>.</note> of the CLARIN
@@ -581,7 +581,7 @@
581581
and into developing the <ptr type="software" xml:id="R12" target="#parlaclarinscripts"
582582
/><rs type="soft.name" ref="#R12">conversion from Akoma Ntoso to Parla-CLARIN</rs>. We
583583
have not included examples of the encoding, as these are readily available on the <ptr
584-
type="software" xml:id="R3" target="#GitHub"/><rs type="soft.name" ref="#R3">GitHub</rs>
584+
type="software" xml:id="R3" target="#github"/><rs type="soft.name" ref="#R3">GitHub</rs>
585585
documentation page of the project, and large Parla-CLARIN encoded corpora are openly
586586
available.</p>
587587
<p>Apart from the siParl 2.0 corpus mentioned above (<ptr type="crossref"
@@ -632,7 +632,7 @@
632632
specification from the default ones in the TEI Guidelines to ones taken or adapted from
633633
the collected parliamentary corpora.</p>
634634
<p>Second, as we have already done for ParlaMint, we plan to add to the <ptr type="software"
635-
xml:id="R4" target="#GitHub"/><rs type="soft.name" ref="#R4">GitHub</rs> Parla-CLARIN
635+
xml:id="R4" target="#github"/><rs type="soft.name" ref="#R4">GitHub</rs> Parla-CLARIN
636636
project more down-conversion scripts with which we would increase the usability of the
637637
Parla-CLARIN corpora. As mentioned, work also needs to be done to develop a conversion to
638638
RDF.</p>
@@ -803,7 +803,8 @@
803803
<bibl xml:id="kilgarriff14"><author>Kilgarriff, Adam</author>, <author>Vít Baisa</author>,
804804
<author>Jan Bušta</author>, <author>Miloš Jakubíček</author>, <author>Vojtěch
805805
Kovář</author>, <author>Jan Michelfeit</author>, <author>Pavel Rychlý</author>, and
806-
<author>Vít Suchomel</author>. <rs type="soft.bib.ref" ref="ewfew"><date>2014</date>.
806+
<author>Vít Suchomel</author>. <ptr type="software" xml:id="R30"
807+
target="#sketchengine"/><rs type="soft.name soft.bib.ref" ref="#R30"><date>2014</date>.
807808
<title level="a">The Sketch Engine: Ten Years On.</title></rs>
808809
<title level="j">Lexicography: Journal of ASIALEX</title>
809810
<biblScope unit="volume">1</biblScope> (<biblScope unit="issue">1</biblScope>):

data/JTEI/8_2014-15/jtei-8-boschetti-source.xml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -240,7 +240,7 @@
240240
</list>. The continuous integration and release are supported by open source Integrated
241241
Development Environments (IDEs) like <ptr type="software" xml:id="R10" target="#eclipse"/>
242242
<rs type="soft.name" ref="#R10">Eclipse</rs> or <ptr type="software" xml:id="R11"
243-
target="netbeans"/><rs type="soft.name" ref="#R11"> NetBeans</rs> and by a software
243+
target="#netbeans"/> <rs type="soft.name" ref="#R11">NetBeans</rs> and by a software
244244
configuration management tool such as <ptr type="software" xml:id="R13" target="#svn"/>
245245
<rs type="soft.name" ref="#R13">SVN</rs> or <ptr type="software" xml:id="R12"
246246
target="#git"/>
@@ -722,7 +722,7 @@
722722
Environment: Metadata, Vocabularies and Techniques in the Digital Humanities</title>,
723723
article no. 11. <pubPlace>New York</pubPlace>: <publisher>ACM</publisher>. doi:<idno
724724
type="doi">10.1145/2517978.2517990</idno>.</bibl>
725-
<ptr type="software" xml:id="R39" target="g2a"/>
725+
<ptr type="software" xml:id="R39" target="#g2a"/>
726726
<rs type="soft.ref.bib" ref="#R39">
727727
<bibl xml:id="bozzi13"><author><rs type="soft.agent" ref="#R39">Bozzi,
728728
Andrea</rs></author>. <date>2013</date>. <title level="a">G2A: A Web Application to

0 commit comments

Comments
 (0)