-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathindex.html
115 lines (100 loc) · 4.84 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
<html>
<head>
<meta charset="utf-8">
<meta http-equiv="content-type" content="text/html; charset=utf-8" />
<title>ACL Special Interest Group on writing systems and written language (SIGWrit)</title>
<meta id="viewport" name="viewport" content="width=800" />
<meta name="description" content="ACL Special Interest Group on Writing Wystems and Written Language (SIGWrit)" />
</head>
<body>
<br>
<table cellpadding=15 class="center">
<tr>
<td>
<a href="sigzig2_cruciform_nisaba.png" target="_blank">
<img border="0" margin="5" height="150" src="sigzig2_cruciform_nisaba.png">
</a>
</td>
<td>
<font color="#0000ff" face="courier new, monospace" size="+2" style="background-color:transparent">
<b>ACL Special Interest Group on Writing Systems and Written Language (SIGWrit)</b></font><br>
<p><font>The purpose of SIGWrit is to promote a better understanding of the
relationship between language in its spoken or signed form, and
language in its written form, as well as to promote research on
computation and written language.<br>
</font>
</p>
</td>
</tr>
</table><br>
<font>
<b>Future events</b>:<br>
<ul>
<li><a href="https://sigwrit.org/workshops/cawl2024/">Second Workshop on Computation and Written Language (CAWL 2024)</a></li>
</ul><br>
<b>Past events</b>:<br>
<ul>
<li><a href="https://cawl.wellformedness.com/">ACL Workshop on Computation and Written Language (CAWL 2023)</a></li>
</ul><br>
<b>Overview</b><br><br>
<p>
Events sponsored by the SIG will focus on research addressing the
relationship between written and spoken language, the properties of
written language, the ways in which writing systems encode language,
and applications specifically focused on characteristics of writing
systems.
</p>
<p>
Most work on NLP focuses on language in its canonical written form.
This has often led researchers to ignore the differences between
written and spoken language or, worse, to conflate the two.
Instances of conflation are statements like "Chinese is a logographic language" or
"Persian is a right-to-left language", variants of which can be found frequently
in the ACL anthology (<a href="https://aclanthology.org/2023.cawl-1.1/">Gorman & Sproat, 2023)</a>.
These statements confuse properties of the language with properties of its writing system.
Ignoring differences between written and spoken language leads, among other things,
to conflating different words that are spelled the same (e.g., English <em>bass</em>),
or treating as different, words that have multiple spellings (e.g., Japanese <em>umai</em> 'tasty',
which can be written 旨い, うまい, ウマい, or 美味い).
</p>
<p>
Furthermore, methods for dealing with written language issues
(e.g., various kinds of normalization or conversion) or for
recognizing text input (e.g. OCR & handwriting recognition
or text entry methods) are often regarded as precursors to
NLP rather than as fundamental parts of the enterprise, despite the fact
that most NLP methods rely centrally on representations derived from text
rather than (spoken) language. This general lack of consideration of writing
has led to much of the research on such topics to largely appear outside of ACL venues,
in conferences or journals of neighboring fields such as speech technology
(e.g., text normalization) or human-computer interaction (e.g., text entry).
</p>
<p>
Topics of interest covered within the scope of the SIG and its
sponsored events will include (but not be limited to): text entry;
text tokenization; disambiguation of abbreviations and homographs;
grapheme-to-phoneme conversion, transliteration, and diacritization;
text normalization for speech and for processing “informal” genres of
text; the computational study of literary devices involving writing
systems, such as eye dialect; information-theoretic and
machine-learning approaches to decipherment; methods for specialized
text genres, e.g., clinical notes; optical character (incl.
handwriting) recognition and historical document processing;
orthographic representation for unwritten languages; spelling error
detection and correction; script normalization and encoding; and
writing system typology and its relevance to speech and language
processing.
</p>
<b>SIGWrit Constitution</b><br><br>
The constitution of the SIG can be found <a href="SIGWritConstitution.html">here</a>.<br><br>
<b>Officers</b> <br><br>
The current slate of officers is listed below:
<ul>
<li>President: <a href="https://wellformedness.com/">Kyle Gorman</a>, Graduate Center, City University of New York, USA</li>
<li>Vice President: <a href="https://lignos.org/">Costantine Lignos</a>, Brandeis University, USA</li>
<li>Secretary-Treasurer: <a href="https://zoeyliu18.github.io/">Zoey Liu</a>, University of Florida, USA</li>
<li>Student Member: <a href="https://csikasote.github.io/">Claytone Sikasote</a>, University of Cape Town, South Africa</li>
</ul>
</font>
</body>
</html>