Weighted-Transducers-based framework of Mohri et al. and Music Transcription

notes on a model of Weighted Symbolic Visibly Pushdown Automata for Music Notation Processing (Zettelkasten) at

markdown math code is not rendered in github please see the copy at https://gitlab.inria.fr/qparse/paper-wtt

Approach of Mohri et al.

to the computation of (string) edit-distances between (string) languages

principle:

a distance $d(s, t)$ between strings $s$ and $t$ is defined by a weighted-transducer $T$ (that includes Levenshtein edit-distance but goes well beyond)
the distance $d(L_1, L_2)$ between 2 string languages $L_1$ and $L_2$ is the min of $d(s, t)$ for $s ∈ L_1$ and $t∈ L_2$
for 2 weighted automata $A_1$ and $A_2$ the distance $d(A_1, A_2)$ is the min of $A_1(s) + d(s, t) + A_2(t)$ it is computed by the composed transducer $A_1 \circ T \circ A_2$.

content on distances and models

distance-languages.md :
- definition of edit-distance between string and string languages.
- results (essentially Mohri et al.) on the computation of edit-distances between unweighted and weighted languages over finite alphabet
- comments on the generalisation to infinite alphabets
  
  it needs appropriate generalisation of transducers
  - to nested words (for scores) : Visibly Pushdown Transducers [Alur, Madhusudan]
- to infinite alphabets : Symbolic Visibly Pushdown Transducers [Alur, d'Antoni]
DTW.md : generalities on Dynamic Time Warping in the context of the above similarity measure definition and computation.
NWA.md references for various extended models of automata
WSVPA.md proposal of automaton for dealing with the performance/ score representations below.

Application

to the computation of distance performance-score, distance performance-score language and transcription.

**principle: **

a performance is a string $s$ over an infinite alphabet = MIDI-like sequence of timestamped events
a score is a tree or equivalently a nested word (structured string) over a finite language see examples in rhythm-nested-words.md
a score language is a Weighted Visibly Pushdown Automaton (aka "score grammar")
a performance-score distance is defined by a weighted transducer $T$ between infinite and finite alphabet = Weighted Symbolic Visibly Pushdown Transducer?, see WSVPA.md for preliminary proposal)
MIDI2score transcription is a 1-best parsing problem for $ \{ s \} \circ T \circ A_2$ where
- $\{ s \}$ is the singleton language containing the performance
- $A_2$ is the score language

a general picture is presented in transcription.md (with focus on MIDI2score transcription).

more details on the comparison between performances and score / score language :

distance-performance-score.md :
- sequential representations of performances and scores (strings over finite and infinite alphabet)
- automata theoretic definition of score languages
- definition of similarity measures between these representations

Theoretical framework for automated music transcription and performance analysis.

More conventional than in the paper MCM'19 : based on small extensions of standard tools :

Approach of Mohri et al.

to the computation of (string) edit-distances between (string) languages

principle:

the distance $d(s, t)$ between two strings $s$ and $t$ is defined by a weighted-transducer $T$ (that permits to define Levenshtein edit-distance and goes well beyond)
the distance $d(L_1, L_2)$ between 2 string languages $L_1$ and $L_2$ is the min of $d(s, t)$ for $s ∈ L_1$ and $t ∈ L_2$
for two weighted automata $A_1$ and $A_2$, the distance $d(A_1, A_2)$ is the min of $A_1(s) + d(s, t) + A_2(t)$ it is computed by the composed transducer $A_1 \circ T \circ A_2$.

content on distances and models

distance-languages.md :
- definition of edit-distance between string and string languages.
- results (essentially Mohri et al.) on the computation of edit-distances between unweighted and weighted languages over finite alphabet
- comments on the generalisation to infinite alphabets
  
  it needs appropriate generalisation of transducers
  - to nested words (for scores) : Visibly Pushdown Transducers [Alur, Madhusudan]
  - to infinite alphabets : Symbolic Visibly Pushdown Transducers [Alur, d'Antoni]
DTW.md : generalities on Dynamic Time Warping in the context of the above similarity measure definition and computation.
NWA.md references for various extended models of automata (to be completed)
WSVPA.md proposal of automaton for dealing with the performance / score representations below (preliminary).

Application

to the computation of distance performance-score, distance performance-score language and automated music transcription.

principle:

a performance is a string $s$ over an infinite alphabet = MIDI-like sequence of timestamped events
a score is a tree or equivalently a nested word (structured string) over a finite language see examples in rhythm-nested-words.md
a score language is a Weighted Visibly Pushdown Automaton (aka "score grammar")
a performance-score distance is defined by a weighted transducer $T$ between infinite and finite alphabet = Weighted Symbolic Visibly Pushdown Transducer?, see WSVPA.md for preliminary proposal of model...
MIDI2score transcription is a 1-best parsing problem for $ \{ s \} \circ T \circ A_2$ where
- $\{ s \}$ is the singleton language containing the performance
- $A_2$ is the score language

a general picture is presented in transcription.md (with focus on MIDI2score transcription).

more details on the comparison between performances and score / score language :

distance-performance-score.md :
- sequential representations of performances and scores (strings over finite and infinite alphabet)
- automata theoretic definition of score languages
- definition of similarity measures between these representations

Name		Name	Last commit message	Last commit date
Latest commit History 333 Commits
ACL-ijcnlp2021-templates		ACL-ijcnlp2021-templates
FST-TCS		FST-TCS
FoSSaCS		FoSSaCS
notes		notes
pictures		pictures
.gitignore		.gitignore
DTW.md		DTW.md
NWA.md		NWA.md
README.md		README.md
SWVPA-weight.tex		SWVPA-weight.tex
WSVPA.md		WSVPA.md
WVPA-WTA.md		WVPA-WTA.md
abstract.tex		abstract.tex
algo-best-alt.tex		algo-best-alt.tex
algo-best-lemma.tex		algo-best-lemma.tex
algo-best.tex		algo-best.tex
algo-best0.tex		algo-best0.tex
app-Bar-Hillel-Perles-Shamir.tex		app-Bar-Hillel-Perles-Shamir.tex
app-bestsearch.tex		app-bestsearch.tex
app-labels.tex		app-labels.tex
app-product.tex		app-product.tex
app-trees.tex		app-trees.tex
distance-languages.md		distance-languages.md
distance-performance-score.md		distance-performance-score.md
distance.tex		distance.tex
epsilon-removal.tex		epsilon-removal.tex
examples.tex		examples.tex
hermann-scan.pdf		hermann-scan.pdf
image.tex		image.tex
introduction.tex		introduction.tex
keywords.txt		keywords.txt
labels.tex		labels.tex
learning.md		learning.md
lipics-v2021.cls		lipics-v2021.cls
llncs.cls		llncs.cls
main-V0.tex		main-V0.tex
main-V1.pdf		main-V1.pdf
main-V1.tex		main-V1.tex
main-V2.pdf		main-V2.pdf
main-V2.tex		main-V2.tex
main-article.tex		main-article.tex
main-eptcs.tex		main-eptcs.tex
main-lipics.tex		main-lipics.tex
main-llncs.tex		main-llncs.tex
main.tex		main.tex
main2-epsilon.tex		main2-epsilon.tex
musicography.sty		musicography.sty
ocg.bst		ocg.bst
ocg.cls		ocg.cls
parse-tree.tex		parse-tree.tex
polyphony.tex		polyphony.tex
proof-best.tex		proof-best.tex
references.bib		references.bib
related work.md		related work.md
rewriting.tex		rewriting.tex
rhythm-nested-words.md		rhythm-nested-words.md
semiring.tex		semiring.tex
splncs04.bst		splncs04.bst
summary.tex		summary.tex
tempo-Large.md		tempo-Large.md
test.md		test.md
transcription.md		transcription.md
transcription.tex		transcription.tex
tree-word.md		tree-word.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Weighted-Transducers-based framework of Mohri et al. and Music Transcription

Approach of Mohri et al.

Application

Approach of Mohri et al.

Application

About

Releases

Packages

Contributors 3

Languages

cedric-cnam/WSVPA

Folders and files

Latest commit

History

Repository files navigation

Weighted-Transducers-based framework of Mohri et al. and Music Transcription

Approach of Mohri et al.

Application

Approach of Mohri et al.

Application

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages