CHA file parser for Python

Class for parsing CLAN's CHA file.

Made by Leandro Garber from CIIPME-CONICET

Features

Utterances as a list of strings
MOR tier as objects
Easily add more custom tiers
Count tokens and types of words, utterances, nouns, verbs and adjectives. Filter by child directed, child produced and overheard speech.
Count main verbs, either referring to physical or mental actions. Auxiliary verbs present in periphrastic verbs are excluded. (spanish only)

Usage

Import

import sys
sys.path.insert(0, '<path_to_cloned_repo>')

from ChaFile import *

Instance

cha = ChaFile(<path_to_cha_file>)

Options

cha = ChaFile(<path_to_cha_file> )

Get utterances

lines = cha.getLines()

Each line is an object with:

LINE_UTTERANCE : The text of the utterance
LINE_NUMBER
LINE_SPEAKER
LINE_ADDRESSEE
LINE_BULLET : Timestamp
TIER_MOR : A list of objects with MOR data: MOR_UNIT_LEXEMA and MOR_UNIT_CATEGORIA
... any other tier

Cite

Garber, L. (2019). CHA file python parser. Zenodo. https://doi.org/10.5281/zenodo.3364020

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
turnos		turnos
uttVacias		uttVacias
.gitignore		.gitignore
ChaFile.py		ChaFile.py
LICENSE		LICENSE
README.md		README.md
log.py		log.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CHA file parser for Python

Features

Usage

Import

Instance

Get utterances

Cite

About

Releases 2

Packages

Languages

License

macramole/CHAFile

Folders and files

Latest commit

History

Repository files navigation

CHA file parser for Python

Features

Usage

Import

Instance

Get utterances

Cite

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages