Language Identifier

Identification of chosen languages from a live audio stream

This code base has been developed by ZKM | Hertz-Lab as part of the project »The Intelligent Museum«.

Please raise issues, ask questions, throw in ideas or submit code, as this repository is intended to be an open platform to collaboratively improve language identification.

BSD Simplified License.

Dependencies

openFrameworks
openFrameworks addons:
- ofxOSC (included with oF)
- ofxTensorFlow2
CLI11 parser: included in src
Language Identification: trained neural networks placed in bin/data

Tested Platforms

MacBook Pro 2017, macOS 10.15 & openFrameworks 0.11.2
MacBook Pro 2018, macOS 11.3.1 & openFrameworks 0.11.2

Structure

src/: contains the openFrameworks C++ code
bin/data/model_*: contains the SavedModels trained with TensorFlow2

Installation & Build

Overview:

Follow the steps in the ofxTensorFlow2 "Installation & Build" section for you platform
Generate the project files for this folder using the OF ProjectGenerator
Build for your platform

Generating Project Files

Project files are not included so you will need to generate the project files for your operating system and development environment using the OF ProjectGenerator which is included with the openFrameworks distribution.

To (re)generate project files for an existing project:

Click the "Import" button in the ProjectGenerator
Navigate to the project's parent folder ie. "apps/myApps", select the base folder for the example project ie. "LanguageIdentifier", and click the Open button
Click the "Update" button

If everything went Ok, you should now be able to open the generated project and build/run the example.

macOS

On macOS, a couple of additional manual steps are required to use ofxTensorflow2:

Enable C++14 in openFrameworks (only once, Xcode + Makefile)
Invoke macos_install_libs.sh in the Xcode project's Run Script build phases (after every project regeneration, Xcode only)

See the detailed steps in the ofxTensorflow2 readme.

For an Xcode build, open the Xcode project, select the "LanguageIdentifier Debug" scheme, and hit "Run".

For a Makefile build, build and run on the terminal:

cd LanguageIdentifier
make ReleaseTF2
make RunRelease

Linux

Build and run on the terminal:

cd LanguageIdentifier
make Release
make RunReleaseTF2

Usage

The openFrameworks application runs the language identification model using audio input. The detection status and detected language is sent out using OSC (Open Sound Control) messages.

Key Commands

l: toggle start/stop listening
a: toggle listening auto stop after detection

OSC Communication

Sending

By default, sends to:

address: localhost ie. 127.0.0.1
port: 9999

Message specification:

/detected status: detection status
- status: float, boolean 1 found - 0 lost
/lang index name confidence: detected language
- index: int, language map index
- name: string, language map name
- confidence: float, confidence percentage 0 - 100

Receiving

By default, listens on:

port 9898

Message specification:

/listen: start listening
/listen state: start/stop listening
- state: bool, 0 - stop, 1 - start
/autostop: enable listening auto stop after detection
/autostop state: enable/disable listening auto stop after detection
- state: bool, 0 - keep listening, 1 - stop on detection

Commandline Options

Additional run time settings are available via commandline options as shown via the --help flag output:

% bin/LanguageIdentifier --help
identifies spoken language from audio stream
Usage: LanguageIdentifier [OPTIONS]

Options:
  -h,--help                   Print this help message and exit
  -s,--senders TEXT ...       OSC sender addr:port host pairs, ex. "192.168.0.100:5555" or multicast "239.200.200.200:6666", default "localhost:9999"
  -p,--port INT               OSC receiver port, default 9898
  -c,--confidence FLOAT:FLOAT bounded to [0 - 1]
                              min confidence, default 0.75
  -t,--threshold FLOAT:INT bounded to [0 - 100]
                              volume threshold, default 25
  -l,--list                   list audio input devices and exit
  --inputdev INT              audio input device number
  --inputname TEXT            audio input device name, can do partial match, ex. "Microphone"
  --inputchan INT             audio input device channel, default 1
  -r,--samplerate INT         audio input device samplerate, can be 441000 or a multiple of 16000, default 48000
  --nolisten                  do not listen on start
  --autostop                  stop listening automatically after detection
  -e,--execute TEXT           command to execute on detection with key=value pair args
  -v,--verbose                verbose printing
  --version                   print version and exit

For example, to send OSC to multiple addresses use the -s option:

% bin/LanguageIdentifier -s localhost:9999 localhost:6666 192.168.0.101:7777

macOS

For macOS, the application binary can be invoked from within the .app bundle to pass commandline arguments:

bin/LanguageIdentifier.app/Contents/MacOS/LanguageIdentifier -h

This approach can also be wrapped up into a shell alias to be added to the account's ~/.bash_profile or ~/.zshrc file:

alias langid="/Applications/LanguageIdentifier.app/Contents/MacOS/LanguageIdentifier"

Reload the shell and application can now be invoked via:

% langid -v --inputdev 2

Executing commands

LanguageIdentifier can execute a command on a language detection via the -e/--execute option:

% bin/LanguageIdentifier -e `pwd`/script.sh

The arguments passed to the command are a series of key & values pairs where key=value:

selected key: string value, name of detected language
LANG key: float value, normalized detection confidence 0-1 for each supported language

Example args:

selected=noise noise=0.753628 chinese=0.000086 english=0.237782 french=0.001154 german=0.002538 italian=0.000791 russian=0.000018 spanish=0.004004

Note: In general, the command must include the full path if it is not in current shell PATH.

Demos

The demos consist of rapid prototypes built using the following components:

language identifier
visual front end: loaf

Custom visual front ends are written in Lua for loaf, a Lua interpreter with bindings for openFrameworks which includes a built-in Open Sound Control (OSC) server.

Usage

To set up a run environment on macOS, download loaf and place the .app in the system /Applications folder.

To run a loaf project, drag the main Lua script or project folder onto the loaf.app.

Notes

Sample Rate

The model inputs audio with a sample rate of 16 kHz, so the incoming stream is downsampled and the app's input sample rate needs to be a multiple of 16, ie. 48kHz, 92kHz, etc.

As 44.1kHz is also common, it is accepted and treated as 48kHz but the downsampled audio is then higher in pitch and may be noisy. In our tests, however detection is still acceptable.

Develop

Release steps

Update changelog
Update app version in Xcode project and ofApp.h define
Tag version commit, ala "0.3.0"
Push commit and tags to server:

git commit push git commit push --tags

The Intelligent Museum

An artistic-curatorial field of experimentation for deep learning and visitor participation

The ZKM | Center for Art and Media and the Deutsches Museum Nuremberg cooperate with the goal of implementing an AI-supported exhibition. Together with researchers and international artists, new AI-based works of art will be realized during the next four years (2020-2023). They will be embedded in the AI-supported exhibition in both houses. The Project „The Intelligent Museum” is funded by the Digital Culture Programme of the Kulturstiftung des Bundes (German Federal Cultural Foundation) and funded by the Beauftragte der Bundesregierung für Kultur und Medien (Federal Government Commissioner for Culture and the Media).

As part of the project, digital curating will be critically examined using various approaches of digital art. Experimenting with new digital aesthetics and forms of expression enables new museum experiences and thus new ways of museum communication and visitor participation. The museum is transformed to a place of experience and critical exchange.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
bin/data		bin/data
demos/demo1		demos/demo1
media		media
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
CHANGES.txt		CHANGES.txt
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE.txt		LICENSE.txt
Makefile		Makefile
README.md		README.md
addons.make		addons.make
config.make		config.make
langid		langid

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Language Identifier

Dependencies

Tested Platforms

Structure

Installation & Build

Generating Project Files

macOS

Linux

Usage

Key Commands

OSC Communication

Sending

Receiving

Commandline Options

macOS

Executing commands

Demos

Usage

Notes

Sample Rate

Develop

Release steps

The Intelligent Museum

About

Releases

Languages

License

zkmkarlsruhe/LanguageIdentifier

Folders and files

Latest commit

History

Repository files navigation

Language Identifier

Dependencies

Tested Platforms

Structure

Installation & Build

Generating Project Files

macOS

Linux

Usage

Key Commands

OSC Communication

Sending

Receiving

Commandline Options

macOS

Executing commands

Demos

Usage

Notes

Sample Rate

Develop

Release steps

The Intelligent Museum

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Languages