Skip to content

Latest commit

 

History

History
112 lines (79 loc) · 6.56 KB

README.md

File metadata and controls

112 lines (79 loc) · 6.56 KB

LangSign

ASL gesture recognition in the browser.

To run: start the two servers[flask and http-server, instructions below] and open index.html in a browser

Demo:

langsignGIF_2

Setting the whole thing up

Getting keys for dialogflow agent:

1. Setup an Agent in Dialogflow

default_intent

Google describes Agents as NLU (Natural Language Understanding) modules. They transform natural user requests into structured, actionable data.

  1. In the Dialogflow Console, create an agent
  2. Choose or create a Google Cloud Platform (GCP) Project.
  3. Dialogflow will automatically setup a Default Welcome Intent, which you can try from the test console.

2. Create a service account and download the service key json file

In order for your Bot to access your Dialogflow Agent, you will need to create a service account. A Service account is an identity that allows your bot to access the Dialogflow services on your behalf. Once configured, you can download the private key for your service account as a JSON file.

  1. Open the GCP Cloud Console, and select the project which contains your agent.
  2. From the nav menu, choose IAM & admin, Service accounts.
  3. Select Dialogflow Integrations (created by default by Dialogflow), or create your own.
  4. Under actions, select create key, select JSON and download the file.

3. Save it in the keys/ folder

You'll have to create a 'keys/' folder in the workspace if it doesn't exist.

4. ProjectId changes

Change the projectID variable in backend.py to your dialogflow projectID. (it can be found in your agent settings)

Serving the CNN model

  1. npm install http-server
  2. To serve the folder: node_modules/http-server/bin/http-server model_types/tfjs_version -c1 --cors
  3. It's now available on https://localhost:8080 (:8080 is the default port)

Hosting the REST Api locally

The REST Api (in flask) talks to dialogflow. It is called by the frontend.

  1. To host it: python backend.py
  2. It's now available on https://localhost:5000 (:5000 is the default port)

Making changes to index.js

Changes to index.js will not be reflected in the website. This is because browserify is used to bundle requirements.

  1. npm install -g browserify to install browserify globally
  2. browserify index.js > bundle.js to bundle up the requirements into a single file. bundle.js s the file we'll be linking to our index.html

The flow, diagramatically

This is how everything ties up:

flow

This should be your directory structure(well this is mine):

directory structure

References and Notes:

  1. this awesome tutorial on how to access webcam, load a model, and train the model on the browser itself with tf.js .

  2. host the tfjs_version directory locally by navigating inside it and typing python3 -m http.server(doesnt allow CORS)

npm init and npm install http-server and then node_modules/http-server/bin/http-server model_types/tfjs_version -c1 --cors This allows you to serve a directory with CORS(cross-origin requests) enabled. Can use the same https-server globally, anywhere if installed with -g

  1. This video and the video's article were also helpful in making model in python and loading for tf.js, by sentdex.

  2. Importing Keras models to tf.js.

  3. This article on how to do some image preprocesing with javascript using tf.js

  4. Fixing memory leak by referring to the tf.dispose() method, from this website.

  5. Capturing Keypresses on a website from stackOverflow ofcourse.

  6. Testing what keypress is what keycode, tester here. user would use spacebar to confirm action dialogflow integration

  7. use require() in the browser. look at Browserify docs. npm install -g browserify and browserify index.js > bundle.js

we use require() to use jquery-sendkeys, from here.

  1. BotUI starter from their website.
  2. pip install dialogflow and pip install flask-restful for python
  3. python backend.py to obtain an api endpoint which frontend can hopefully query. start the backend flask server with this command before loading index.html

example api call http://127.0.0.1:5000/request/camp deadline

  1. BotUI examples!!!

keyboard commands:

  • shift: captures current prediction
  • spacebar: inserts a space into the sentence being formed
  • Ctrl+E: sends the sentence formed to the chatbot input feild, clears the sentence buffer
  • enter: sends what was in the chatbot input feild to the chatbot

todo/expansion

  • send it directly to the chatbot input and manipulate stuff over there?
  • show "sentence buffer" every time finalStr is cleared