Skip to content
This repository has been archived by the owner on Sep 21, 2023. It is now read-only.

[Suggestion] Emoji recognition #27

Open
kschelonka opened this issue Jul 3, 2019 · 3 comments
Open

[Suggestion] Emoji recognition #27

kschelonka opened this issue Jul 3, 2019 · 3 comments

Comments

@kschelonka
Copy link

kschelonka commented Jul 3, 2019

Having to look up and copy-paste emoji slows down transcription. It might be useful for the OCR bot to output the emoji used in the text at the end of its comment. That way the user could copy-paste from the comment, rather than having to look up all the individual emoji.

A model could be trained to detect emoji separately from the OCR service, and invoked afterwards.

@codingJWilliams
Copy link

codingJWilliams commented Jul 3, 2019 via email

@itsthejoker
Copy link
Member

Either my google skills are failing me horribly or this hasn't been done before. I think this is a fascinating idea, but I'm struggling to come up with ways that it's technologically feasible.

Emojipedia offers an API, but only for the raw data (i.e. you have to already know which emoji you're looking at)... the biggest problem is keeping track of all the different vendor interpretations of various emoji. What I suppose might work is training a standard OCR model on only emoji and using all the vendor implementations of every emoji as the data. It would be... very difficult... to insert them into the appropriate places, but we might be able to make a list at the bottom that just says "these are all the emoji in the above transcription: ❤️ 🎉 🍰 😍 " or something along those lines.

Something else worth keeping in mind is that OCR is very expensive in terms of computation time. When we originally started, we ran our own instance of Tesseract for a few months and suffered a 17 minute queue time. Not something I'd particularly like to do again.

@itsthejoker
Copy link
Member

This might be possible with OpenCV -- I'm not sure that it's worth the effort as it looks like it will require effectively rolling our own OCR solution again, but I'm putting this in here so I don't lose it. https://stackoverflow.com/questions/35486522/ocr-an-ios-android-messaging-app-screenshot#comment74110958_35486522

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants