Multi-Language-Text-Detection

Multi-Language Text Detection

Problem Statement: Enable piracy detection for non-English text by translating the text into English and matching it against piracy-related keywords.

Requirements:

Use Google Translate API or Hugging Face Transformers for translation.

Detect piracy keywords using NLTK or spaCy.

Deliverables:

Python script for text translation and keyword detection.

Documentation detailing multi-language support.

Instructions to Candidate

Documentation: Provide step-by-step documentation, including the tools, libraries, and resources used.

Open-Source Only: Use free libraries and APIs for implementation.

Focus Areas: Highlight Full Stack Development, AI integration, and Media & Entertainment relevance in solutions.

Multi-Language Text Detection System

Introduction

This repository contains the implementation of the Multi-Language Text Detection System, which detects piracy-related keywords in non-English text. It first translates the input into English with a translation model, then applies natural language processing to match the translated text against a keyword list. Key features of this project include:

Translating non-English text to English using Hugging Face Transformers.
Identifying piracy-related keywords using spaCy.

This project has applications in industries such as media, entertainment, and cybersecurity for combating piracy effectively.
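The keyword step can be sketched as follows. The README does not show the repository's keyword list or matching logic, so `PIRACY_KEYWORDS`, `tokenize`, and `find_keywords` are hypothetical names, and the list contents are illustrative only. The sketch swaps in a plain regular-expression tokenizer in place of spaCy so that it runs without extra dependencies; it is a stand-in, not the project's implementation.

```python
import re

# Hypothetical keyword list; the repository's actual list is not shown in the README.
PIRACY_KEYWORDS = ("torrent", "pirated", "cracked", "free download", "full movie")


def tokenize(text):
    """Lowercase the text and split it into word tokens (stand-in for a spaCy tokenizer)."""
    return re.findall(r"\w+", text.lower())


def find_keywords(tokens, keywords=PIRACY_KEYWORDS):
    """Return the keywords whose token sequence occurs in `tokens`, in keyword-list order."""
    found = []
    for keyword in keywords:
        parts = keyword.split()
        n = len(parts)
        # Slide a window of the keyword's length across the token list.
        if any(tokens[i:i + n] == parts for i in range(len(tokens) - n + 1)):
            found.append(keyword)
    return found
```

With spaCy, the token list would presumably come from a loaded pipeline instead, for example `[t.lower_ for t in nlp(text)]`. Matching on tokens rather than raw substrings keeps a keyword from being reported when it only appears inside a longer, unrelated word.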

Requirements

Python Version

Python 3.7+

Python Packages

Flask
transformers
torch
langdetect
spacy

Additional Tools (optional)

Hugging Face translation models: Used to translate the input text into English.
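A minimal translation sketch, assuming the `Helsinki-NLP/opus-mt-<lang>-en` MarianMT checkpoints from the Hugging Face Hub. The README does not say which model the repository loads, so the checkpoint names, the set of source languages, and all function names here are assumptions. These checkpoints typically also need the `sentencepiece` package, which is not in the package list above.

```python
from functools import lru_cache

# Assumed checkpoint naming scheme; the repository's actual model is not stated.
MODEL_TEMPLATE = "Helsinki-NLP/opus-mt-{src}-en"
FALLBACK_MODEL = "Helsinki-NLP/opus-mt-mul-en"  # many-to-English checkpoint
KNOWN_SOURCES = {"ar", "es", "fr", "hi", "ru"}  # illustrative subset


def model_name_for(lang_code):
    """Pick a translation checkpoint for a two-letter source language code."""
    if lang_code in KNOWN_SOURCES:
        return MODEL_TEMPLATE.format(src=lang_code)
    return FALLBACK_MODEL


@lru_cache(maxsize=4)
def load_translator(model_name):
    """Load tokenizer and model once per checkpoint (downloads weights on first use)."""
    # Imported lazily because transformers and torch are heavy dependencies.
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(model_name)
    model = MarianMTModel.from_pretrained(model_name)
    return tokenizer, model


def translate_to_english(text, lang_code):
    """Translate `text` from `lang_code` into English."""
    tokenizer, model = load_translator(model_name_for(lang_code))
    batch = tokenizer([text], return_tensors="pt", padding=True, truncation=True)
    generated = model.generate(**batch)
    return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]
```

Caching the loaded model avoids reloading the weights on every request, which matters in a web application where the same checkpoint is reused across calls.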

Install the Python packages listed above using the following command:

pip install -r requirements.txt

Project Structure

.
├── app.py              # Main Flask application
├── text_process.py     # Text processing functions (translation and keyword detection)
├── templates/
│   └── index.html      # Webpage template
└── requirements.txt    # Python dependencies

How to Use

Install dependencies:

pip install -r requirements.txt

Run the Flask application:

python app.py

Open your browser and navigate to:

http://127.0.0.1:5000/

Enter non-English text, click "Detect", and view the results.
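What happens behind the "Detect" button can be sketched roughly as below. The function name `analyze_text`, its callable parameters, and the shape of the returned dictionary are all hypothetical; the README does not document the interface of `text_process.py`. The three steps are passed in as callables so the flow can be exercised without downloading any model.

```python
def analyze_text(text, detect_language, translate, find_keywords):
    """Run the detection flow with caller-supplied steps (hypothetical interfaces).

    detect_language(text) -> language code such as "en" or "es"
    translate(text, lang) -> English translation of `text`
    find_keywords(english_text) -> list of matched piracy keywords
    """
    lang = detect_language(text)
    # English input is matched as-is; anything else is translated first.
    english = text if lang == "en" else translate(text, lang)
    keywords = find_keywords(english)
    return {
        "language": lang,
        "translated_text": english,
        "keywords": keywords,
        "is_piracy_related": bool(keywords),
    }
```

In the application, `detect_language` could plausibly be `langdetect.detect`, which returns a language code string, and the resulting dictionary could be handed to the `index.html` template for display.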

Future Enhancements

Extend keyword detection to include more context-aware piracy identifiers.
Add support for detecting text from uploaded files (e.g., PDFs, images with OCR).
Detect keywords directly in additional languages, without translating to English first.
