osint.scylla is a data management tool inspired by platforms like IntelX and Snusbase. It is built to import and search billions of records, using ScyllaDB as its storage backend.
| 🚀 Performance | 🔄 Processing | 💾 Storage |
|---|---|---|
| Multi-threading | CSV/TXT parsing | ScyllaDB backend |
| Async I/O | Batch operations | Memory optimization |
| Parallel processing | Progress tracking | Efficient indexing |
# Clone the repository
git clone https://github.com/ZeraTS/osint.scylla
cd osint.scylla
# Set up virtual environment
python -m venv venv
.\venv\Scripts\activate # Windows
source venv/bin/activate # Linux/Mac
# Install dependencies
pip install -r requirements.txt
🔧 Setup & Installation
How do I install ScyllaDB?
- Download ScyllaDB from the official website
- Follow OS-specific installation instructions
- Verify installation:
scylla --version
- Start service:
sudo systemctl start scylla-server
What are the system requirements?
- Python 3.8 or higher
- ScyllaDB 5.1+
- Minimum 4GB RAM
- SSD storage recommended
- Windows, Linux, and macOS supported
How do I troubleshoot connection issues?
- Verify ScyllaDB is running:
nodetool status
- Check that the default CQL port (9042) is open
- Ensure correct host/port in config
- Check firewall settings
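
The port and firewall checks above can be approximated from Python before touching any database driver. This is a minimal sketch, not part of osint.scylla: `can_connect` is a hypothetical helper, and the host `127.0.0.1` is an assumption that should be replaced with the value from your config. It only confirms that something accepts TCP connections on the port, not that it is a healthy ScyllaDB node.

```python
import socket


def can_connect(host: str, port: int = 9042, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    host = "127.0.0.1"  # assumed host; use the one from your config
    status = "reachable" if can_connect(host) else "NOT reachable"
    print(f"{host}:9042 is {status}")
```

If the port is not reachable from the client machine but `nodetool status` reports the node as up on the server, the firewall or the configured listen address is the more likely culprit.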
📊 Data Management
What file formats are supported?
- CSV files (*.csv)
- Text files (*.txt)
- JSON-formatted text files
- Line-delimited data
How large can my files be?
- Recommended: <1GB per batch
- Maximum: Unlimited (chunked processing)
- Memory usage is optimized
- Large files auto-partitioned
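
To illustrate what chunked processing means in practice, here is a rough sketch of reading a large line-delimited file in fixed-size batches so that only one batch is held in memory at a time. `iter_batches` and the default batch size are assumptions made for illustration; the project's actual chunking logic may differ.

```python
from itertools import islice
from typing import Iterable, Iterator, List


def iter_batches(lines: Iterable[str], batch_size: int = 10_000) -> Iterator[List[str]]:
    """Yield lists of at most batch_size non-empty lines, newlines stripped."""
    stripped = (line.rstrip("\r\n") for line in lines)
    non_empty = (line for line in stripped if line)
    while True:
        batch = list(islice(non_empty, batch_size))
        if not batch:
            return
        yield batch


# Usage with a real file (the path is a placeholder):
# with open("records.txt", encoding="utf-8", errors="replace") as fh:
#     for batch in iter_batches(fh):
#         ...  # hand the batch to the importer
```

Because the file object is consumed lazily, memory use depends on `batch_size` rather than on the size of the file.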
How do I optimize import speed?
- Use SSD storage
- Increase batch size
- Enable parallel processing
- Pre-format your data
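
As one possible reading of "increase batch size" and "enable parallel processing", the sketch below fans batches out to a thread pool. It is not the project's importer: `import_in_parallel` is a hypothetical helper, and `insert_batch` stands in for whatever function writes one batch to ScyllaDB and returns the number of rows written.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List


def import_in_parallel(
    batches: Iterable[List[str]],
    insert_batch: Callable[[List[str]], int],
    max_workers: int = 4,
) -> int:
    """Run insert_batch over every batch in a thread pool.

    Returns the sum of the row counts reported by insert_batch.
    Note: Executor.map submits all batches up front, so for very large
    inputs the number of in-flight batches should be bounded separately.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return sum(pool.map(insert_batch, batches))


# Example with a stand-in writer that only counts rows:
# total = import_in_parallel([["a", "b"], ["c"]], insert_batch=len)
```

Threads suit this workload when the time is dominated by waiting on the database; if parsing is the bottleneck, a process pool may be the better fit.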
🔍 Search Operations
How do I perform searches?
Use the format field:value
Examples:
email:user@domain.com
username:john_doe
phone:1234567890
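
A minimal sketch of how a query in this format could be split into its two parts, assuming the first colon separates the field from the value. `parse_query` is a hypothetical helper written for illustration and is not taken from the osint.scylla code.

```python
from typing import Tuple


def parse_query(query: str) -> Tuple[str, str]:
    """Split 'field:value' at the first colon and return (field, value)."""
    field, sep, value = query.partition(":")
    field, value = field.strip(), value.strip()
    if not sep or not field or not value:
        raise ValueError(f"expected 'field:value', got {query!r}")
    return field, value


# parse_query("email:user@domain.com") returns ("email", "user@domain.com")
```

Splitting only at the first colon keeps values that themselves contain a colon intact.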
What fields can I search?
Primary fields:
- username
- first_name
- last_name
- phone_number
- city
- state
Are searches case-sensitive?
- Email: Case-sensitive
- Username: Case-insensitive
- Names: Case-insensitive
- Other fields: Case-insensitive
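
One common way to get the behavior listed above is to case-fold values for every field except email, both when importing and when searching. The sketch below assumes that approach; the source does not say how osint.scylla implements it, and `normalize_value` is a hypothetical name.

```python
# Fields whose values are compared exactly, per the rules listed above.
CASE_SENSITIVE_FIELDS = {"email"}


def normalize_value(field: str, value: str) -> str:
    """Return the value as it would be compared for the given field."""
    if field.lower() in CASE_SENSITIVE_FIELDS:
        return value  # keep the original casing
    return value.casefold()  # case-insensitive fields are folded


# normalize_value("username", "John_Doe") returns "john_doe"
# normalize_value("email", "User@Domain.com") returns "User@Domain.com"
```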
⚡ Performance
How do I handle large datasets?
- Enable chunked processing
- Use batch operations
- Implement proper indexing
- Monitor memory usage
How do I improve search speed?
- Create custom indexes
- Use specific field searches
- Optimize query patterns
- Configure consistency levels
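
For the "create custom indexes" point, a secondary index in ScyllaDB is created with a CQL `CREATE INDEX` statement. The sketch below only builds that statement as a string; the keyspace `osint` and table `records` in the usage comment are placeholders, not names confirmed by the project, and `create_index_cql` is a hypothetical helper.

```python
import re

# Unquoted CQL identifiers: a letter followed by letters, digits, or underscores.
_IDENTIFIER = re.compile(r"^[A-Za-z][A-Za-z0-9_]*$")


def create_index_cql(keyspace: str, table: str, column: str) -> str:
    """Build a CREATE INDEX statement for one column of keyspace.table."""
    for name in (keyspace, table, column):
        if not _IDENTIFIER.match(name):
            raise ValueError(f"not a valid unquoted CQL identifier: {name!r}")
    index_name = f"{table}_{column}_idx"
    return (
        f"CREATE INDEX IF NOT EXISTS {index_name} "
        f"ON {keyspace}.{table} ({column});"
    )


# Placeholder names; run the resulting statement in cqlsh or through a driver:
# print(create_index_cql("osint", "records", "username"))
```

Validating identifiers before formatting them into the statement avoids building malformed or injected CQL, since identifiers cannot be passed as bound parameters.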
What are the best practices for scaling?
- Use SSD storage
- Configure proper memory allocation
- Enable compression
- Regular maintenance
🛡️ Security & Backup
How secure is the data?
- Transport encryption (TLS)
- Authentication required
- Role-based access
- Audit logging available
How do I back up data?
- Use ScyllaDB snapshots
- Configure regular backups
- Export data periodically
- Maintain backup strategy
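
Snapshots are taken per node with `nodetool snapshot`. As a rough sketch of how this could be scripted for regular backups, the helper below builds the command with a timestamped tag and runs it. The function names and the tag format are assumptions, and the keyspace must be replaced with the one your deployment actually uses.

```python
import subprocess
from datetime import datetime, timezone
from typing import List, Optional


def snapshot_command(keyspace: str, tag: Optional[str] = None) -> List[str]:
    """Build the nodetool command that snapshots one keyspace."""
    if tag is None:
        # Assumed naming scheme: UTC timestamp, e.g. backup_20240101T120000Z
        tag = datetime.now(timezone.utc).strftime("backup_%Y%m%dT%H%M%SZ")
    return ["nodetool", "snapshot", "-t", tag, keyspace]


def take_snapshot(keyspace: str, tag: Optional[str] = None) -> None:
    """Run the snapshot on the local node; raises if nodetool fails."""
    subprocess.run(snapshot_command(keyspace, tag), check=True)
```

A snapshot lives on the same disk as the data it protects, so a complete backup strategy also copies the snapshot files off the node.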
How do I manage permissions?
- Create user roles
- Set access levels
- Configure authentication
- Monitor access logs