Hello! I'm Andrei 👋
Machine Learning and AI leader with primary expertise in Natural Language Processing (NLP) and Information Retrieval (IR) driving business value through leveraging ML, GenAI and agents. Leading a high-impact team delivering semantic search, GenAI, NLP, RAG and document understanding systems across 4+ products used by 900K+ professionals at 7.5K+ companies in aerospace, energy, AEC and other industries.
My articles about Information Retrieval:
- Deep Learning in Information Retrieval. Part I: Introduction and Sparse Retrieval
- Deep Learning in Information Retrieval. Part II: Dense Retrieval
- Deep Learning in Information Retrieval. Part III: Ranking
Open Projects:
- Retrivex -- RETRIeval EXplainability Library. Explainability for retrieval models, vector search, embeddings similarity, siamese encoders, dense retrieval.
- Awesome-feature-engineering -- A structured list of resources dedicated to feature engineering techniques for machine learning.
- BDM Tool -- Simple lightweight dataset versioning utility based purely on the file system and symbolic links.
Scientific paper on IR: Quality and cost trade-offs in passage re-ranking task (TREC 2021)
You can find me on:
- Medium:@aikho
- X:@AI_Kho_
- GitHub:aikho (You are here)

