Alibek Zhakubayev

Alibek Zhakubayev

Research Scientist at Meta ยท PhD in Computer Science

About

I am a Research Scientist on the FAISS Core Team at Meta, where I work on FAISS, one of the most widely used open-source libraries for vector similarity search.

I earned my PhD in Computer Science from Baylor University in 2025, advised by Dr. Greg Hamerly. My dissertation focused on accelerating the k-means clustering algorithm through probabilistic filtering, geometric heuristics, and dimensionality reduction. I was also part of Dr. Benton's bioinformatics lab, applying machine learning to study the effects of alcohol on skeletal health.

I hold my undergraduate and master's degrees from Nazarbayev University in Kazakhstan, where my master's thesis with Dr. Adnan Yazici focused on real-time activity recognition.

Outside of work, I love playing chess (1900+ rated in blitz and rapid on Lichess) and am a big football and basketball fan, supporting Liverpool FC, the New York Knicks, and Baylor Bears. I'm also a dedicated Fantasy Premier League manager with 4 top 50k finishes.

Education

2020 – 2025

PhD in Computer Science

Baylor University · Waco, TX

2018 – 2020

Master of Science in Computer Science

Nazarbayev University · Astana, Kazakhstan

2014 – 2018

Bachelor of Science in Computer Science

Nazarbayev University · Astana, Kazakhstan

Experience

Jun 2025 – Present

Research Scientist, Vector Search (FAISS Core Team)

Meta · Menlo Park, CA

Maintainer of FAISS, one of the most widely used open-source libraries for vector similarity search. Implementing and optimizing search algorithms and maintaining core library functionality.

Jan – Dec 2024

Adjunct Lecturer

Baylor University · Waco, TX

Taught Database Design and Applications for Data Science, and designed and taught Data Visualization, emphasizing data storytelling using Python.

Aug 2021 – Jun 2025

Research Assistant

Baylor University · Waco, TX

Doctoral research on accelerating k-means clustering (advised by Dr. Greg Hamerly) and bioinformatics research applying machine learning to study effects of alcohol on bone health.

Aug 2017 – May 2020

Research Assistant

Nazarbayev University · Astana, Kazakhstan

Master's thesis on real-time activity recognition using deep learning and transfer learning, classifying 14 daily activities from 80,000+ multimedia records with 88.6% accuracy.

Updates

Nov 2025

Published in Bone

Study on ethanol consumption effects on cancellous bone architecture in non-human primates published in Bone.

Jun 2025

Joined Meta

Started as a Research Scientist on the FAISS Core Team, working on vector similarity search.

May 2025

PhD Completed

Defended my doctoral dissertation "Accelerating k-means: Novel Strategies for Algorithm Improvement" at Baylor University.

Oct 2024

Two Papers at IEEE DSAA 2024

Presented "Beta k-means" and "Using Annealing to Accelerate Triangle Inequality k-means" at the IEEE International Conference on Data Science and Advanced Analytics.

Jun 2024

Published in Scientific Reports

Study on ethanol consumption effects on bone turnover markers in non-human primates published in Scientific Reports.

Publications

10 scholarly works · 82 citations · h-index: 4 · Google Scholar

Accelerating k-means: Novel Strategies for Algorithm Improvement

Alibek Zhakubayev

Doctoral Dissertation, Baylor University, 2025

Ethanol Consumption Has Minimal Effects on Cancellous Bone Architecture in Femur and Lumbar Vertebra in Two Species of Non-Human Primates

Alibek Zhakubayev, Kathleen A Grant, Lara H Sattgast, Russell T Turner, Urszula T Iwaniec, Mary Lauren Benton

Bone, 2025

Using Annealing to Accelerate Triangle Inequality k-means

Alibek Zhakubayev, Greg Hamerly

The 11th IEEE International Conference on Data Science and Advanced Analytics (DSAA), 2024

Beta k-means: Accelerating k-means Using Probabilistic Cluster Filtering

Alibek Zhakubayev, Greg Hamerly

The 11th IEEE International Conference on Data Science and Advanced Analytics (DSAA), 2024

Ethanol consumption in non-human primates alters plasma markers of bone turnover but not tibia architecture

Alibek Zhakubayev, Lara Sattgast, Anne Lewis, Kathleen Grant, Russell Turner, Urszula Iwaniec & Mary Lauren Benton

Scientific Reports, 2024

Legal Natural Language Processing from 2015 to 2022: A Comprehensive Systematic Mapping Study of Advances and Applications

Ernesto Quevedo, Tomas Cerny, Alejandro Rodriguez, Pablo Rivas, Jorge Yero, Korn Sooksatra, Alibek Zhakubayev, Davide Taibi

IEEE Access, 2023

Image processing approach provides robust feature extraction for classification with small sample sizes

Alibek Zhakubayev, Thomas Andersen, Annie Vesterby, Lene Warner Thorup Boel, Kathleen Grant, Urszula Iwaniec, Russell Turner, Erich Baker, Mary Lauren Benton

The 7th International Conference on Information System and Data Mining (ICISDM), 2023

Clustering Faster and Better with Projected Data

Alibek Zhakubayev, Greg Hamerly

The 6th International Conference on Information System and Data Mining (ICISDM), 2022

Quantum Machine Learning: A Case Study of Grover's Algorithm

Bikram Khanal, Pablo Rivas, Javier Orduz, Alibek Zhakubayev

International Conference on Computational Science and Computational Intelligence (CSCI), 2021

Learning the Relationship between Asthma and Meteorological Events by Using Machine Learning Methods

Alibek Zhakubayev, Adnan Yazici

The 13th International Conference on Application of Information and Communication Technologies (AICT), 2019