Wednesday 16 April 2025
Scientists have long been trying to crack the code of relevance in job search engines, and a recent breakthrough has shed new light on how to make these searches more effective. The challenge lies in creating an algorithm that can accurately identify the most relevant job listings for a user’s query.
The team behind this innovation took a unique approach by freezing their job corpus and executing queries in low-inventory locations to capture a representative mix of high- and low-quality search results. These query-job pairs were then labeled by human annotators using a custom rubric designed to reflect relevance and user satisfaction.
To fine-tune the new retrieval algorithm, the team employed Bayesian optimization, a machine learning technique that allows for efficient exploration of complex parameter spaces. By iteratively evaluating the performance of different boost values and BM25 parameters, they were able to identify the optimal settings that maximized NDCG@5, a widely-used metric for ranking evaluation.
The new algorithm was tested online against the existing search engine, and the results were striking. The team saw a statistically significant increase in both right pane impression set CTR (3.67% to 10.68%) and apply starts per user (0.45% to 13.66%). This means that users who interacted with the new search engine were more likely to engage with job listings and ultimately apply for positions.
The key to this success lies in the algorithm’s ability to balance recall, precision, and relevance. By incorporating a novel approach to lexical scoring and leveraging OpenSearch’s query DSL, the team was able to create a system that not only matched the recall of the existing engine but also improved its overall performance.
One of the most significant implications of this research is its potential to improve the user experience for job seekers. As the search engine continues to evolve, it may be possible to incorporate more advanced techniques such as transformer models and dense vector search technology to further enhance relevance.
The team’s approach to collecting data and evaluating algorithmic performance has also shed new light on how to measure improvements in recall. By labeling query-job pairs using a custom rubric, they were able to capture a nuanced understanding of user satisfaction that traditional metrics may not have been able to capture.
As the search engine continues to evolve, it will be important for researchers and developers to continue exploring innovative approaches to relevance ranking. With this breakthrough, we are one step closer to creating job search engines that truly understand what users are looking for.
Cite this article: “Relevance Rebooted: A Bayesian Optimization Approach to Search Engine Migration”, The Science Archive, 2025.
Job Search, Relevance Ranking, Algorithmic Performance, Recall Precision, Lexical Scoring, Opensearch, Query Dsl, Transformer Models, Dense Vector Search, User Satisfaction







