The Science that powers fair, skills-first hiring

Intelligent automation and science-backed assessments work together to elevate the pre-hire process with speed, fairness, and consistency. Our models are designed and continuously validated by world-class scientists to ensure decisions that are reliable, transparent, and equitable.

Meet the people behind the Science

Dr. Aiden Loe

Head of Science

He leads Maki’s scientific vision, ensuring that every AI-driven assessment is grounded in psychometric rigor, ethical AI principles, and empirical validation.

Bridging psychology, data science, and ethical AI to make hiring both scientific and human

As Head of Science, Aiden oversees the research and development of Maki’s AI assessment models, guiding the intersection of computational psychology, machine learning, and human behavior. His research has been published in leading journals and proceedings, including Nature Scientific Reports, The Lancet (Public Health, Psychiatry), Psychological Medicine, AAAI, IJCAI, and Philosophy and Theory of Artificial Intelligence. With 50+ peer-reviewed papers and 5,000+ citations, his work bridges the gap between academic excellence and real-world application in HR technology.

Former roles

Lead Psychometrician (Cambridge Psychometrics Centre), Visiting Lecturer (University of Cambridge), Academic Collaborator (University of Oxford)

Education

PhD in Psychology (University of Cambridge); MSc in Psychometrics & Social Psychology

Awards

UK Psychometrics Forum Excellence in Psychometrics Award (2017); John B. Carroll Award (2016)

Team members

Behind every assessment, model, and validation loop stands Maki’s dedicated Science Team: together, they design assessments, ensure psychometric rigor, and monitor fairness and validity across every new product release.

The rationale behind Maki’s approach

  • Recent research shows that the most predictive and equitable selection methods are job-specific and behaviorally grounded, such as structured interviews and situational judgment tests

  • Personality traits alone provide limited predictive validity for overall job performance outcomes

  • Composite behavioral measures achieve similar validity to standalone ability tests, with much smaller group differences

  • Real responses to realistic challenges outperform abstract reasoning or trait self-reports

That’s why Maki builds AI-scored, psychometrically robust assessments grounded in real behavior and data, to support hiring decisions that are both more predictive and fairer.

Skills-first hiring, made immersive

  • We combine human expertise and LLMs to create and evaluate assessments for 300+ skills

  • We assess soft and behavioral skills, cognitive aptitude, coding, job-specific knowledge, and written & spoken language proficiency

  • Our assessments span multiple formats, including immersive tasks, conversational interactions, situational judgment tests (SJTs), free-text responses (written or spoken), and audio/video questions

  • We support practical “knock-out” questions such as availability, location, and salary expectations
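
To make the knock-out step concrete, here is a minimal sketch of how such screening logic can work. The Candidate fields, thresholds, and function names are illustrative assumptions, not Maki’s actual API.

```python
# Minimal knock-out screening sketch; all names and criteria are hypothetical.
from dataclasses import dataclass

@dataclass
class Candidate:
    available_in_weeks: int   # weeks until the candidate can start
    location: str
    salary_expectation: int   # annual, in the posting's currency

def passes_knockouts(c: Candidate, *, max_wait_weeks: int,
                     allowed_locations: set[str], salary_cap: int) -> bool:
    """Return True only if every practical requirement is met."""
    return (c.available_in_weeks <= max_wait_weeks
            and c.location in allowed_locations
            and c.salary_expectation <= salary_cap)

# Example: can start in 2 weeks, in an allowed city, within budget -> True
print(passes_knockouts(Candidate(2, "Paris", 55_000),
                       max_wait_weeks=4,
                       allowed_locations={"Paris", "Lyon"},
                       salary_cap=60_000))
```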

A 12-step science-backed process

1

Define skills to assess

Identify the specific skills needed for the role

2

Conduct literature review

Explore research to confirm the best ways to measure skills

3

AI-based item generation

Produce initial test questions aligned with skill definitions

4

Refine and perfect items

Improve questions for clarity, accuracy, and fair measurement

5

Client review

Validate content with client input to ensure relevance

6

Pilot testing

Test questions with real candidates to check performance

7

Standardization

Validate items on a larger sample for reliability (see the item-analysis sketch after these steps)

8

Documentation & manual creation

Compile methods and criteria into a detailed assessment guide

9

Immersive test creation

Build engaging scenarios that reflect realistic job situations

10

Full-scale implementation

Release the assessment and monitor early real-world results

11

Ongoing review and updates

Continuously refine items to maintain accuracy and fairness

12

Develop clones of validated items

Create reliable variations to reduce exposure and cheating
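
To illustrate the kind of analysis behind steps 6 and 7, here is a minimal item-analysis sketch: Cronbach’s alpha for internal consistency, plus corrected item-total correlations to flag weak items. The data are simulated and the 0.3 cutoff is a common convention, not Maki’s production threshold.

```python
# Illustrative pilot-testing item analysis on simulated data.
import numpy as np

def cronbach_alpha(scores: np.ndarray) -> float:
    """scores: (n_candidates, n_items) matrix of item scores."""
    k = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1)
    total_var = scores.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars.sum() / total_var)

def item_total_correlations(scores: np.ndarray) -> np.ndarray:
    """Correlation of each item with the total of the remaining items."""
    totals = scores.sum(axis=1)
    return np.array([
        np.corrcoef(scores[:, i], totals - scores[:, i])[0, 1]
        for i in range(scores.shape[1])
    ])

rng = np.random.default_rng(0)
ability = rng.normal(size=500)
# 10 items loading on a single ability factor, plus noise
scores = ability[:, None] + rng.normal(scale=1.0, size=(500, 10))

print(f"alpha = {cronbach_alpha(scores):.2f}")
print("items below 0.3 item-total correlation:",
      np.where(item_total_correlations(scores) < 0.3)[0])
```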

Scoring like human experts: audited, validated, and at global scale

Speech accuracy

Benchmarked using WER, CER, BLEU, and multilingual metrics for human-level transcription (a WER sketch follows this section)

Language proficiency

Written & spoken proficiency in 20+ languages, aligned to IELTS/CEFR and calibrated with psychometric models

Pronunciation scoring

Fluency, rhythm, completeness, and clarity evaluated via advanced neural models
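
As one concrete example of the metrics above, here is a minimal word error rate (WER) sketch: Levenshtein edit distance over word tokens, normalized by reference length. It illustrates the metric itself, not Maki’s benchmarking pipeline.

```python
# Word error rate via word-level Levenshtein distance.
def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edits to turn the first i reference words into the first j hypothesis words
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = dp[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            dp[i][j] = min(sub, dp[i - 1][j] + 1, dp[i][j - 1] + 1)  # sub / del / ins
    return dp[len(ref)][len(hyp)] / len(ref)

print(wer("the cat sat on the mat", "the cat sit on mat"))  # 2 edits / 6 words ≈ 0.33
```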

We are relentless about raising the quality of our assessments

Our commitment to science does not stop at deployment. Maki’s in-house Science team continuously collects and analyzes performance data to ensure assessments remain accurate, fair, and predictive over time.

We run ongoing psychometric analyses to strengthen reliability, validity, and relevance, including checks on internal consistency, content coverage, fairness metrics, and predictive performance.

This continuous work helps us refine each assessment so it reflects real job success, reduces unwanted bias, and supports confident decisions at scale.

Consistent, transparent, and legally defensible

  • Structured open-ended answers are scored with BARS (Behaviorally Anchored Rating Scales) for each competency

  • Ensures consistent, objective, transparent scoring

  • Complies with employment-law standards and regulatory guidance (UGESP, EEOC, Equality Act)

  • AI-scored open-ended items are benchmarked against human raters with ICC, ANOVA, Bland–Altman, and Welch methods
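
To make the ICC benchmark concrete, here is a minimal sketch of ICC(2,1), the two-way random-effects intraclass correlation for a single rater, applied to an illustrative AI-vs-human rating matrix. The data and scale are invented.

```python
# ICC(2,1) from the two-way ANOVA mean squares (Shrout & Fleiss).
import numpy as np

def icc_2_1(ratings: np.ndarray) -> float:
    """ratings: (n_subjects, k_raters). Two-way random effects, single rater."""
    n, k = ratings.shape
    grand = ratings.mean()
    row_means = ratings.mean(axis=1)
    col_means = ratings.mean(axis=0)
    ssr = k * ((row_means - grand) ** 2).sum()         # between subjects
    ssc = n * ((col_means - grand) ** 2).sum()         # between raters
    sse = ((ratings - grand) ** 2).sum() - ssr - ssc   # residual
    msr = ssr / (n - 1)
    msc = ssc / (k - 1)
    mse = sse / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Columns: AI score, human rater 1, human rater 2 (toy data on a 1-5 rubric)
ratings = np.array([[4, 4, 5], [2, 2, 2], [5, 4, 5], [3, 3, 4], [1, 2, 1]])
print(f"ICC(2,1) = {icc_2_1(ratings):.2f}")
```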

Adding new languages in weeks, not months

Double-blind translations

Double-blind translations by certified agencies

Advanced review

Native speaker review & psychometric consistency checks

Fast process

2–4 weeks to add a new language

Multilingual

Already available in 40+ languages

Designed for inclusive hiring from day one

  • Job-relevant, objective, culturally neutral assessments

  • Inclusive design: clear instructions, practice opportunities, multiple languages, support for non-native and neurodiverse candidates

  • Rigorous fairness checks: DIF (item bias) and Adverse Impact (80% rule) at build time and on an ongoing basis (see the sketch after this list)

  • Easy accommodations: flexible timing, alternative formats, simple language, supportive feedback, no proof required
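
The 80% rule check mentioned above reduces to a simple computation: each group’s selection rate is divided by the highest group’s rate, and any ratio below 0.8 flags potential adverse impact. A minimal sketch, with invented counts:

```python
# Four-fifths (80%) rule: impact ratio of each group vs. the highest-rate group.
def adverse_impact_ratios(selected: dict[str, int], applied: dict[str, int]) -> dict[str, float]:
    rates = {g: selected[g] / applied[g] for g in applied}
    top = max(rates.values())
    return {g: rate / top for g, rate in rates.items()}

ratios = adverse_impact_ratios(
    selected={"group_a": 48, "group_b": 30},
    applied={"group_a": 100, "group_b": 80},
)
for group, ratio in ratios.items():
    flag = "FLAG" if ratio < 0.8 else "ok"
    print(f"{group}: impact ratio = {ratio:.2f} ({flag})")
# group_a: 1.00 (ok); group_b: 0.78 (FLAG)
```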

Combining psychometrics & machine-learning evaluations

  • Every assessment is validated using industry-leading methods, including IRT modeling, reliability checks, and predictive validity studies

  • AI scoring is benchmarked against expert human ratings to ensure accuracy, fairness, and real-world alignment

  • Continuous monitoring uses ML techniques such as drift detection, bias analysis, and performance audits to maintain scoring that is scientifically robust and trustworthy
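
As one illustration of drift detection, here is a sketch using the Population Stability Index (our choice of example; the source does not specify Maki’s exact technique). It compares a baseline score distribution against recent production scores, bucket by bucket.

```python
# Population Stability Index on simulated assessment scores.
import numpy as np

def psi(baseline: np.ndarray, recent: np.ndarray, bins: int = 10) -> float:
    edges = np.quantile(baseline, np.linspace(0, 1, bins + 1))
    eps = 1e-6  # avoid log(0) on empty buckets
    p = np.histogram(baseline, edges)[0] / len(baseline) + eps
    # clip recent scores into the baseline range so none fall outside the edges
    q = np.histogram(np.clip(recent, edges[0], edges[-1]), edges)[0] / len(recent) + eps
    return float(((p - q) * np.log(p / q)).sum())

rng = np.random.default_rng(1)
baseline = rng.normal(50, 10, 5_000)   # scores at validation time
recent = rng.normal(53, 10, 1_000)     # a shifted production sample
value = psi(baseline, recent)
print(f"PSI = {value:.3f} ->", "investigate" if value > 0.1 else "stable")  # 0.1 is a common threshold
```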

Always-on governance

We evaluate LLMs with a 10-step loop

1

Define objectives

Clarify what we need to measure

2

Generate data

Produce diverse, realistic example responses

3

Select & calibrate LLMs

Test models and choose the best fit

4

Prompt development

Create prompts that elicit assessable responses

5

Scoring logic design

Set clear rules to score responses consistently

6

Human & fairness evaluation

Compare model outputs with expert ratings (see the agreement sketch after these steps)

7

Deployment

Integrate the model and scoring into the platform

8

Fine-tuning

Improve prompts and scoring based on results

9

Feedback loop

Gather feedback to refine prompts and scoring

10

Continuous monitoring

Track performance and update the model over time
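
One standard way to quantify step 6, agreement between LLM scores and expert ratings, is quadratic weighted kappa on an ordinal rubric scale. A minimal sketch with invented ratings; the metric choice is ours, for illustration:

```python
# Quadratic weighted kappa between two ordinal rating vectors.
import numpy as np

def quadratic_weighted_kappa(a: np.ndarray, b: np.ndarray, n_levels: int) -> float:
    observed = np.zeros((n_levels, n_levels))
    for i, j in zip(a, b):
        observed[i, j] += 1
    # expected counts under independence of the two raters
    expected = np.outer(np.bincount(a, minlength=n_levels),
                        np.bincount(b, minlength=n_levels)) / len(a)
    levels = np.arange(n_levels)
    weights = (levels[:, None] - levels[None, :]) ** 2 / (n_levels - 1) ** 2
    return 1 - (weights * observed).sum() / (weights * expected).sum()

human = np.array([0, 1, 2, 3, 3, 2, 1, 0, 2, 3])
model = np.array([0, 1, 2, 3, 2, 2, 1, 1, 2, 3])
print(f"QWK = {quadratic_weighted_kappa(human, model, n_levels=4):.2f}")
```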

Transparency supports better decision-making

Hiring decisions carry real consequences. That’s why we design our systems to make scoring criteria, model behavior, and evaluation steps as clear and reviewable as possible, helping organizations understand how results are produced.

Structured, rubric-based scoring using LLMs

Our AI agents follow clearly defined scoring criteria. Every result is grounded in observable features and rubric-based indicators, allowing stakeholders to understand how scores are derived without relying on black-box reasoning.
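
As a hypothetical illustration of what “rubric-based indicators” can look like in practice, here is a BARS-style structure; the competency, anchors, and indicator wording are invented, not Maki’s actual rubrics.

```python
# Hypothetical BARS-style rubric: each numeric level maps to a behavioral anchor.
rubric = {
    "competency": "Stakeholder communication",
    "levels": {
        1: "Response ignores the stakeholder's stated concern",
        2: "Acknowledges the concern but proposes no concrete next step",
        3: "Addresses the concern and proposes one actionable next step",
        4: "Addresses the concern, proposes next steps, and sets expectations",
    },
    "observable_indicators": [
        "restates the concern in own words",
        "names a specific action and owner",
        "gives a timeline or follow-up point",
    ],
}

def describe_score(score: int) -> str:
    """Map a numeric score back to its behavioral anchor for transparency."""
    return f"{score}: {rubric['levels'][score]}"

print(describe_score(3))
```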

Scientific rigor, not shortcuts

Developed by psychometricians and validated by independent reviewers, our models comply with international standards for reliability and fairness. We measure validity, bias, and impact, then continuously monitor for drift.

Compliant by default

Fully aligned with the EU AI Act, GDPR, and NYC Local Law 144, our systems are audited to meet the highest global standards for ethical AI in hiring. Compliance isn’t a checkbox; it’s a competitive advantage.

Externally certified, enterprise-ready

Maki’s infrastructure is ISO 27001 certified, built on Google Cloud Platform with AES-256 encryption, TLS-secured data transmission, and redundant storage in Belgium, France, and Germany.

Privacy-first by architecture

Candidates own their data. Maki enforces data minimization and automated deletion policies to honor the right to be forgotten under GDPR.

Built-in fairness monitoring

Every assessment undergoes Differential Item Functioning (DIF) and Adverse Impact checks before launch, and remains under continuous observation to ensure equitable outcomes for all groups.
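
To make the DIF check concrete, here is a sketch of the Mantel-Haenszel procedure, a standard DIF technique: candidates are stratified by total-score band, and a common odds ratio compares item success for reference versus focal groups within each stratum. The data are simulated; Maki’s exact implementation may differ.

```python
# Mantel-Haenszel DIF on one item, reported on the ETS delta scale.
import numpy as np

def mantel_haenszel_delta(correct, group, strata):
    """correct: boolean item responses; group: 'ref'/'focal'; strata: ability bands."""
    num = den = 0.0
    for s in np.unique(strata):
        m = strata == s
        a = np.sum(correct[m] & (group[m] == "ref"))     # ref correct
        b = np.sum(~correct[m] & (group[m] == "ref"))    # ref incorrect
        c = np.sum(correct[m] & (group[m] == "focal"))   # focal correct
        d = np.sum(~correct[m] & (group[m] == "focal"))  # focal incorrect
        n = a + b + c + d
        if n == 0:
            continue
        num += a * d / n
        den += b * c / n
    odds_ratio = num / den
    return -2.35 * np.log(odds_ratio)  # ETS convention; |delta| >= 1.5 flags large DIF

rng = np.random.default_rng(2)
n = 2_000
group = np.where(rng.random(n) < 0.5, "ref", "focal")
strata = rng.integers(0, 5, n)                 # total-score bands
p_correct = 0.4 + 0.1 * strata                 # same difficulty for both groups
correct = rng.random(n) < p_correct
print(f"MH delta = {mantel_haenszel_delta(correct, group, strata):.2f}  (near 0 means no DIF)")
```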