A multi-component approach to personality-based compatibility assessment
MaybeCoffee. November 18th, 2025
We believe in algorithmic transparency and open-source principles. Unlike many matching platforms that treat their algorithms as proprietary black boxes, we are committed to full transparency. We've invested significant effort in developing a sophisticated compatibility matching algorithm because we genuinely care about creating meaningful connections—not just maximizing engagement metrics or user retention. This document provides a complete mathematical specification of our approach, allowing users to understand exactly how their matches are determined.
Our compatibility matching algorithm consists of three distinct components: (1) Vector Similarity, which measures semantic alignment in embedding space; (2) Neural Network Compatibility, which captures non-linear relationships between personality traits and values; and (3) Social Shipping, which incorporates community-driven signals into the compatibility score.
Our algorithm employs a multi-faceted approach that integrates natural language processing, neural network technology, and established psychological research frameworks to identify compatible personality matches. Unlike traditional matching systems that rely exclusively on multiple-choice responses, our methodology analyzes nuanced language patterns in open-ended questions, processes structured data through validated psychological frameworks, and leverages machine learning models trained on extensive relationship research datasets.
The algorithm is designed to identify users with similar personality traits and values, based on research demonstrating that similarity in core dimensions predicts relationship success and satisfaction.
Each user's responses are encoded into a multi-dimensional feature vector that captures semantic, psychological, and behavioral characteristics. Let and represent the feature vectors for users A and B respectively, where is the dimensionality of the feature space. These vectors are constructed through a multi-stage encoding process:
The complete feature vector is the concatenation: .
The compatibility score between two users is computed as a weighted linear combination of two normalized components, with an additional social shipping component:
Where:
The base compatibility score (from vector similarity and neural network) is scaled to a maximum of 0.9 (90%), ensuring that without any social shipping, the maximum achievable compatibility is 90%. The shipping component can add up to 0.1 (10%) to reach a maximum of 1.0 (100%) compatibility.
The vector similarity component measures semantic alignment in the embedding space. We compute cosine similarity between the semantic subvectors and , which encode the semantic content of open-ended responses:
where is the angle between the semantic vectors. This metric quantifies similarity in expression patterns, values, and perspectives. The semantic embeddings are generated using transformer-based language models that capture contextual meaning, emotional tone, and conceptual relationships.
Cosine similarity is bounded: , but in practice, after normalization and preprocessing, we observe , where 0 indicates orthogonal semantic spaces (completely different) and 1 indicates identical semantic direction (highly similar).
The neural network component captures non-linear relationships and complex interactions between personality traits, values, and behavioral patterns. Our model is a deep neural network trained on a comprehensive dataset combining:
Where:
The network architecture processes structured data (multiple-choice responses, numerical ratings) alongside personality trait vectors to identify patterns that predict compatibility beyond simple similarity metrics. The inclusion of pairwise differences allows the model to learn which trait divergences are compatible versus incompatible.
The model employs attention mechanisms to weight different feature dimensions and includes prompt injection protection to ensure robust predictions resistant to manipulation attempts.
The component weights are calibrated through cross-validation on historical relationship outcome data. The current configuration:
This allocation assigns equal weight (50% each) to semantic similarity and neural network predictions, emphasizing both linguistic expression patterns and deep personality compatibility. The balanced weighting ensures that both surface-level semantic alignment and complex trait interactions contribute equally to the final compatibility score.
The weighting strategy is informed by research demonstrating that similarity in core personality dimensions and values predicts relationship success and satisfaction. The algorithm is explicitly designed to identify compatible matches—users with similar traits and values—rather than complementary opposites.
Our questionnaire format includes three types of questions, each processed differently:
Processed through advanced language models to extract semantic meaning, keywords, and emotional tone. Responses are converted into high-dimensional text embeddings that capture nuanced expression patterns.
Used for MBTI calculation and structured personality assessment. These responses feed directly into our neural network alongside vector embeddings.
Processed mathematically to calculate quantitative alignment. These scores are normalized and integrated into the neural network input features.
We maintain algorithmic transparency while respecting user privacy. Our matching system displays users' top 3 compatibility matches with abstracted or summarized response information. Only initials are displayed in the dashboard, ensuring privacy while providing meaningful insights into compatibility.
The algorithm produces compatibility scores ranging from 0 to 1, where higher scores indicate greater compatibility. Users receive their top 3 matches, ranked by compatibility score, with detailed insights into why each match was selected. We facilitate connections by automatically scheduling dates at partner restaurants and venues, making the transition from match to meeting seamless.
This approach represents a significant advancement over traditional matching systems, combining the depth of psychological research with the power of modern machine learning to create meaningful, lasting connections.
7. Social Shipping Component
The social shipping component allows users to express support for potential matches between other users. This community-driven mechanism incorporates social signals into the compatibility calculation:
Mechanism:
Constraints:
This component acknowledges that social validation and community perception can be meaningful indicators of compatibility, while maintaining safeguards to prevent manipulation. The constraints ensure that shipping remains a genuine social signal rather than a mechanism for artificially inflating scores.