You're only scraping the surface of OkCupid's goodness. Join and indulge.
Sign Up NowThis is a brief, but technical, explanation of how your match percentages are calculated. It's a little complicated, but our method's quite interesting—even unique. Also, there's a patent pending, because we don't want pay sites to hijack our ideas.
If you have general questions about OkCupid, you should visit our normal FAQ.
We start wanting to calculate a match percentage for you and someone else. And we want to avoid mistakes at all costs!
When you answer a question in our improve matches area we learn: (1) your answer, (2) how you'd like someone else to answer, and (3) how important the question is to you. We collect these three values for all users.
Your match percentage with a given person on OkCupid, let's call him b, is based on the values of (1), (2),and (3) for questions you've both answered. We'll call that set S later in this explanation:
Questions youNow let's look at two example questions and see how we use all this information to make a match.
Back to Top| your answer | how you want someone else to answer | the question's importance to you |
|---|---|---|
| 3 | 2 or 3 | very important |
| b's answer | how b wants someone else to answer | very important |
|---|---|---|
| 2 | 2 | a little important |
| your answer | how you want someone else to answer | the question's importance to you |
|---|---|---|
| 2 | 2 | a little important |
| b's answer | how b wants someone else to answer | the question's importance to b |
|---|---|---|
| 1 | 2 | somewhat important |
First of all, since we use computers to do this, we need to assign numerical values to ideas such as "somewhat important" and "very important". We chose the following scale:
| Level of Importance | Point Value |
|---|---|
| Irrelevant | 0 |
| A little important | 1 |
| Somewhat important | 10 |
| Very important | 50 |
| Mandatory | 250 |
When we look at how each of your answers satisfied the other's preferences, we'll use these values to give our calculations the correct weight. Your match percentage with b is figured by answering the following two questions:
You indicated that b's answer to the first question was very important to you. And that his answer to the second question was not. So we placed 50 importance points on the first question and 1 point on the second question. Of those 51 possible points, b earned 50 by answering the first question how you wanted. So b's answers were 50/51 = 98% satisfactory.
Well, b placed 1 importance point on your answer to the first question and 10 on your answer to the second. Of those 11, you earned 10 points. So your answers were 10/11 = 91% satisfactory.
To get a match percentage for you and b, we just multiply your satisfactions, and then take the square root: sqrt(91% * 98%) = 94%.
This is a mathematical expression of how happy you'd be with each other...if these two questions were the only things that mattered in a relationship!
Back to TopA. Because we like to think of each match percentage as the probability you'd get along. That's the product of them, assuming they're independent. Intuitively, this makes more sense anyway; two people matching each other 95% are a better match than two others who match 90% and 100%.
Back to TopA. Even though two users have satisfied each other on a few common questions, they may not actually be a good match. That is, while the set of questions you've both answered, S, is small, we can't have much confidence in the match percentage yielded by the above calculations.
With any poll, there's a margin of error that needs accounting for, and here's how we do it:
True Match = Calculated Match +/- Reasonable Margin of Error
We've defined the Reasonable Margin of Error as 1/sqrt(size of S). This is the accepted definition for data like ours. It's used in political polling all the time. When you read that "40% of Americans like George W. Bush", with "a margin of error of 5%", that means 400 people were polled: 1/sqrt(400) = 5%. Cool, huh?
In OkCupid, when the size of S = 400, meaning you and someone else have answered 400 of the same questions, and we've calculated your match to be, say, 84% based on your answers, that means your True Match is between 79% and 89%.
To give you the most confidence in the match process, we always publish the lowest possible percentage your match can be. In this example, that would be 79%.
So when we were comparing you and b above, your Calculated Match was 94.5%, but you'd only answered 2 questions. The Margin of Error for a S of that size is 71%! So the published match percentage of you and b would only be 23.5%, which is 94.5% - 71%, as per our True Match formula.
Examine the following:
| Size of S | Margin of Error | Highest Possible Match |
|---|---|---|
| 1 | 1.00 | 0% |
| 2 | 0.71 | 29% |
| 3 | 0.58 | 42% |
| 4 | 0.50 | 50% |
| 5 | 0.45 | 55% |
| 10 | 0.32 | 68% |
| 20 | 0.22 | 78% |
| 50 | 0.14 | 86% |
| 100 | 0.10 | 90% |
| 200 | 0.07 | 93% |
| 500 | 0.04 | 96% |
| 1000 | 0.03 | 97% |
You have to answer 100 questions, for a 90% match to even be possible! A consequence of this is that we're highly confident in our published match scores...we've chosen the lowest statistically valid value. Our users have to tell us a lot about themselves before we can pretend like we know them.
Back to TopA. We have a system for sorting questions by how well they divide the population. Users are exposed typically to the best questions they haven't answered yet.
Back to TopA. We record your answer, but the question's importance is cast as "Irrelevant" when matching people to you. Your answer may obviously still affect the match, of course.
Back to TopA. Using the "Mandatory" and "Very important" votes selectively will heavily focus matches on users who meet your most important criteria. However, purely filtering matches by the "Mandatory" vote would upset many users who use the term more liberally. As a rule of thumb, save that vote for the case where you couldn't POSSIBLY IMAGINE dating someone who answered incorrectly. Still, keep an open mind. Brokeback Mountain.
Back to TopA. It's likely you would get confused and screw up your matches.
Back to TopA. The best way to think about those numbers is to see what they imply about the relative values of questions. For example, 10 "a little important" questions are equal to 1 "somewhat important" question. And 5 "very important" questions are worth 1 "mandatory" question. If we let you edit them, you might put in something ridiculous like (0,1,2,3,4) and that would be bad for your matching. Nonetheless, we might add this as a feature soon... It would be very easy to program, it's just a question of whether or not we trust you.
Back to TopA. That's it!
Back to TopA. Definitely not, Creep. You could increase your average match score by picking answers that you think the average person wants to hear, but your matches won't like you as much.
Look at it this way: Ok matching effectively sorts people by how much you'd like them and vice versa. Lying doesn't introduce you to better people; it screws the order up. By answering honestly, you'll find people who really like you best for who you are. Cheesy, but true.
We always appreciate your feedback. If you have other questions, read the FAQ.
Back to Top