My Homescreen



Sign Up!

OkCupid is Free Online Dating.

FAAAQ! - Frequently Asked for Answers About Questions

This is a brief, but technical, explanation of how your match percentages are calculated. It's a little complicated, but our method's quite interesting—even unique. Also, there's a patent pending, because we don't want pay sites to hijack our ideas.

If you have general questions about OkCupid, you should visit our normal FAQ.

We start wanting to calculate a match percentage for you and someone else. And we want to avoid mistakes at all costs!

When you answer a question in our improve matches area we learn: (1) your answer, (2) how you'd like someone else to answer, and (3) how important the question is to you. We collect these three values for all users.

Your match percentage with a given person on OkCupid, let's call him b, is based on the values of (1), (2),and (3) for questions you've both answered. We'll call that set S later in this explanation:

venn diagram Questions you
answered
S Questions b
answered

Now let's look at two example questions and see how we use all this information to make a match.

Back to Top

Question 1: How messy are you?

  1. very messy
  2. average
  3. very organized
your answer how you want someone else to answer the question's importance to you
3 2 or 3 very important
b's answer how b wants someone else to answer very important
2 2 a little important

Question 2: Have you ever cheated in a relationship?

  1. yes
  2. no
your answer how you want someone else to answer the question's importance to you
2 2 a little important
b's answer how b wants someone else to answer the question's importance to b
1 2 somewhat important
Back to Top

Calculating The Match

First of all, since we use computers to do this, we need to assign numerical values to ideas such as "somewhat important" and "very important". We chose the following scale:

Level of Importance Point Value
Irrelevant 0
A little important 1
Somewhat important 10
Very important 50
Mandatory 250

When we look at how each of your answers satisfied the other's preferences, we'll use these values to give our calculations the correct weight. Your match percentage with b is figured by answering the following two questions:

  1. How much did b's answer make you happy?

    You indicated that b's answer to the first question was very important to you. And that his answer to the second question was not. So we placed 50 importance points on the first question and 1 point on the second question. Of those 51 possible points, b earned 50 by answering the first question how you wanted. So b's answers were 50/51 = 98% satisfactory.

  2. How much did your answers make b happy?

    Well, b placed 1 importance point on your answer to the first question and 10 on your answer to the second. Of those 11, you earned 10 points. So your answers were 10/11 = 91% satisfactory.

To get a match percentage for you and b, we just multiply your satisfactions, and then take the square root: sqrt(91% * 98%) = 94%.

This is a mathematical expression of how happy you'd be with each other...if these two questions were the only things that mattered in a relationship!

Back to Top

Q. Why do you multiply (as opposed to say, average) the two match scores together, to get a final score?

A. Because we like to think of each match percentage as the probability you'd get along. That's the product of them, assuming they're independent. Intuitively, this makes more sense anyway; two people matching each other 95% are a better match than two others who match 90% and 100%.

Back to Top

Q. What if a user and I have only answered one question in common, and we happen to satisfy each other's requirements? Does that mean we're suddenly a 100% match?

A. Even though two users have satisfied each other on a few common questions, they may not actually be a good match. That is, while the set of questions you've both answered, S, is small, we can't have much confidence in the match percentage yielded by the above calculations.

With any poll, there's a margin of error that needs accounting for, and here's how we do it:

True Match = Calculated Match +/- Reasonable Margin of Error

We've defined the Reasonable Margin of Error as 1/sqrt(size of S). This is the accepted definition for data like ours. It's used in political polling all the time. When you read that "40% of Americans like George W. Bush", with "a margin of error of 5%", that means 400 people were polled: 1/sqrt(400) = 5%. Cool, huh?

In OkCupid, when the size of S = 400, meaning you and someone else have answered 400 of the same questions, and we've calculated your match to be, say, 84% based on your answers, that means your True Match is between 79% and 89%.

To give you the most confidence in the match process, we always publish the lowest possible percentage your match can be. In this example, that would be 79%.

So when we were comparing you and b above, your Calculated Match was 94.5%, but you'd only answered 2 questions. The Margin of Error for a S of that size is 71%! So the published match percentage of you and b would only be 23.5%, which is 94.5% - 71%, as per our True Match formula.

Examine the following:

Size of S Margin of Error Highest Possible Match
1 1.00 0%
2 0.71 29%
3 0.58 42%
4 0.50 50%
5 0.45 55%
10 0.32 68%
20 0.22 78%
50 0.14 86%
100 0.10 90%
200 0.07 93%
500 0.04 96%
1000 0.03 97%

You have to answer 100 questions, for a 90% match to even be possible! A consequence of this is that we're highly confident in our published match scores...we've chosen the lowest statistically valid value. Our users have to tell us a lot about themselves before we can pretend like we know them.

Back to Top

Q. How are questions chosen?

A. We have a system for sorting questions by how well they divide the population. Users are exposed typically to the best questions they haven't answered yet.

Back to Top

Q. What if I check all the "acceptable answers" boxes, or none of them?

A. We record your answer, but the question's importance is cast as "Irrelevant" when matching people to you. Your answer may obviously still affect the match, of course.

Back to Top

Q. Shouldn't "Mandatory" be some kind of filter?

A. Using the "Mandatory" and "Very important" votes selectively will heavily focus matches on users who meet your most important criteria. However, purely filtering matches by the "Mandatory" vote would upset many users who use the term more liberally. As a rule of thumb, save that vote for the case where you couldn't POSSIBLY IMAGINE dating someone who answered incorrectly. Still, keep an open mind. Brokeback Mountain.

Back to Top

Q. Why can't I place different importance values on each acceptable answer?

A. It's likely you would get confused and screw up your matches.

Back to Top

Q. The importance values you mentioned above (0,1,10,50,250) seem wrong. I know what is important to me and I want to assign my own values. Ok?

A. The best way to think about those numbers is to see what they imply about the relative values of questions. For example, 10 "a little important" questions are equal to 1 "somewhat important" question. And 5 "very important" questions are worth 1 "mandatory" question. If we let you edit them, you might put in something ridiculous like (0,1,2,3,4) and that would be bad for your matching. Nonetheless, we might add this as a feature soon... It would be very easy to program, it's just a question of whether or not we trust you.

Back to Top

Q. What besides user questions affects my match percentages?

A. That's it!

Back to Top

Q. Is there incentive for me to lie?

A. Definitely not, Creep. You could increase your average match score by picking answers that you think the average person wants to hear, but your matches won't like you as much.

Look at it this way: Ok matching effectively sorts people by how much you'd like them and vice versa. Lying doesn't introduce you to better people; it screws the order up. By answering honestly, you'll find people who really like you best for who you are. Cheesy, but true.

We always appreciate your feedback. If you have other questions, read the FAQ.

Back to Top