Morgan Lalique
Member
Z
Last edited:
I’ve been watching rating functioning for over a month, now, and I have some observations and suggestions.
First, the picture size is too small. Sometimes it is a large pic shrunk down, but sometimes it is a small pic, and there is no way to tell. When you look at the babe after you make your rating, you find this small pic that loses resolution when it is expanded to fit the screen. There have been any numbe of pics I would have rated differently if I had known. After all, that is what we are rating, the picture.
Otherwise every pic of a babe would be rated the same.
Second, while the battle of the day is many votes between two babes, the individual ratings are not, and lead to some screwy results. After rating a pair of pics, I look at each to see the wins and losses, and enough of them are affected either by favoritism or possibly self-interest to skew the results. Is it possible to be given your own babes to rate, or is that source of bias weeded out?
Anyway, I don’t think any matchup should be decided on single votes. Once a matchup gets created, it should be handed out to others periodically until at least 20 votes have been received. This will keep great pics from being downgraded and junk photos getting good ratings because of flaky voters.
How does the battle of the day affect picture ratings? Is it a pic vs pic thing, or a babe vs babe thing? If the latter, it would be helpful if several pics for each babe were chosen.
How are the top 100 pics chosen? I see a lot of pics in there that are simply not good enough. Yesterday I saw one of just an ass (which bemuses me because I own it). No pic should make the top 100 if you can’t identify the babe. Also, it should require a minimum number of votes, like 500 or more. There are lots of iconic photos on this site that are far far better than a lot of what is in the top 100. Some of them have had their ratings distorted by the voting problem I mention above.
There should be other requirements: the photos should have a minimum number of ratings against other top 100 photos, to make sure they are ranked correctly.
Also, they should be ranked against other shots of the same babe, to make sure the best photo of the babe is the one that makes the list.
I think these criteria will provide a list that more people would understand and agree with.
If you think any of this may be sour grapes because I’m new here, it isn’t, I have 15-20 of the top 100 spots. Many of my photos in the top 100 I don’t think should be there. I do have a few others that are great photos but aren’t on the list, and my reaction is WTF? Why these and not them? The answer is the automated rating system and how it functions.
All images should be cropped of their frames, or whitespace used to pad a photo so it is square for mobile devices, so it will display at its true size.
But we are voting on the picture as it stands, not some mythical picture as it may be in the future. We should see its size. The pictures as displayed are too small. A large out of focus photo will look much sharper in detail when it is shrunk down, because of the data lost in shrinkage, and can look just as good as a sharp, shrunken photo. We cannot judge accurately as things stand. Trying to provide parity for small images by shrinking big ones down to their size results in a loss of resolution in larger (and very likely better) images. It skews the results against large images, because we cannot see their full quality.
So, hypothetically, if I voted enough to reach these limits for each babe, I couldn't get any more pictures to vote on?
Isn't each matchup a single vote? Are you talking about the BotD? I'm talking about the votes that go into the rating, and each one certainly is a single vote.
So the one pic vs pic matchup that includes multiple votes (and is therefore more reliable) isn't even included in a babe's rating? Naturally, you wouldn't include each individual vote, but the outcome of the matchup would be far more reliable than the individual votes that you do use. I'm saying that each pic vs pic match should have a minimum of 20 votes before it could be used in determining babe rating.
Which really helps make my points. The 9.9 rating on that pic isn't an accurate measure of its quality. Individual votes against other pics are given too much weight in deciding pic vs pic matchups. Many of the pic's "wins" would become losses if a 20 vote minimum were required for the matchup to count as a win or loss.
Then there is the distribution problem. In a one vote match, you would expect some bias, and so would need to use a probability curve to decide where in the range the pic actually falls. A 9.9 pic could be expected to defeat some higher ranking pics (and hence its true rank would be somewhere below the top pics it defeated). Also, many of the pics it lost to are lower ranking, and better pics, why weren't these given more weight? Winning vs lower ranked pics should count for less than losing to lower ranked pics, where if the ranking was correct it would have won?
I propose two challenges to demonstrate this point:
Have a faceoff day, put up battles for the Chriqui pic against her top 20 wins, each to be decided by a minimum of 20 votes.
Also, put up 20 battles vs lower ranked pics I choose by hand, choices to be significantly lower in rank and similarly clothed (ie no skimpy bikini shots vs her pic in a dress). This will demonstrate the inaccuracy of her rating, and the ratings on the other pics I choose.
I expect this will give you hard evidence of how well your system is performing. If her pic gets slaughtered (like losing 15 or more matches in each of the two challenges), then it is time to make some changes to improve the accuracy of your system.
I do hope your goal is to have the best rating system on the web.
It is gone now. I don't remember whose ass it was, even though I bought it. I figured it would get hits. It was scored 60ish on the list yesterday. I found 6 or 7 pics today that don't belong there though. Either there is a coding bug or the pics aren't flagged right when they enter your system.
Then you can only determine the relative ranking of a babe's pics indirectly. It makes it harder to get the girl's top pic on the list. I think that your concern of a girl hurting her own ranking is larger than warranted. With all the pics on your system, how often would such a matchup occur right now, totally at random? And wouldn't such a rating be valid in ranking one photo over the other?
Well, I'm sure my tastes differ from the average adolescent male. Being a photographer, I put a higher emphasis on image quality, resolution, lighting, focus, depth of field, etc. than most people. I also rate on setting, composition, clothing, and style.
Honest isn't the same thing as unbiased. You minimize the effects of individual bias by group confirmation (requiring multiple votes for a pic vs pic matchup). Of course, you still have the problem of group bias, but that's a social issue too big for a web site like this.
Here Are some examples of blurry photos that are probably ranked higher than they deserve because during rating the images are shrunk to a point you can't tell:
https://hotnessrater.com/full-sized-picture/2065184/audrey-allen
https://hotnessrater.com/full-sized-picture/3590382/jessica-ashley
So this picture:
https://hotnessrater.com/full-sized-picture/3956645/rosie-jones
defeated:
https://hotnessrater.com/full-sized-picture/1515407/hannah-ferguson
Only in a one-vote biased matchup, where pics are shrunk so you can't see their true image quality. With 20+ votes on full-sized images it wouldn't happen. These sorts of outcomes are strongly skewing your ratings.
The Rosie Jones pic also defeated:
https://hotnessrater.com/full-sized-picture/3898847/kara-del-toro
https://hotnessrater.com/full-sized-picture/3793702/lais-ribeiro
https://hotnessrater.com/full-sized-picture/593479/nina-agdal
Well, some of those might end up around 50/50. But what about:
https://hotnessrater.com/full-sized-picture/5509572/stacy-keibler
https://hotnessrater.com/full-sized-picture/1444547/whitney-cowart
https://hotnessrater.com/full-sized-picture/3895905/miranda-janine
https://hotnessrater.com/full-sized-picture/2919181/karen-lima
(Crop white blocks from this last pic so it can be displayed fully)
I think in full-sized matchups with multiple voters, the Jones pic loses most or all of those "wins".
There is another question: How good are the pictures it beat that are judged to be in its range? What is that range, btw?
Here are some pics the Jones image won against that I don't think deserve their ratings, and so shouldn't be used to boost the rating of the Jones pic:
https://hotnessrater.com/full-sized-picture/1948964/jessica-alba
(I love Jess, have loads of her pics on this site, and over 5,000 on my hard drive, but this one kinda blows, 9.67? Really?)
https://hotnessrater.com/full-sized-picture/1948964/jessica-alba 9.71???
https://hotnessrater.com/full-sized-picture/4499525/catherine-zeta-jones 9.75, crop of another pic, and shouldn't even be on the site
https://hotnessrater.com/full-sized-picture/3853166/noemi-olah 9.63 again
BTW, are ratings recalculated if a pic the ratings are based on is removed?
https://hotnessrater.com/full-sized-picture/4499525/catherine-zeta-jones 9.75, crop of another pic, and shouldn't even be on the site
RE the cropped picture, my point is your moderators/uploadrs should not have allowed/put the photo on your system to begin with..
I came across a girl yesterday whose hotnessrater material was over 3000 boring catalog shots. Eliminate them.
It would help reduce individual bias distorting your results.
As for showing you a pic with 200+ votes that is rated wrong, look at the Chriqui pic. I'd really like to see the results of the two challenges I proposed, I'm about half done choosing my contenders.
Well, I don't see anything obvious about it. Since people are prevented from voting on photos in their true size, you have no actual data, just your opinion on why people vote how they do. When it comes to voting on different pictures, there is no valid stastical method to differentiate their reasons.
The idea that size, focus, lighting, and resolution make no difference to people and how they vote, especially on photos of similar hotness, goes against common sense, and feedback that pretty much every web site gets from its customers bears out that it matters. The fact that you try to normalize it away by shrinking great photos down to match inferior photos so the inferior photos have a better chance is acknowledgement on your part that what I say is true.
So, I took a SS of the BotD, saved off the images as presented on the BotD screen, and inspected them. I zoomed in my ipad screen to get the largest version of the image that I could, and took SS of them as well. I compared those SS the images available after voting, when you can look at the real images, and the difference is noteworthy. Trying to zoom up the shrunk image results in a crappy image with lots of digital artfacts.
Add links to the page so people can pop up the full pic if they want to see it before voting, that's all. You don't need to revamp the whole screen. As you pointed out, there is only so much room to display them side by side, a very valid point. So is avoiding scrolling around.
Since clicking on the pic does nothing until you have voted, you could use that to give you a full size popup. I rather expected it to work that way the first time I voted on a BotD, I was a bit surprised when it didn't do that in the first place.
Bottom 5 paragraphs of post 3 in this thread, I proposed two challenges, one between the Chriqui pic and her top 20 victories, and one between 20 pics chosen by me, pics to be similarly clothed and lower ratings (like in the 9.0-9.7 area). Hmmm. They should be pics not in her list of existing matchups, too, but that is a lot harder for me to manage, paging through her wins and losses takes a lot of time.
And I do think that the limited number of voters has skewed her rating, the pic is not a 9.9, 9.5 maybe.
General statistical principles are still the soundest way to go, though. More votes per matchup equals greater reliability.
Actually, in college football coach's poll and things like that they do consider who you played and what the score was.
Well, as I said before, I'd institute gradual changes.
To start, I'd eliminate shlock pics that 98% of your raters would prefer not to use their rating time on, and 98% of your viewers don't care if they ever get seen or rated
At the same time, I'd raise the rating standard to best 2 of 3 (so two votes would be enough if they are in agreement)
Once you have a large enough base of 3 vote ratings, then raise the standard to 5
I'd give points to players for doing rating, say, 1 point per 10 ratings.
and a lot of the coding is pretty straightforward
I think you need to advertise your forums more, seems like the two of us are the only ones interested atm. Links on every page would help; like down at the bottom?
Let's say I have 10 pictures and 50 potential votes. The lower the picture number the hotter the picture. So Pic1 is the hottest Pic10 is the least hottest.
We will assume a 10 vote match eliminates all error.
I could structure the votes like this:
Pic1 vs Pic2 - 10 votes = Pic1 wins
Pic3 vs Pic4 - 10 votes = Pic3 wins
Pic5 vs Pic6 - 10 votes = Pic5 wins
Pic7 vs Pic8 - 10 votes = Pic7 wins
Pic9 vs Pic10 - 10 votes = Pic9 wins
or I can give each picture 5 votes against random pictures and get
Pic1 is 5-0
Pic2 is 4-1
Pic3 is 4-1
Pic4 is 3-2
Pic5 is 2-3
Pic6 is 2-3
Pic7 is 2-3
Pic8 is 2-3
Pic9 is 1-4
Pic10 is 0-5
With the top method, you have no idea of the hotness relevance between Pic1, Pic3, Pic5, Pic7 or Pic9. You can really only give 2 number ratings, half are a 1 and half are 10s
The bottom method (which I use) gives you a pretty good idea of relative rating after the 50 votes. It's not perfect but I can already hand out 6 different rating levels opposed to the other method that can only give out 2.
Both methods aren't entirely reliable and will of course benefit from more votes
So lets say we get 50 more votes. Now with your method you add
Pic1 vs Pic3 = Pic1 wins
Pic5 vs Pic7 = Pic5 wins
Pic9 vs Pic2 = Pic2 wins
Pic4 vs Pic6 = Pic4 wins
Pic8 vs Pic10 = Pic8 wins
Now we have
Pic1 with 2 wins
Pic2 with 1 win
Pic3 with 1 win
Pic4 with 0 wins
Pic5 with 2 wins
Pic6 with 0 wins
Pic7 with 1 wins
Pic8 with 1 win
Pic9 with 1 win
Pic10 with 0 wins
So which one is hotter Pic5 or Pic6. How about Pic4 or Pic5? Pic6 or Pic10?
You really still don't have any idea how to rank these after 100 votes. You're not really even sure if Pic1 is better than Pic5.
With my method, there might be some bad votes but you even with that calculated in, you can get a better idea of relative ratings and my way gets more accurate faster as votes are added... especially vs large sets of pictures.
Well, instead of the above, use 5 vote matches in this hypothetical example. It isn't going to be perfect, but it will be stronger than single vote matches, the study of probability and statistics makes this clear.