Request for Transparency

Level Designer Contests, discussion about and help with the Level Designer, information about glitches, and levels and level series can be found here.

Moderator: Level Moderator

Final judge discussion?

Yes
10
63%
No
6
38%
 
Total votes : 16

Request for Transparency

Thumbs up x1

Postby -BY » April 30th, 2015, 8:55 am

Another LDC ended and the drama knocks, pleading people to finally let it in.
Going by the chat discussions and very soon, also the topic responses, there really is something off with the current judging system.
I'm myself requesting a change, people actually might be able to agree on.

It's in fact pretty simple. When the judges are done with their work, they should contact each other and spend a bit of time on discussing their judgings.
Doing everything anonymous while planting bombs, waiting for the results to explode are definitely not the way to go. Never was, never are and never will be.
I'm not talking about everyone going for uniform scores. But really extreme differences between the scores shall get evened out. There has been no single moment in the entire LDC history, in which these differences have been justified. You may call it subjective. But it's in fact people blindly hating, for whatever reasons. While we're at it, you might also compare all levels to each other in that final discussion, wondering if you actually find out that specific levels should get a bit of a lower/higher score in general. These serious discussions should also help everyone reflecting on your own thoughts.
Maybe you will find out that you have been biased without proper noticing it in the end.

So yeah. I'm at a point I cannot keep watching this anymore. Let's discuss about this in all seriousness and let's see if we can implement this in the future LDC's.
Image

Big thank you to FrozenFire who created this masterpiece. : )
User avatar
-BY
Honorary Member

 
Posts: 900
Joined: May 23rd, 2012, 11:43 pm
Location: 298

Razzian Fighter

Thumbs Up given: 72 times
Thumbs Up received: 100 times

Re: Request for Transparency

Postby Venexis » April 30th, 2015, 9:10 am

Just said this in IRC, but as a backup, I would love to see this become the norm.

My biggest worry was of not being consistent- I could forgive myself even if nobody else did if I had rated every single level, but that wasn't the case, and I've been known to be a bit more strict than the average in the past. The most stressful part was that one of my scores would fall well outside the "acceptable range" as defined by full judges, and... it kinda happened. I rated MK's level significantly harsher than any other judge, and MoD's/SK's half a point better than anyone else.

I guess when you break it down those gaps aren't that significant compared to levels like CY's, who was affected by nearly 1.5 points, but... it'd definitely be cool to communicate freely with other judges, if not participants (I think there are merits for this too, did I totally miss the point of an abstract level like MKs? Never gonna know for sure unless I talk to the creator), as I at least encountered a significant amount of difficulty trying to not unintentionally ♥♥♥♥ up a score by rating as I typically would as a full judge.

I feel like my scores overall were much higher than normal to attempt to passively compensate for that, by 1-2.5 points depending on the case.
Spoiler: show
Image
10/10, thanks FrozenFire :3

Or add me, at Venexis#9902.
User avatar
Venexis
Prophet of Shadowsquid

 
Posts: 1342
Joined: August 19th, 2009, 7:15 am
Location: Headquarters of EVIL! And definitely NOT my garage... Nope...

Thumbs Up given: 89 times
Thumbs Up received: 211 times

Re: Request for Transparency

Thumbs up x1

Postby *Emelia K. Fletcher » April 30th, 2015, 10:01 am

yeah, a moderation session between judges would be nice for getting a general idea of things

although i still stand by the fact that playing a level is about the experience a player (and THE player) will have, and not the experience a creator or anyone else wanted them to have unless the idea hinges on it (which technically was the case for my level but hell did i communicate that poorly)


i mean look at the ♥♥♥♥ disparity between mine and anyone else's rating


');
');





');





User avatar
*Emelia K. Fletcher
Who's this douchebag?

Error contacting Twitter
Error contacting last.fm
 
Posts: 2926
Joined: July 24th, 2010, 3:40 am
Location: A\//\\/A

Cookie
Venexis: "He had everything out seven hours after I had sent the results, give or take. And most of those hours were in the dead of night, lawl. 11/10 would hire as host of a game show."

Thumbs Up given: 42 times
Thumbs Up received: 211 times

Re: Request for Transparency

Postby MessengerOfDreams » April 30th, 2015, 10:52 am

https://docs.google.com/spreadsheets/d/ ... sp=sharing

Just some data before we go into accusations:

The placements for each judge show that they each have some levels, around a small handful, that they differed from everyone else on, some moreso than others. Also, nearly everyone had one level that they scored way higher or way lower than average, by anywhere between 2 and 6 points.

However, from what I've found, the average placements almost dead-on reflect the average score, if both were ranked by those averages. If we ranked in a WITBLOAT style, where we ranked from 1 to 18, the only differences would be that Doram and BY switch and some mild shifting in the lower ranks.

I don't think there's negative, biased trends there. That seems to be derived entirely from low personal opinion from some spectators to some judges. As it stands, the patterns are consistent but affect little.

Later, I'll collect data on how scores would have changed under the old Supershroom system if everyone's highest and lowest scores had been removed, to see if it truly can bring equity with 6 judgings.
Image
Image

My Most Recent Works: show
I switch my signature a lot. If you wanna see some of my past ones, here you go.
Silent Conversations and a Crow's Final Song!
My latest story, and one of my personal favorites. A girl bound in silence finds the words to say to her prospective girlfriend as they visit her religious father in a dusty town on the edge of Kansas, where the crows' migration south brings forth anchored memories, the path to resolution, and a new start.

Form (25quared)
This might be like nothing you've seen before.
Updated Works! Some of my past best and current stuff: show
Writing Works!
Fanfic: Shut Up and Dance
why do I write so much about dancing you don't dance you've never danced in your entire life
lying little ♥♥♥♥ with your ♥♥♥♥ story ♥♥♥♥ you
also Diddy/Lucina <3

Fanfic: Worth a Thousand Words
Because the world needed a Samus/Dedede story
Fanfic: Ecstatic Silence
Just wanted to write and ♥♥♥♥ like this happens, you'd think I'd know better.
Fanfic: Far From the Edge
It's a dance that's been a long time coming for a brand new man and an unchanging woman, but once one takes the plunge there's no falling back up.
Original: Jealous Ghosts of the Mississippi
The story of Rachel meeting Amber after a lifetime of silence and being shunned
Original: Your Hand in Mine
One of my most personal stories about a dangerous romance. Now to be published in a college lit journal!
-------------------------
Recent LDC Work:
Level Series: Leaves From The Vine (ft Star King)
For the 29th LDC, a theme of grassland taken through the growth of our designing society. Won the 29th LDC!
--
Levels at Large:
Level Topic: Collection of MoD's Levels!
If you've ever wanted to see any level I made worth a damn, go here! From the quiet 14th LDC entrant Finis to the megasmash level series Dark, you can find links here!
"You were always a revolutionary, now there's just less of a chance of you crying in the corner." ~Ridder
User avatar
MessengerOfDreams
Moderator

Error contacting Twitter
 
Posts: 6615
Joined: August 16th, 2009, 11:31 am
Location: When I figure it out, I'll let you know.

Winter
2016 Story Contest

Thumbs Up given: 519 times
Thumbs Up received: 707 times

Re: Request for Transparency

Postby Karyete » April 30th, 2015, 11:35 am

Wait, does that graph tell me that MK gave me the highest score of all his judgings? Fo reals?
Image
Major credit goes to l.m for this!

Museum of Monsters

Most recent SM63 / LL level: Flickering Forest
User avatar
Karyete
Moderator

 
Posts: 1745
Joined: March 6th, 2011, 4:35 am
Location: Yes

LL Level Designer Contest Winner
LL LDC #4: Ruins/Temple

Thumbs Up given: 273 times
Thumbs Up received: 148 times

Re: Request for Transparency

Thumbs up x1

Postby *Emelia K. Fletcher » April 30th, 2015, 11:41 am

Karyete wrote:Wait, does that graph tell me that MK gave me the highest score of all his judgings? Fo reals?

<Chaukai> But for the most part your score trends similar to the average
<Chaukai> Except your karyete score
<EmeliaK> because karyete made a goddamn good level that was fun to play
<EmeliaK> it was probably the most fun i had out of the entire ldc


');
');





');





User avatar
*Emelia K. Fletcher
Who's this douchebag?

Error contacting Twitter
Error contacting last.fm
 
Posts: 2926
Joined: July 24th, 2010, 3:40 am
Location: A\//\\/A

Cookie
Venexis: "He had everything out seven hours after I had sent the results, give or take. And most of those hours were in the dead of night, lawl. 11/10 would hire as host of a game show."

Thumbs Up given: 42 times
Thumbs Up received: 211 times

Re: Request for Transparency

Thumbs up x1

Postby Karyete » April 30th, 2015, 11:49 am

now I feel all warm and fuzzy
Image
Major credit goes to l.m for this!

Museum of Monsters

Most recent SM63 / LL level: Flickering Forest
User avatar
Karyete
Moderator

 
Posts: 1745
Joined: March 6th, 2011, 4:35 am
Location: Yes

LL Level Designer Contest Winner
LL LDC #4: Ruins/Temple

Thumbs Up given: 273 times
Thumbs Up received: 148 times

Re: Request for Transparency

Postby Venexis » April 30th, 2015, 12:38 pm

MessengerOfDreams wrote:I don't think there's negative, biased trends there. That seems to be derived entirely from low personal opinion from some spectators to some judges. As it stands, the patterns are consistent but affect little.


To be perfectly clear, I agree with this. I don't really think this is necessary, but it sure would be incredibly nice to have, especially as a resource for backup judges.

Like, usually my "average" score is in the 10-12 range as opposed to the commonly seen 13-15 range... that could definitely be interpreted as bias, even if it is unintentional, as those harsher ratings would only apply to the few levels additional judges are needed for. A group communication session would go a long way toward detecting and preventing similar issues that otherwise would not have been evident until after the contest's results post.
Spoiler: show
Image
10/10, thanks FrozenFire :3

Or add me, at Venexis#9902.
User avatar
Venexis
Prophet of Shadowsquid

 
Posts: 1342
Joined: August 19th, 2009, 7:15 am
Location: Headquarters of EVIL! And definitely NOT my garage... Nope...

Thumbs Up given: 89 times
Thumbs Up received: 211 times

Re: Request for Transparency

Postby MessengerOfDreams » April 30th, 2015, 12:51 pm

Well, judges having higher standards or higher praise doesn't really affect overall standards if that's just how they are unless, like you, they're alternate judges judging only a small amount of levels, or one is simply worried about overall LDC standings comparable through history

Also, I updated my chart significantly. The first is taking the "let's cut off the worst and best judgings" verdict from way back to show it makes nearly zilch difference. The next is just laying out exactly which order each judge took. Some are weird orders, but overall there are just a few special cases in a uniform order.
Image
Image

My Most Recent Works: show
I switch my signature a lot. If you wanna see some of my past ones, here you go.
Silent Conversations and a Crow's Final Song!
My latest story, and one of my personal favorites. A girl bound in silence finds the words to say to her prospective girlfriend as they visit her religious father in a dusty town on the edge of Kansas, where the crows' migration south brings forth anchored memories, the path to resolution, and a new start.

Form (25quared)
This might be like nothing you've seen before.
Updated Works! Some of my past best and current stuff: show
Writing Works!
Fanfic: Shut Up and Dance
why do I write so much about dancing you don't dance you've never danced in your entire life
lying little ♥♥♥♥ with your ♥♥♥♥ story ♥♥♥♥ you
also Diddy/Lucina <3

Fanfic: Worth a Thousand Words
Because the world needed a Samus/Dedede story
Fanfic: Ecstatic Silence
Just wanted to write and ♥♥♥♥ like this happens, you'd think I'd know better.
Fanfic: Far From the Edge
It's a dance that's been a long time coming for a brand new man and an unchanging woman, but once one takes the plunge there's no falling back up.
Original: Jealous Ghosts of the Mississippi
The story of Rachel meeting Amber after a lifetime of silence and being shunned
Original: Your Hand in Mine
One of my most personal stories about a dangerous romance. Now to be published in a college lit journal!
-------------------------
Recent LDC Work:
Level Series: Leaves From The Vine (ft Star King)
For the 29th LDC, a theme of grassland taken through the growth of our designing society. Won the 29th LDC!
--
Levels at Large:
Level Topic: Collection of MoD's Levels!
If you've ever wanted to see any level I made worth a damn, go here! From the quiet 14th LDC entrant Finis to the megasmash level series Dark, you can find links here!
"You were always a revolutionary, now there's just less of a chance of you crying in the corner." ~Ridder
User avatar
MessengerOfDreams
Moderator

Error contacting Twitter
 
Posts: 6615
Joined: August 16th, 2009, 11:31 am
Location: When I figure it out, I'll let you know.

Winter
2016 Story Contest

Thumbs Up given: 519 times
Thumbs Up received: 707 times

Re: Request for Transparency

Thumbs up x2

Postby Harmless » April 30th, 2015, 8:03 pm

If we're suddenly going for 'let's have all the judges rate a level evenly' then we might as well pick only one judge per LDC from now on.

The variety of judges and their outlooks/expectations are the reason why I took up participating in LDC's. So I can hear multiple viewpoints. Yurimaster may have thought Supershroom's level was the best goddamn thing on the Earth, but EmKay clearly said otherwise that it felt untested and very frustrating.

Though a meeting between the judges and encouraging the judges to talk with each other is a good idea. I did discuss some of this with MoD during the LDC, and having judges combine their thoughts would probably lead to better judgings. It would be ridiculous to ask for all the judgings to be roughly the same though.
Expect something cool here soon!

~ Tesla Bromonovich
User avatar
Harmless
Is it lunch time yet?

 
Posts: 2793
Joined: June 25th, 2011, 11:53 am
Location: Mother Russia!

Runouw Votes Winner
For winning Master of a Hidden Talent in the RV Summer 2017

Thumbs Up given: 271 times
Thumbs Up received: 240 times

Next

Return to Super Mario 63 Level Designer