Okay, here are the cliff notes on what I'd like to see.
Rough descriptionsGameplay: Score out of 12Is the level fun and entertaining to play in general? Does it have much of a replay factor, and does it master the art of platforming, puzzling or scavenger-hunting or alike? Does it have a consistent degree of difficulty, or is it frustrating at times, or is it boring and unoriginal? Are there any bugs or other technical problems? Is there something very special and innovative? Do loading times and eventual lag have an influence on the gameplay?
Atmosphere: Score out of 8How do the overall graphics of the level feel like? Does it have an artistic appeal, is it a coherent and well-balanced work? Are there any obvious flaws like cutoff or messy item placement? Is there a lack of tile variation décor (plants, fences, rocks) where there should be, or is it overworked so it becomes confusing? Furthermore, how do other atmospheric aspects affect the level? Is there a nice music chosen, or does the music choice ruin it? Is there an interesting plot or an amazing storyline that fits with what the level offers?
Guidelines- Judges should play each level until they have figured it out sufficiently. Normally at least two playthroughs are required (at least I always do that).
- Judges should be aware of their playing skill. The question "is this segment too difficult / too unforgiving" is the easiest source of controversy, that's what an ldcjudgings channel is really important for. If a level indeed suffers from bugs or too much difficulty, yet shows effort and interesting stuff, you shouldn't penalize it too much.
- Lag and loading time deductions / bonuses should be kept to a reasonable limit. If a level is small, so it requires heavy tampering to give it a large loading time, it doesn't really deserve a bonus for no loading time. If a level has a large or even insane loading time, check the cause. Have items been used really wastefully, or is it just the theme of the level which requires all kinds of décor? Similar stuff goes for lags - is there blatant spam of enemies or 30-coin-blocks, or is there just no way to avoid the transition being tall or crowded? If you can't come up with a proper idea on how to lift or ease the lag/loading time issues, a large deduction (1 point or more) is not really justified.
- Story and music, whether it's in-game or external, should be usually considered for Atmosphere. You can vary its emphasis, but the recommendation for it is 1.5 points each (the people also have spoken on that). Actually, there can be barely a plot at all (e.g. pure platformer), in this case, you can just omit it entirely if you want to.
- Round your scores to quarters. (Without Other, this is also a lot easier to follow)
Further commentsUnder these circumstances, judges will stay different individuals, and you can still generally judge the way you like, you don't have to change much. Freedom of judging style stays granted while formal inconsistency can, and should be prevented at the same time. There HAS to be a uniform way to go; we always went by the same formal system during 30 LDCs, but sometimes people have undermined its barely defined guidelines to do really stupid stuff when judging, especially in Other, and THAT used to cause controversy and anarchy.
At least, two different scalings in the same LDC is not beneficial at all. There can be room for switching to 10/10 on a music-or-story-based LDC theme upon the host's / the team's discretion, but 12/8 (or 3:2, expressed as proportion) should be the standard thing. If you're having issues with the numbers being not divisible by 5, you can alternatively do it as 15/10 and it will be hit down from 25 to 20 points.
I wouldn't mind actually if scores are increasing a little. With the new system we probably still have to wait a long time (or even infinitely) for the first perfect 20 from a single judge. But more 18s and 19s would be nice. There is no need at all to re-do previous LDCs and decide if Parallel Spires would have won to Destinations with the new system. Past is past.
I'm also starting to fully realize the issues about backup judges. We've had a few LDCs where they were missing and the judges participating had 3 scores while everyone else had four. It would cause a few bias either way, but omitting a backup judge is probably the better choice (unless you would desperately need one since you only have 3 full judges, like in the 5th mini-LDC). Using #ldcjudgings is a MUST to take care of questionable scores and larger discrepancies (5 points or more); even if it delays the results a little, it's worth the effort.
So now, a new general poll is there. Hoping for enforcement.