Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Here's "Einstein’s puzzle" from this paper: https://www.researchgate.net/publication/341189675_Is_Einste...

  H01 There are five houses.
  H02 The Englishman lives in the red house.
  H03 The Spaniard owns the dog.
  H04 Coffee is drunk in the green house.
  H05 The Ukrainian drinks tea.
  H06 The green house is immediately to the right of the ivory house.
  H07 The Old Gold smoker owns snails.
  H08 Kools are smoked in the yellow house.
  H09 Milk is drunk in the middle house.
  H10 The Norwegian lives in the first house.
  H11 The man who smokes Chesterfields lives in the house next to the man with the fox.
  H12 Kools are smoked in a house next to the house where the horse is kept.
  H13 The Lucky Strike smoker drinks orange juice.
  H14 The Japanese smokes Parliaments.
  H15 The Norwegian lives next to the blue house.
  Now,
  Q1 Who drinks water?
  Q2 Who owns the zebra?


I ran a few replacements because the puzzle is probably in the training data somewhere:

  Englishman -> Scotsman
  Spaniard -> Welshman
  dog -> parrot
  Coffee -> Kombucha
  green -> scarlet
  tea -> butterscotch
  Ukrainian -> Romanian
  ivory -> pink
  snails -> scorpions
  yellow -> turquoise
  Milk -> Red Bull
  Norwegian -> Brazilian
  fox -> bear
  horse -> mule
  orange juice -> rum
  Japanese -> German
  blue -> brown
  red -> purple
Here's my new prompt after replacing those words:

  H01 There are five huts.
  H02 The Scotsman lives in the purple hut.
  H03 The Welshman owns the parrot.
  H04 Kombucha is drunk in the scarlet hut.
  H05 The Romanian drinks butterscotch.
  H06 The scarlet hut is immediately to the right of the pink hut.
  H07 The Old Gold smoker owns scorpions.
  H08 Kools are smoked in the turquoise hut.
  H09 Red Bull is drunk in the middle hut.
  H10 The Brazilian lives in the first hut.
  H11 The man who smokes Chesterfields lives in the hut next to the man with the bear.
  H12 Kools are smoked in a hut next to the hut where the mule is kept.
  H13 The Lucky Strike smoker drinks rum.
  H14 The German smokes Parliaments.
  H15 The Brazilian lives next to the brown hut.
  Now,
  Q1 Who drinks water?
  Q2 Who owns the zebra?

Then I ran the new prompt through o3-mini-high and it gave this answer (after a thought process that looks legit to me, like it actually did reason through the puzzle and not just spit out an answer it had seen):

  Q1: Who drinks water?
  A: The Brazilian (in Hut 1) drinks water.

  Q2: Who owns the zebra?
  A: The German (in Hut 5) owns the zebra.
Undoing my replacements, that's the Norwegian drinking water and the Japanese person owning a zebra. Which I think is the solution from the original paper.

Here's my o3-mini-high transcript: https://chatgpt.com/share/679efae7-8544-8006-960b-a744cc1fea...


I just did a pass with some replacements with o1 and it very much still recognized it as the Einstein riddle and actually seems to have cheated a bit :)

"Revisiting assumptions

Considering "Camels" might be a mistake for "Kools," leading to confusion. This inconsistency complicates solving the puzzle, showing the need for careful brand assignment."

Tracking puzzle progress

I’m mapping out various house and nationality combinations, but the classic conclusion is the Norwegian drinks water and the Japanese owns the zebra.

Analyzing the arrangement

I’m working through the classic puzzle structure and noting variations, while consistently identifying the Norwegian drinking water and the Japanese owning the zebra as the final solution."


Hah, that's fun. My o3-mini-high transcript didn't hint that it recognized the puzzle and looked legit when I scanned through them, but I'm still very suspicious since this is evidently such a classic puzzle.

I should have changed the cigarette brands to something else too.


If you want to make a cosmetic change to the puzzle, you might try eliminating the massive quantity of implicit information in "the green house is immediately to the right of the ivory house".

After doing some substitutions on what it means to be in positions 1/2/3/4/5:

A. If the ivory house is in London, the green house is in Madrid.

B. If the ivory house is in Madrid, the green house is in Kiev.

C. If the ivory house is in Kiev, the green house is in Oslo.

D. If the ivory house is in Oslo, the green house is in Tokyo.

E. The ivory house is not in Tokyo.

9. Milk is drunk in Kiev.

11(A). If the man who smokes Chesterfields lives in Tokyo, the man with the fox lives in Oslo.

12(A). If the man with the horse lives in Oslo, Kools are smoked in either Tokyo or Kiev.

15(A). If the blue house is in Madrid, the Norwegian lives either in London or in Kiev.

[...]

Another easy change is to exchange categories. Swap the animals for the drinks and instead of "the Spaniard owns the dog" and "the Ukrainian drinks tea", you'll have "the Spaniard drinks tea" and "the Ukrainian owns the fox" (depending on which equivalences you decide on). It won't make any difference to the puzzle, but it will permute the answer.


Try flipping the order, adding a few nonsense steps and combining 2 steps into one and also splitting a single step into two. And then see what happens and post it here. :-)


I tried it against deepseek-r1-distill-llama-70b running on Groq (which is really fast) and it didn't get the right answer: https://gist.github.com/simonw/487c4c074cd6ad163dba061e1e594...

I ran it like this:

  llm -m groq/deepseek-r1-distill-llama-70b '
    H01 There are five huts.
    H02 The Scotsman lives in the purple hut.
    H03 The Welshman owns the parrot.
    H04 Kombucha is drunk in the scarlet hut.
    H05 The Romanian drinks butterscotch.
    H06 The scarlet hut is immediately to the right of the pink hut.
    H07 The Old Gold smoker owns scorpions.
    H08 Kools are smoked in the turquoise hut.
    H09 Red Bull is drunk in the middle hut.
    H10 The Brazilian lives in the first hut.
    H11 The man who smokes Chesterfields lives in the hut next to the man with the bear.
    H12 Kools are smoked in a hut next to the hut where the mule is kept.
    H13 The Lucky Strike smoker drinks rum.
    H14 The German smokes Parliaments.
    H15 The Brazilian lives next to the brown hut.
    Now,
    Q1 Who drinks water?
    Q2 Who owns the zebra?'
Using this plugin: https://github.com/angerman/llm-groq


Full DeepSeek R1 - accessed through the DeepSeek API (their "deepseek-reasoner" model) - got the right answer: https://gist.github.com/simonw/f77be3bbc720e1314235d42593562...


Whatever model is behind chat.deepseek.com got it in 348 seconds.

It amazes me they don't time that thing out after, IDK, 5 minutes of computation time.


> the puzzle is probably in the training data somewhere

Given that these models can perform translation I'm not sure why you think renaming things is sufficient to put your version out of distribution.


I don't - but like I said, I reviewed the thought process in the transcript and it looked legit to me.

I'm not sure what else I could do here to be honest, without coming up with a completely new puzzle that captures the same kind of challenge as the original. I'm not nearly patient enough to do that!


The thought process is basically like asking it to think in French. Same training data LOL


ClaudeAI responded: >>> After working through all constraints: Q1: Who drinks water? The German drinks water. Q2: Who owns the zebra? The Scotsman owns the zebra. <<<


Interesting o3-mini produces a correct response: https://chatgpt.com/share/679efd3f-8e88-800d-9aa8-e3fadccf7c...


I am reminded of this Reddit post from November last year, except this is fifth grade math homework:

https://www.reddit.com/r/mildlyinfuriating/comments/1gippqm/...

The puzzle is a grid of simultaneous equations with 9 unknowns:

  9 + ? - 6 = ? - 3
  -   =   -   =   x
  4 + ? x 5 - ? = 6
  /   +   +   -   -
  8 - 5 = ? - ? x 2
  x   -   -   -   x
  ? + 1 + 7 - 4 = 8
  =   -   =   +   =
  7 = ? x 3 - ? - 2


Deepthink thought for 644 seconds and says the answer is 4.

Probably didn't help that I just pasted it with no explanation, the spaces between the columns really threw it off.


Here’s the same puzzle minus the grid parsing step:

  A1: 9+a-6=b-3
  A2: 4+(c*5)-d=6
  A3: 8-5=e-(f*2)
  A4: g+1+7-4=8
  A5: 7=(h*3)-i-2
  D1: 9-((4/8)*g)=7
  D2: a=c+5-1-h
  D3: 6-5+e-7=3
  D4: b=d-f-4+i
  D5: (3*6)-(2*8)=2


(Not fair because it is in the training set, but o3-mini-high gives a thorough answer after 66s)

Below is a step‐by‐step reasoning that leads to the well‐known solution of the “Zebra Puzzle.”

Step 1. Number the Houses 1 to 5

We assume the houses are arranged in a row from left to right, with House 1 on the far left and House 5 on the far right.

Step 2. Assign the Easy Clues 1. H10: “The Norwegian lives in the first house.” → House 1: Nationality = Norwegian. 2. H15: “The Norwegian lives next to the blue house.” House 1 has only one neighbor (House 2), so → House 2: Color = Blue. 3. H09: “Milk is drunk in the middle house.” → House 3: Beverage = Milk.

Step 3. Determine the House Colors with the Ivory/Green Pair • H06: “The green house is immediately to the right of the ivory house.” There are two possible placements for this pair: either Houses (3,4) or Houses (4,5). The option (3,4) would force House 3 to be ivory, but later clues will assign House 3 a color (red, see H02) for the Englishman. So we take the pair as Houses 4 and 5: • House 4: Color = Ivory. • House 5: Color = Green. • H04: “Coffee is drunk in the green house.” → House 5: Beverage = Coffee. • The remaining colors for Houses 1–3 are now Red, Yellow, and (already used) Blue. H02: “The Englishman lives in the red house.” Since House 1 is taken by the Norwegian and House 2 is blue, the only possibility is: → House 3: Color = Red, Nationality = Englishman. • The only color left for House 1 is Yellow. H08: “Kools are smoked in the yellow house.” → House 1: Color = Yellow, Cigarette = Kools.

Step 4. Assign the Remaining Nationalities and Beverages

The nationalities so far: • House 1: Norwegian • House 3: Englishman

Remaining nationalities: Ukrainian, Spaniard, Japanese. • H05: “The Ukrainian drinks tea.” House 2 already is blue and unassigned for nationality and beverage, so: → House 2: Nationality = Ukrainian, Beverage = Tea.

Now for Houses 4 and 5, the remaining nationalities are Spaniard and Japanese. • H03: “The Spaniard owns the dog.” And • H14: “The Japanese smokes Parliaments.”

We still have the beverages and cigarettes to assign. Notice that House 5 (green) already has its beverage (coffee). It is natural then to assign: • House 5: Nationality = Japanese, Cigarette = Parliaments. This leaves: • House 4: Nationality = Spaniard, and by H03, → House 4: Pet = Dog.

Step 5. Use the Clues about Cigarettes and Pets

The remaining cigarette brands to assign (across Houses 2, 3, and 4) are: Old Gold, Chesterfields, and Lucky Strike. • H13: “The Lucky Strike smoker drinks orange juice.” House 5 already drinks coffee; House 3 has milk; House 2 has tea; so the only house with an unassigned beverage is House 4. → House 4: Beverage = Orange juice, Cigarette = Lucky Strike. • Now, the remaining cigarettes for Houses 2 and 3 are: Old Gold and Chesterfields. • H07: “The Old Gold smoker owns snails.” • H11: “The man who smokes Chesterfields lives in the house next to the man with the fox.” • H12: “Kools are smoked in a house next to the house where the horse is kept.” House 1 (which smokes Kools) has only one neighbor, House 2, so: → House 2: Pet = Horse.

Now, consider Houses 2 and 3 for the remaining cigarette assignments: • If House 2 were the Old Gold smoker, then by H07 it would have to own snails—but it already has the horse. So, House 2 must smoke Chesterfields, and consequently, → House 2: Cigarette = Chesterfields. • This forces: → House 3: Cigarette = Old Gold, and by H07, → House 3: Pet = Snails.

Finally, H11: “The man who smokes Chesterfields lives in the house next to the man with the fox.” House 2 (smoking Chesterfields) is adjacent to Houses 1 and 3. House 3 already has snails, so → House 1: must have the fox. → House 1: Pet = Fox.

The only pet left is the zebra. The only house without a pet now is House 5, so: → House 5: Pet = Zebra.

Step 6. Assign the Final Beverage

At this point, the beverages are: • House 2: Tea • House 3: Milk • House 5: Coffee • House 4: Orange juice

The only beverage not yet used is water. The only house still without a beverage is House 1, so: → House 1: Beverage = Water.

Final House Assignments • House 1: Color: Yellow Nationality: Norwegian Beverage: Water Cigarette: Kools Pet: Fox • House 2: Color: Blue Nationality: Ukrainian Beverage: Tea Cigarette: Chesterfields Pet: Horse • House 3: Color: Red Nationality: Englishman Beverage: Milk Cigarette: Old Gold Pet: Snails • House 4: Color: Ivory Nationality: Spaniard Beverage: Orange juice Cigarette: Lucky Strike Pet: Dog • House 5: Color: Green Nationality: Japanese Beverage: Coffee Cigarette: Parliaments Pet: Zebra

Cross Check Against the Clues • H02: Englishman in the red house → House 3 is red and English. • H03: Spaniard owns the dog → House 4 (Spaniard) has a dog. • H04: Coffee in the green house → House 5 (green) has coffee. • H05: Ukrainian drinks tea → House 2 (Ukrainian) has tea. • H06: Green house is immediately right of the ivory house → Houses 4 (ivory) and 5 (green). • H07: Old Gold smoker owns snails → House 3 (Old Gold) has snails. • H08: Kools in the yellow house → House 1 (yellow) smokes Kools. • H09: Milk in the middle house → House 3 has milk. • H10: Norwegian in House 1 → correct. • H11: Chesterfields smoker (House 2) lives next to fox owner (House 1). • H12: Kools (House 1) next to horse (House 2). • H13: Lucky Strike smoker (House 4) drinks orange juice. • H14: Japanese smokes Parliaments → House 5. • H15: Norwegian (House 1) lives next to blue house (House 2).

All clues are satisfied.

Final Answers • Q1: Who drinks water? → The Norwegian in House 1 drinks water. • Q2: Who owns the zebra? → The Japanese in House 5 owns the zebra.

Sources and Further Reading • For a classic discussion of this puzzle and its variations, see “Einstein’s Riddle” as presented in various logic puzzle collections and articles such as those found on Brainzilla or in books on lateral thinking puzzles.

This structured reasoning confirms the answers with a high degree of certainty.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: