Some thoughts on Z340 (dual-alphabet?)

darrenc · 2018-07-29T13:30:04Z

Hi forum! I have been having some thoughts on how the Zodiac may have enciphered Z340 after the quick break of Z408. I think by now everyone agrees it is not a vanilla homophone cipher - but I also don't believe it is going to be overly complicated. He would obviously want the cipher to be stronger but would presumably be limited to pen and paper. I have seen some discussion on the odd/even properties of Z340 but I don't think anyone has (yet) suggested that perhaps Z340 was enciphered with a dual-alphabet system - one for the odds and one for the evens. This would have been relatively straightforward for the Zodiac to implement as he would just have to double-up his efforts from the Z408 - the overall system would remain the same. This could also help explain some of the odd properties of Z340. I believe some of the patterns we can see may actually be intentional red-herrings on the part of Z. I also believe the the "ZODAIK" signature is legitimate ciphertext but misspelled because that was the closest he could get with the symbols he had to chose from. Anyway, I decided to test the dual-alphabet idea by passing Z340 through a remapping process which reassigned every second symbol to a new alphabet, effectively creating a new version of Z340 which contains 112 unique symbols. I could then process this in zkdecrypto as normal. The modified message is at the bottom of this post. Splitting the symbols does flatten the frequency curve a little (14, 10, 8, 7, ...) It can be further flattened at the high end by assuming the set (B, - and mirrored-D) are homophones (I have good reasons for suspecting this but that is a subject for another day). Unfortunately it also creates 22 single-occurrence symbols. Running through zkdecrypto produces the usual english-looking-rubbish (ELR), but with much fewer "breaks" in the flow than what I get with vanilla Z340. This may be simply due to having more symbols available to assign. I decided to test further by creating my own known plaintext cipher of a similar length using the same dual-alphabet approach. I also found this message produced nothing but ELR in zkdecrypto - only one word was correct (YOUR). If I had not known the plaintext I would have had no hope of breaking it. I am not sure where to proceed from here. I am just putting this out here for any interested parties. Here is the modified Z340: H!R"Ð#^$P%I&L'G(Ä )Ð*B,·.º0W1•2»3Æ4 B5„6M*u7G8¢4L9¤:J ;Ð=½?Ì@»$´=O*+AK( ¸QM*¤UÊYI[F+]³%/ =¾A^_Ì.-`ÄaË_>(D, ·[+3Ñbƒ(u6XcVd¤eI ,G(JfÊ9O*¸)yg+hLQ Ä2M*Â*ZA±_B6Ÿi°jK k¤#u$+?J*O=½2FlŸk u*RmµUEnD1B=Â'M3O (<6ÌAJn»[TjMd+]B_ ¤o¼;Ÿ9+)I[FlÃ,ƒ@R #G_N?Æ[±jÂdÃ$³U+* ŸlX&»p³QC!>$u7µk+ nÃd´gB3¢.Ð?•fMqG( R6T*L&°a<*F#WlI4L *+4Wa¤8ÃO;H'/,£= I_ËYW2½UBry.B`-aÃ "M0H)Ð%SgZ.¾iI3ƒ*

smokie treats

(@smokie-treats)

Posts: 1626

Noble Member

Largo: See the 5th post down for another interpretation of the unique symbol strings, and the discussion carries over to the next page.

viewtopic.php?f=81&t=3196&start=70

Posted : August 12, 2018 9:54 pm

Jarlve

(@jarlve)

Posts: 2549

Famed Member

I am considering excluding sequences that terminate at the end of the cipher because their length is uncertain. Good point Largo.

What about normalizing the ones at the end by the fraction of the max possible length?
For example, Z340 has a max possible unique length of 63 due to its alphabet.
Towards the beginning, a unique string of length 10, out of a possible 63, would score: 63/63 = 1. Add 1 to the "10" column in the histogram.
But let’s say towards the end, a unique string of length 10 is found, but a max length of only 40 is possible. So it would score: 40/63 = 0.63. Add 0.63 to the "10" column in the histogram.

Not sure how much it would matter.

Only the sequences that terminate at the last position of the cipher are uncertain and normalization does not fix that.

AZdecrypt

Posted : August 13, 2018 4:41 pm

Largo

(@largo)

Posts: 454

Honorable Member

Largo: See the 5th post down for another interpretation of the unique symbol strings, and the discussion carries over to the next page.

viewtopic.php?f=81&t=3196&start=70

Thank you smokie! I’ll read these posts.

Posted : August 13, 2018 5:49 pm

Largo

(@largo)

Posts: 454

Honorable Member

Largo: See the 5th post down for another interpretation of the unique symbol strings, and the discussion carries over to the next page.
viewtopic.php?f=81&t=3196&start=70

…from that thread:

Conclusion: It does seem that there is some encoding mechanism at work besides just cycling homophones, and whatever is disrupting the cycles does not appear to be random symbol selection. And that slope, the sudden drop after x = 17 for the 340 may be another indicator of the mechanism.

Has anyone ever examined the 26 segments in more detail? For example, whether there are repeated bigrams within a segment, or whether the repeated bigrams extend over several segments. It would also be interesting to compare the unigrams of all 26 segments.
If so, and if I overlooked it: Sorry =)

I wonder how the start and stop positions for the unique strings of length about 17 compare with the period 19 bigram repeat symbol positions. My guess is that, since the + is highest count with 24, it is most frequently terminating unique strings. And because it is heavily represented in the period 19 repeats, I guess that those positions would match up. But I haven’t looked.

Has someone done this yet? It would be interesting to color the segments with Peek-a-boo, then apply P19 and see if there is any noticeable pattern. I’ll put that on my todo list. Let’s see if and when I can do that in the next few days.

3. Some sort of exotic cycling scheme which cuts the cycles short at some point.

Hm…that alone does not explain why z340 cannot be solved. AZDecrypt / TheRaccoon have no problems with completely random cycles. So maybe some kind of " interlocked parts " when creating the final cipher (e.g. 75% of the cipher is in the correct reading order, the other 25% are interspersed according to a pattern. Or a pattern of zeros inside a transposition. That sounds a little vague, but I have concrete ideas on this. I’ll give you an example soon.

PS: I am not sure if I understood everything in the linked posts correctly. There’s a lot for me to catch up.

Translated with http://www.DeepL.com/Translator

Posted : August 15, 2018 12:01 am

smokie treats

(@smokie-treats)

Posts: 1626

Noble Member

Has anyone ever examined the 26 segments in more detail? For example, whether there are repeated bigrams within a segment, or whether the repeated bigrams extend over several segments. It would also be interesting to compare the unigrams of all 26 segments.
If so, and if I overlooked it: Sorry =)

I started thinking about doing that just last weekend, but didn’t get very far. I have to get myself psychologically motivated to do stuff like that now, but it is starting to bother me I was thinking about the 26 segments on my commute to work this morning.

Has someone done this yet? It would be interesting to color the segments with Peek-a-boo, then apply P19 and see if there is any noticeable pattern. I’ll put that on my todo list. Let’s see if and when I can do that in the next few days.

Again, I have not done this. It would be pretty easy to do though.

**********
viewtopic.php?f=81&t=3196&start=70

It doesn’t appear that I transposed the plaintext first before encoding.

O.k., so we are measuring the distance between A and B2, where the segments can look something like this:

A…..B1………B2
A..B1…………B2
A…………B1..B2

Forgive if I cannot remember all before. I know that there is a measurement for the distance between B1 and B2, but I am not sure how it compares. Is there a measurement for the distance between A and B1?

Has anyone looked at the A to B2 segments for English plaintext?

Posted : August 15, 2018 4:48 pm

Largo

(@largo)

Posts: 454

Honorable Member

The longer I look at the cycles, the more doubts arise in me whether they are really relevant.

I’m one of those people who don’t just rely on numbers and statistics. Instead, I try to reconstruct how I would have created z340 instead of Zodiac. I am absolutely aware that this is largely a game of chance. An example I have already mentioned:
Zodiac knew that repeated bigrams were the main target for cracking z408. A logical conclusion is that this point should be avoided in the future. This assumption is the easiest to explain the incomplete cycles. Of course, this assumption can be completely wrong. One fishes in the mud. However, we also do so if we look at all the known statistics. P15/P19, cycles, IOC, Unigram distribution and what else. All these are clues. Sure… the sigma says something else. I am not suggesting that these statistics are a coincidence or irrelevant. But they have not yet led us to a solution. Maybe we’re too focused on it and don’t see the essential things. Once a question is reasonably clarified, three new questions arise from these answers.

Another point to consider (I think smokie or I have described this somewhere before): If z340 is based on transposition, why bother to hide the cycles? In 1969 one could be very sure that a well transposed text in combination with homophonic substitution would be extremely difficult to solve. If at all. Moreover: Clean cycles would rather lead on the wrong track. Here, too, one enters the realm of speculation. It is difficult to put yourself in the position of a disturbed personality like Zodiac and to search for logical conclusions at the same time.

For the moment I will focus my further efforts in a different direction. I try to look as little as possible at statistics and prefer creative transpositions or interlocked encryption instead. I have found some other, quite simple possibilities that cannot be solved by the AZDecrypt transposition solver. Perhaps this will also help in favouring other points of view.

And as always: Maybe tomorrow I will change my mind again =)

https://en.wikipedia.org/wiki/Occam%27s_razor

Translated with http://www.DeepL.com/Translator

Posted : August 15, 2018 9:45 pm

smokie treats

(@smokie-treats)

Posts: 1626

Noble Member

The longer I look at the cycles, the more doubts arise in me whether they are really relevant.

I agree and this is why sometimes I stop looking at them.

Another point to consider (I think smokie or I have described this somewhere before): If z340 is based on transposition, why bother to hide the cycles?

I don’t know who said it but I have thought it. I agree.

Check out the first post here. Maybe he just did some creative pattern with his homophone groups. He may not have been trying to hide anything.

http://www.zodiackillersite.com/viewtop … f=81&t=267

It is annoying though to not know the answer. It would be interesting to know why there are 26 segments of length 17.

Posted : August 15, 2018 10:02 pm

doranchak

(@doranchak)

Posts: 2614

Member Admin

The cycles still baffle me. There’s just enough left in Z340 to seem statistically significant. For example, my perfect L2 cycle score is only met or exceeded in 1 in 5,000 shuffles of Z340 (compared to 1 in 500,000 shuffles for Z408). 6 sigma vs 32 sigma. But if higher-order cycles were ever present, they got really destroyed by whatever extra steps Zodiac did. For example, about 1 in 30 shuffles have L4 scores as good as Z340’s score. The sigma goes up with L *a lot* for Z408, but down with L for Z340.

Z408 cycle shuffle sigmas:
L2: 31.5
L3: 78.0
L4: 160.7

Z340 cycle shuffle sigmas:
L2: 5.8
L3: 3.9
L4: 2.2

Perhaps the transposition is brutal to higher order cycles but leaves a bit of L2 cycling behind, since they are harder to fully disrupt than higher-order cycles.

Or we’re not really looking at a cycling phenomenon. What about a shifting alphabet?

http://zodiackillerciphers.com

Posted : August 15, 2018 10:14 pm

Largo

(@largo)

Posts: 454

Honorable Member

Check out the first post here. Maybe he just did some creative pattern with his homophone groups. He may not have been trying to hide anything.

http://www.zodiackillersite.com/viewtop … f=81&t=267

Let’s say z340 is solved. Let’s also say, there was a transposition and it was substituted with a creative pattern. Since we would then know the solution, we would also see the pattern. How would you rate the pattern then? In what way would it have helped us to find a solution in retrospect? I really think that a possibly existing cycle pattern is only important if there is a transposition/route in the final cipher and not only in plain text.

Or we’re not really looking at a cycling phenomenon. What about a shifting alphabet?

I’ve thought of that, too. This might also explain the frequent occurrence of the + symbol. But I thought there was a thread where multiple alphabets were excluded. Otherwise, we could really look at it again. I had done some tests earlier with heavily repeated text fragments. The 26×17 phenomenon can also be caused by this. However, I never managed to get as few ngrams as z340, so repeated text would definitely become apparent.

Posted : August 15, 2018 10:26 pm

Largo

(@largo)

Posts: 454

Honorable Member

But…multiple alphabets / shifted alphabet + P15/P19? I don’t think both can be at the same time.

Posted : August 15, 2018 10:41 pm

Jarlve

(@jarlve)

Posts: 2549

Famed Member

Or we’re not really looking at a cycling phenomenon. What about a shifting alphabet?

I like that. How do you imagine the shift?

AZdecrypt

Posted : August 16, 2018 10:19 pm

doranchak

(@doranchak)

Posts: 2614

Member Admin

I like that. How do you imagine the shift?

No idea. Vigenere comes to mind. Or what if he did something to the alphabet every X assignments, where X is chosen to sufficiently explain the oddities we see in Z340’s stats? If X is greater than one, maybe that would make the cipher only partially polyalphabetic while making the stats reflect some echoes of monoalphabetic substitution. Maybe he periodically cycled the alphabet just like he cycled individual homophones.

http://zodiackillerciphers.com

Posted : August 16, 2018 10:29 pm

smokie treats

(@smokie-treats)

Posts: 1626

Noble Member

I like that idea too. An Alberti, or Caesar shift, maybe on a period.

See several posts here where I toyed with the idea in a different context to try to explain the regional bias symbols. Depending on the shift, of which there are 25 possible, the resulting ciphertext, which looks just like letters of the alphabet, the frequencies will be different. I called it the shift differential. And with the right period, or number of letters before shifting the key, maybe that could explain 26×7 and some other things.

viewtopic.php?f=81&t=3196&p=56084&hilit=alberti#p56084

In other words, I use a particular key on my Alberti disk to encode X number of letters. Then I rotate the disk, and encode X number of letters again. Repeat until I get to the end of the message. Then encode homophonic.

Consider trying a lot of different combinations of rotations and X number of letters to see what happens. Could that cause P19 too somehow?

I also like it because in a cryptography book that is organized chronologically, the Alberti cipher will be in the chapter next to the homophonic cipher. They were both Renaissance era.

Posted : August 17, 2018 3:19 am

doranchak

(@doranchak)

Posts: 2614

Member Admin

Interesting. I wonder if something like an Alberti cipher could explain the Kasiski spike at shift 78: http://www.zodiackillersite.com/viewtop … 225#p48225

http://zodiackillerciphers.com

Posted : August 17, 2018 4:37 am

smokie treats

(@smokie-treats)

Posts: 1626

Noble Member

I have been messing around with it. No transposition, Alberti, odd rows one random alphabet key, even rows the other random alphabet key, perfect cycles, homophone keys of varying efficiency ( number of symbols in proportion to number of Alberti letters in the message to almost same number of symbols, 3 to 4, per Alberti letter ).

The bigram spike is almost always at period 1, and at any other period higher than 17 ( the next row ) not even close to 340 P19. The lengths of the segments vary, and the spike can be at longer than 17, especially for inefficient homophone keys, but no spike very close to 26. And we have already seen that randomization shortens the segments. The coincidence count graphs look similar, and there is often a lone spike that is fairly close.

In my opinion, that ain’t it.

EDIT: Source of plaintext random selections from a novel.

Posted : August 18, 2018 7:43 pm

Zodiac Discussion Forum