Re: Generate a one-time pad from say a 256bit key?
- From: Unruh <unruh-spam@xxxxxxxxxxxxxx>
- Date: 7 Aug 2006 19:51:59 GMT
ggr@xxxxxxxxxxxx (Gregory G Rose) writes:
In article <1154937437.825861.65700@xxxxxxxxxxxxxxxxxxxxxxxxxxxx>,
TC <gg.20.keen4some@xxxxxxxxxxxxxxx> wrote:
<
<Gregory G Rose wrote:
<.
<>Two-byte Unicode characters can be considered as if the first byte
<> specifies the page of a book of characters, and the second byte
<> specifies the character within the page. Hebrew and Arabic are on
<> different pages, and (IIRC) not all of the character spaces are filled in.
<> So, for every pair of bytes of the ciphertext, there is a slightly higher
<> than random chance that they were actually enciphered by XORing
<> with two consecutive zero bytes... that is, they are actually plaintext.
<
<> So, for each pair of ciphertext bytes, classify them into one of three
<> categories:
<> 1. looks like valid Arabic character
<> 2. looks like valid Hebrew character
<> 3. everything else.
<>
<> Category 3 will swamp the other two categories, but with enough ciphertext,
<> and assuming the plaintext was indeed one of the two character sets, either
<> category 1 or 2 will start to dominate more than can be explained by random
<> chance. As soon as the probability of their deviance becomes statistically
<> significant to, say, 99.9%, you have your answer.
<
<I don't get it.
<
<Even if there were squintillions of 1's, and no 2's at all, how could
<you conclude from this, that the plaintext is in Arabic?
<
<AFAICS, you could only conclude that *if* the plaintext *is* in either
<Arabic or Hebrew, *then*, it is more likely to be in Arabic than in
<Hebrew.
<
<But that's a long way from saying that you can exploit the bias to tell
<what language the plaintext is in. What if the plaintext was in German,
<and German happens to have lots of 1's and not many 2's?
All I claimed originally was that you'd be able to
distinguish between Hebrew and Arabic... reading
back I guess I could have phrased it better, since
you think I was claiming that one could determine
the language a priori. That wasn't what I meant,
although it's still at least feasible. The bias
toward zero bytes from RC4 would allow you to build up a
frequency histogram that would eventually begin to
resemble the classical English (or German)
frequency tables.
I still have difficulty. From the small bias toward zero ( is it a bias
toward zero, or a bias toward "if one is zero then the next will also be
zero), I still find it difficult to believe that the any such frequency
histogram would show anything since the bias would be buried in the natural
variability of the languages.
And the difference between languages lies not so much in the relative
frequencies of the different letters, but in the distribution of digrams
and tridrams in the language. And those distributions are highly dependent
on the texts. They will be different in Shakespeare from in a treatise on
botany. Ie, again I believe that the natural variability of the language
will swamp the biases. In teh case of unicode
Arabic vs Hebrew, the zero entropy in the first code page character would
of course let you distinguish since it has zero entropy. If you know some
feature has zero entropy, then small changes in the bias entropy can
eventually be distinguished.
But it's always easier, statistically, to test
between two hypotheses (eg. German vs. not German,
or Hebrew vs. Arabic) than to try to classify the
input.
.
- Follow-Ups:
- Re: Generate a one-time pad from say a 256bit key?
- From: David Wagner
- Re: Generate a one-time pad from say a 256bit key?
- References:
- Generate a one-time pad from say a 256bit key?
- From: Dave -Turner
- Re: Generate a one-time pad from say a 256bit key?
- From: Bill Unruh
- Re: Generate a one-time pad from say a 256bit key?
- From: Gregory G Rose
- Re: Generate a one-time pad from say a 256bit key?
- From: TC
- Re: Generate a one-time pad from say a 256bit key?
- From: Gregory G Rose
- Generate a one-time pad from say a 256bit key?
- Prev by Date: Re: best cipher for SSH2
- Next by Date: Re: best cipher for SSH2
- Previous by thread: Re: Generate a one-time pad from say a 256bit key?
- Next by thread: Re: Generate a one-time pad from say a 256bit key?
- Index(es):
Relevant Pages
|