Re: can anyone help me with the calculation of statistical probability?
- From: Unruh <unruh-spam@xxxxxxxxxxxxxx>
- Date: Wed, 19 Mar 2008 05:22:04 GMT
flame.dawn@xxxxxxxxx writes:
Here is the question. This concerns a claim of plagiarism. There are
two indexes of a similar text numbering about 750,000 words. The first
index has 27,740 terms in it, while the second index has 3,500 terms
in it. The authors of the first index claim that the authors of the
second plagiarized their index, but it turns out the indexes are
mostly different, and only a few terms are similar. Can anyone
calculate what the random similarity would be, i.e., if we assume that
there was no plagiarism and that index 1 (27740 terms) and index 2
(3500 terms) were independently derived, what would be the probability
that some of the terms would still be identical if the text to which
the indexes refer is 80%-90% similar?
??? They are indexing the same text? Of course there are similarities. It
is like claiming that two photos of the White House are plagiarized because
both show a building with white columns.
The only way they could perhaps have substantiated it is by including false
terms in the index, e.g. terms which do not actually appear in the text, or
which are ascribed to the wrong pages.
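
The planted-entry idea above can be sketched in a few lines of Python. This is
only an illustration under assumptions: the function name, the normalization
rule (case and whitespace ignored), and the sample entries are all made up
here and do not come from either index.

```python
def normalize(term):
    """Lower-case a term and collapse runs of whitespace (an assumed matching rule)."""
    return " ".join(term.lower().split())


def planted_terms_copied(planted, suspect_index):
    """Return the deliberately false entries that also show up in the suspect index.

    `planted` holds entries the first indexers inserted on purpose, which
    have no basis in the text. Any that reappear in `suspect_index` are hard
    to explain by independent indexing of the same text.
    """
    suspect = {normalize(t) for t in suspect_index}
    return sorted(t for t in planted if normalize(t) in suspect)


# Hypothetical data, for illustration only.
planted = ["Zyxwell, Treaty of", "moon cheese tariff"]
suspect_index = ["apple", "  Moon  Cheese Tariff", "white columns"]
copied = planted_terms_copied(planted, suspect_index)
print(copied)
```

An empty result proves nothing either way; a non-empty one is the kind of
evidence that overlap in genuine terms cannot provide.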
No statistical test is going to determine anything, since the two are
correlated by being indices of the same text. I.e., what you are trying to
measure is completely irrelevant to the claim.
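
To make the point concrete, here is a rough sketch of the calculation the
question asks for, under the naive assumption that each index is a uniform
random draw from some common pool of candidate terms. The pool sizes below are
pure guesses, not figures from either index; only the 27,740 and 3,500 come
from the question.

```python
def expected_chance_overlap(n1, n2, pool):
    """Expected number of shared terms under a naive independence model.

    Assumes each index is a uniform random sample (without replacement)
    from `pool` equally likely candidate terms. The overlap is then
    hypergeometric, with mean n1 * n2 / pool.
    """
    if pool < max(n1, n2):
        raise ValueError("pool must be at least as large as each index")
    return n1 * n2 / pool


# Index sizes from the question; the pool sizes are hypothetical.
for pool in (30_000, 100_000, 750_000):
    overlap = expected_chance_overlap(27_740, 3_500, pool)
    print(f"pool={pool}: about {overlap:.0f} shared terms expected")
```

The answer moves by more than an order of magnitude depending on the guessed
pool, and real indexers do not pick terms uniformly at random: both are drawn
to the same prominent names and topics in the same text. So the number says
nothing about copying.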