Re: ??Difference Between utf8encoder.GetBytes and Encoding.ASCII.GetBytes

From: Joe Kaplan \(MVP - ADSI\) (joseph.e.kaplan_at_removethis.accenture.com)
Date: 02/24/05


Date: Thu, 24 Feb 2005 13:19:43 -0600

Generally speaking, the different encoding classes will give you an array of
bytes from a string corresponding to how that encoding actually represents a
string. Unicode (UTF16) represents each character as 2 bytes. UTF8 will
use a variable number of bytes for each character, but uses only one for
ASCII characters, so it generally uses much less space to store the same
Unicode data.

ASCII converts characters into a single byte using only 7 bits of each byte.
Since it only supports ASCII characters, it can result in data loss if the
string in question contains non-ASCII characters. It rarely has a use in
.NET crypto since strings are unicode in .NET.

If your encryption key is stored as text, it is probably stored in Base64.
In that case, you probably want to use Convert.FromBase64String to convert
the string key into a byte array.

Joe K.

"Phil C." <charlestek@rcn.com> wrote in message
news:u6LpKgpGFHA.2616@tk2msftngp13.phx.gbl...
> Hi. (Using VB.Net) I have a symmetric encryption key stored as text,
> encrytped by DPAPI in my web config that I use a handler
> class to decrypt by the DPAPI and pass to the class that does the
> encryption/decryption.
> The decrypted DPAPI key is a string and needs to be converted to a byte
> array for use by the encryption/decryption class. I'm confused as to the
> difference using utf8encoder.GetBytes() or Encoding.ASCII.GetBytes() to do
> this.
>
> Thanks,
>
> Phil
> Boston, MA
>



Relevant Pages

  • Re: Fast UTF-8 strlen function
    ... >> Is there a fast UTF-8 string length function floating around? ... Length in bytes, or length in characters? ... For UTF-8, the main basic "change" you have to make to your string routines ... then I could individually look up the characters in my UNICODE ...
    (alt.lang.asm)
  • Re: Prothon should not borrow Python strings!
    ... """It does not make sense to have a string without knowing what encoding ... same cul de sac as Python. ... Prothon_String_As_ASCII // raises error if there are high characters ... Python's split between byte strings and Unicode strings is ...
    (comp.lang.python)
  • Re: VB6 ANSI to Unicode conversion wrong with fixed-length strings in structures
    ... when a string is passed ByRef. ... One way to show Unicode ... > I have a Fortran DLL that returns a structure containing fixed-length ... > returning Chinese characters. ...
    (microsoft.public.vb.bugs)
  • Re: UTF of Java strings?
    ... Java's Unicode support is a mess. ... characters could be represented as 16-bit integers into the /design of the ... a Java String is now ... But what's worse, as I understand it, is that Unicode data is now ...
    (comp.lang.java.programmer)
  • Re: fgets question
    ... documentation didn't say if fgets put \0 after a string literal. ... fgets() has nothing at all to do with string literals. ... within the bounds of the array, then it is not a string. ... strlenon an array of characters that is not a string, ...
    (comp.lang.c)