• @ SCOS Message Format ?

    From Byrl Raze Buckbriar@21:1/5 to All on Sun Feb 9 08:00:29 2025
    @@@ BEGIN MSG.XZ.B64.17.76.SCOS @@@ JTMY$ew:&y020Z{vYn;Gxxn\=5p{R!.;.OJQ4){i'$&vwe@4F$d|LFdM0_FEswtR*0ZJ]"+ fQN}t7$gB]()N]^*kWMk@tbJcV@3vm{<\+LH?J~e:&532S{);uIuXOh%@M((kV|GL^fJBM= vh'#{+b,EB67MjQ'keML~~lvSX$+CA:\kj=EII't>IJ@*JFXKzhJ<7viqpUriUPONm"K[8_ &2XJ|(iQd[~m0mNJ&Q5[;}2zUbx-0eiJ-v9e]y3$OFF=dhbSC07.|P5-HsS;pTI_ru!5#az zg\j,'m8iNE4h<?(]jIcC,:uE{A[fH|_f6R\aKl_wn$%Blh-^yoTL<-Xnh|}7x#Nh$!meM[ g5b,U&sjGV2.0UgOodJF[;8hnvDu48/HumuIC%sT;=G]gsiD/!UF+|!fh,a!KNGE{@BR8B" -MUD.56i.&(*1<_*3hw6Y67U|S/\32'0.;4HQQ:XJ4[WuJId!wea.TtxQSdQ[aSoE(+lSBK 5nUZ9'CzpO)*9KM}p=fbL'$xmJ#8WDYL(vxolNniOlP;fuM9I.KC#&L_vP*IB58iW,bS>i^ 05PhF[-wMBArfQ/!8Ycrd=tt2VD[#wr|Hn^3Q|rqquSNy*Jn:w2o#"E{:n)96UR6=3s7OP% xe!P+6/!Wf#4Stl>H$3X-@+5yBPz:LA||(hUM)xoG:MConE<W.rYO$A=ZNX"x07CFA1igeJ !+fH-?pQO)1kTC]9JveV@MC_6c8rgi?]_oRA}]@hZpS]y3-e^pYH;^yl0Y
    @@@ END SCOS @@@

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Onion Courier@21:1/5 to Byrl Raze Buckbriar on Sun Feb 9 21:40:58 2025
    Byrl Raze Buckbriar wrote:


    @@@ BEGIN MSG.XZ.B64.17.76.SCOS @@@ JTMY$ew:&y020Z{vYn;Gxxn\=5p{R!.;.OJQ4){i'$&vwe@4F$d|LFdM0_FEswtR*0ZJ]"+
    [...]

    $ card get vqr77COIkKA2GalJ3liURWsdCZVFAfcjp7gPksqpiPo

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Byrl Raze Buckbriar on Mon Feb 10 00:12:44 2025
    On 09/02/2025 14:00, Byrl Raze Buckbriar wrote:

    @@@ BEGIN MSG.XZ.B64.17.76.SCOS @@@

    Key: 12 34

    ewBn :k8PsbZ 1]T0 ; k!Rz&Ou 'f*Ea#!


    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Rich@21:1/5 to Richard Heathfield on Mon Feb 10 02:21:25 2025
    Richard Heathfield <rjh@cpax.org.uk> wrote:
    On 09/02/2025 14:00, Byrl Raze Buckbriar wrote:

    @@@ BEGIN MSG.XZ.B64.17.76.SCOS @@@

    Key: 12 34

    ewBn :k8PsbZ 1]T0 ; k!Rz&Ou 'f*Ea#!

    Key: 43 21

    4Lj q=/b m) /EZt @F Nl 3{Vc[D \cm_'KeQ

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Rich on Mon Feb 10 03:29:37 2025
    On 10/02/2025 02:21, Rich wrote:
    4Lj q=/b m) /EZt @F Nl 3{Vc[D \cm_'KeQ

    010000010110001001110011011011110110110001110101011101000110010101101100011110010010000100001010

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to David Entwistle on Sun Feb 23 10:55:41 2025
    On 23/02/2025 10:19, David Entwistle wrote:
    On Sat, 22 Feb 2025 08:57:02 +0000, Richard Heathfield wrote:

    Assumes "facts" not in evidence.

    Quite right to ask. :)

    You are probably correct in a lot of what you say, and I accept I may be wrong in much of what I say. Particularly emulation, maybe that isn't the cause of my problem.

    I'd have thought it unlikely.

    I've posted some test sequences of the SCOS character set to uk.telecom.
    Feel free to have a look using your preferred reader.

    I did. I found no articles under your name in the current feed
    for that group.

    I've concentrated on
    the sequences where I've noticed I have a problem. There may be others I haven't thought about. They set off as a tidy column of numbers, letters
    and punctuation marks that are taken from the SCOS character set. I have included back tick, for completeness.

    I have found nothing of the kind in that group.


    I've then read the posts using Pan and Thunderbird. Both exhibit some
    issues which would prohibit successful decryption in certain
    circumstances.

    Some of the sequence of characters that could be interpreted as emoticons
    get removed and are replaced by unicode characters (or something else) and displayed as an emoticon in both readers. In both cases the underlying characters never reach the reader's display, so any attempt at
    decrytption, using the ciphertext as presented, will fail. This includes:

    :) – Smiley face
    :( – Sad face
    ;) – Wink
    :'( – Crying

    I can make three of these (the last three) into valid C code:

    a=b?c:(d+e);
    for(;;)
    f=g?h:'(';

    If your newsreader turns valid C into sad faces, your newsreader
    is broken. Solution: download a newsreader that works.

    Coupled with the other formatting issues make it feel a bit
    of a uphill task, which is easy to avoid if we accept some restriction on
    the character set.

    I don't think it's necessary. Working newsreaders are easy to find.


    Please don't feel I getting at you over this.

    I don't... and this is the main reason for my reply. I want you
    to be absolutely clear about this, David: your articles are
    unfailingly polite, and they ooze good faith. I will always take
    good faith at face value, regardless of how much you bicker.
    Bickering is, after all, the essence of Usenet, yes?

    I will not deny that I find it a little frustrating that you have
    continually ignored my substantive point, which is that you are
    attempting to fix the wrong software. But a reasonable man is
    open to persuasion, and you do strike me as being a reasonable
    man, so I live in hope.

    I'd like to take a greater
    part in sci.crypt, but I don't think we're all seeing the same things,
    which makes participation difficult.

    The fix is simple. Get a newsreader that doesn't mangle articles,
    not least because much of what is posted here is source code, and
    an attempt to change programming language syntax to indulge Pan's
    corrupting whim is not likely to succeed.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From David Entwistle@21:1/5 to Richard Heathfield on Sun Feb 23 10:19:37 2025
    On Sat, 22 Feb 2025 08:57:02 +0000, Richard Heathfield wrote:

    Assumes "facts" not in evidence.

    Quite right to ask. :)

    You are probably correct in a lot of what you say, and I accept I may be
    wrong in much of what I say. Particularly emulation, maybe that isn't the
    cause of my problem.

    I've posted some test sequences of the SCOS character set to uk.telecom.
    Feel free to have a look using your preferred reader. I've concentrated on
    the sequences where I've noticed I have a problem. There may be others I haven't thought about. They set off as a tidy column of numbers, letters
    and punctuation marks that are taken from the SCOS character set. I have included back tick, for completeness.

    I've then read the posts using Pan and Thunderbird. Both exhibit some
    issues which would prohibit successful decryption in certain
    circumstances.

    Some of the sequence of characters that could be interpreted as emoticons
    get removed and are replaced by unicode characters (or something else) and displayed as an emoticon in both readers. In both cases the underlying characters never reach the reader's display, so any attempt at
    decrytption, using the ciphertext as presented, will fail. This includes:

    :) – Smiley face
    :( – Sad face
    ;) – Wink
    :'( – Crying

    The use of char(94) breaks the line feed in Pan's display window. Not a
    huge problem, but just a p.i.t.a.
    The use of char(42) causes text to be in bold in Pan's display window.
    Again not a big problem.

    I could work through the somewhat mangled ciphertext I see and replace the unicode characters with the original ASCII ones, but that isn't going to
    be a particularly easy learning experience for someone new to
    cryptography. Coupled with the other formatting issues make it feel a bit
    of a uphill task, which is easy to avoid if we accept some restriction on
    the character set.

    Please don't feel I getting at you over this. I'd like to take a greater
    part in sci.crypt, but I don't think we're all seeing the same things,
    which makes participation difficult.

    Best wishes,

    --
    David Entwistle

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From David Entwistle@21:1/5 to Richard Heathfield on Mon Feb 24 09:51:05 2025
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current feed for that group.

    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>


    If your newsreader turns valid C into sad faces, your newsreader is
    broken. Solution: download a newsreader that works.

    Okay, but I've now used a couple of popular newsgroup readers, and with
    default settings I don't see valid C code in the reader message display of either. I don't see the original characters of the ciphertext. I have programmed in C, but I don't at the moment (I should). The younger
    participants in the National Cipher Challenge (some of who, from my
    experience, are very very gifted) are generally using Python.

    My concern is that if we get three new human readers of sci.crypt, keen to learn, but relatively inexperienced, and as an introduction we post a SCOS-encrypted ciphertext as a challenge, and then ask each how many
    characters are there in the ciphertext, we will likely get three different answers. At least two of those readers will not have a valid ciphertext
    which they can decrypt. We have now presented a whole different challenge. There are ways around it - tell them to get a new reader, or convert the characters themselves, or validate the characters with a hash function,
    but that isn't an easy and friendly introduction to cryptography. You
    could, possibly do, argue that cryptography isn't and shouldn't be easy,
    nor friendly, but we've given each individual a different challenge
    dependant on their newsreader - there is little prospect for really useful
    and valid support if individual human readers are looking at a different problem.


    I will not deny that I find it a little frustrating that you have
    continually ignored my substantive point, which is that you are
    attempting to fix the wrong software. But a reasonable man is open to persuasion, and you do strike me as being a reasonable man, so I live in hope.

    Sorry for the frustration - I recognize I must be causing it and it pains
    me. However, I feel I am speaking on behalf of the other new and less- experienced readers and potential readers.


    The fix is simple. Get a newsreader that doesn't mangle articles, not
    least because much of what is posted here is source code, and an attempt
    to change programming language syntax to indulge Pan's corrupting whim
    is not likely to succeed.

    Pan's and Thunderbird's whim... I'll think about it.

    Thanks for the patience.

    --
    David Entwistle

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Harnden@21:1/5 to David Entwistle on Mon Feb 24 10:17:27 2025
    On 24/02/2025 09:51, David Entwistle wrote:
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current feed for that
    group.

    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>

    I think you said uk.telecom. That is uk.test :)

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Harnden@21:1/5 to Richard Heathfield on Mon Feb 24 10:45:42 2025
    On 24/02/2025 10:28, Richard Heathfield wrote:
    On 24/02/2025 10:17, Richard Harnden wrote:
    On 24/02/2025 09:51, David Entwistle wrote:
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current feed for that >>>> group.
    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>

    I think you said uk.telecom.  That is uk.test :)

    That would certainly explain it. Test groups tend to be huge. Let David
    post here if he wants us to read his examples.


    That one isn't.

    Anyway, in my thunderbird: x^2 superscripts, x ^ 2 does not.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to David Entwistle on Mon Feb 24 10:19:32 2025
    On 24/02/2025 09:51, David Entwistle wrote:
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current feed for that
    group.

    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    My reader is Thunderbird.

    My server is Eternal September.

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>

    They don't. I see no articles from you in the current feed, which
    purports to go back to 2017.


    If your newsreader turns valid C into sad faces, your newsreader is
    broken. Solution: download a newsreader that works.

    Okay, but I've now used a couple of popular newsgroup readers, and with default settings I don't see valid C code in the reader message display of either.

    What C code? Perhaps we are talking at cross-purposes.

    I am not talking about any specific C code in any specific
    message. I am talking about newsreaders that could corrupt valid
    C code by turning valid syntax into emoticons. You seem to think
    that such corruption would be justifiable. I disagree.

    My concern is that if we get three new human readers of sci.crypt, keen to learn, but relatively inexperienced, and as an introduction we post a SCOS-encrypted ciphertext as a challenge, and then ask each how many characters are there in the ciphertext, we will likely get three different answers.

    I consider that to be unlikely, since the only way it could
    happen is that two of them have broken newsreaders (and, for
    there to be /three/ answers, they'd have to be broken in two
    different ways).

    I will not deny that I find it a little frustrating that you have
    continually ignored my substantive point, which is that you are
    attempting to fix the wrong software. But a reasonable man is open to
    persuasion, and you do strike me as being a reasonable man, so I live in
    hope.

    Sorry for the frustration - I recognize I must be causing it and it pains
    me. However, I feel I am speaking on behalf of the other new and less- experienced readers and potential readers.

    These new and less experienced readers presumably all use new and
    less experienced newsreaders.

    The fix is simple. Get a newsreader that doesn't mangle articles, not
    least because much of what is posted here is source code, and an attempt
    to change programming language syntax to indulge Pan's corrupting whim
    is not likely to succeed.

    Pan's and Thunderbird's whim... I'll think about it.

    I have found no problem with Thunderbird corrupting text. Can you
    post an example that my reader (115.18.0 (64-bit)) will mangle in
    the way you claim?

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Richard Harnden on Mon Feb 24 10:28:05 2025
    On 24/02/2025 10:17, Richard Harnden wrote:
    On 24/02/2025 09:51, David Entwistle wrote:
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current feed
    for that
    group.
    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>

    I think you said uk.telecom.  That is uk.test :)

    That would certainly explain it. Test groups tend to be huge. Let
    David post here if he wants us to read his examples.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Jan Panteltje@21:1/5 to Entwistle on Mon Feb 24 10:25:51 2025
    On a sunny day (Mon, 24 Feb 2025 09:51:05 -0000 (UTC)) it happened David Entwistle <qnivq.ragjvfgyr@ogvagrearg.pbz> wrote in <vphfe9$10s64$1@dont-email.me>:

    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current feed for that
    group.

    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>


    If your newsreader turns valid C into sad faces, your newsreader is
    broken. Solution: download a newsreader that works.

    Okay, but I've now used a couple of popular newsgroup readers, and with >default settings I don't see valid C code in the reader message display of >either. I don't see the original characters of the ciphertext. I have >programmed in C, but I don't at the moment (I should). The younger >participants in the National Cipher Challenge (some of who, from my >experience, are very very gifted) are generally using Python.

    My concern is that if we get three new human readers of sci.crypt, keen to >learn, but relatively inexperienced, and as an introduction we post a >SCOS-encrypted ciphertext as a challenge, and then ask each how many >characters are there in the ciphertext, we will likely get three different >answers. At least two of those readers will not have a valid ciphertext
    which they can decrypt. We have now presented a whole different challenge. >There are ways around it - tell them to get a new reader, or convert the >characters themselves, or validate the characters with a hash function,
    but that isn't an easy and friendly introduction to cryptography. You
    could, possibly do, argue that cryptography isn't and shouldn't be easy,
    nor friendly, but we've given each individual a different challenge
    dependant on their newsreader - there is little prospect for really useful >and valid support if individual human readers are looking at a different >problem.


    I will not deny that I find it a little frustrating that you have
    continually ignored my substantive point, which is that you are
    attempting to fix the wrong software. But a reasonable man is open to
    persuasion, and you do strike me as being a reasonable man, so I live in
    hope.

    Sorry for the frustration - I recognize I must be causing it and it pains
    me. However, I feel I am speaking on behalf of the other new and less- >experienced readers and potential readers.


    The fix is simple. Get a newsreader that doesn't mangle articles, not
    least because much of what is posted here is source code, and an attempt
    to change programming language syntax to indulge Pan's corrupting whim
    is not likely to succeed.

    Pan's and Thunderbird's whim... I'll think about it.

    Long aga,in 1998 I got hold of Linux
    As there was no Free-Agent newsreader for Liux I wrote my own
    Still using it to this days:
    https://panteltje.nl/panteltje/newsflex/index.html
    That is all x86 code.
    Also ported it to my Raspberry Pi 4 .. (code not yet on my site), using it to write this reply.
    It uses an older version of libforms.
    I now do have a Usenet archive of postings I found interesting going back to that time.
    All written in C ..
    No problem with C code display, I use 'joe' as editor in replies
    and sometimes to view stuff.
    Added a 'html' button but not much html posting these days anymore..

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Harnden@21:1/5 to Richard Heathfield on Mon Feb 24 11:44:19 2025
    On 24/02/2025 11:18, Richard Heathfield wrote:
    On 24/02/2025 10:45, Richard Harnden wrote:
    On 24/02/2025 10:28, Richard Heathfield wrote:
    On 24/02/2025 10:17, Richard Harnden wrote:
    On 24/02/2025 09:51, David Entwistle wrote:
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current feed for >>>>>> that
    group.
    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>

    I think you said uk.telecom.  That is uk.test :)

    That would certainly explain it. Test groups tend to be huge. Let
    David post here if he wants us to read his examples.


    That one isn't.

    Okay. Found them.


    Anyway, in my thunderbird: x^2 superscripts, x ^ 2 does not.

    In mine, neither do.

    x^2 (x hat 2) is perfectly valid C code, so superscripting it wrongly
    changes its meaning.

    Given this input:

    32   ^  ^ *
    ...
    126 ~ ^~ ^~*

    scos2 e 98 76 gives this output:

    You don't need the e/d flag, just negate the numbers.
    So: -5 -78 (the 98 has clocked around to 5).
    It's more rot13ny that way. It was Ben's idea.


    8q   l  U F
    ...
    vfS X /@ zwT

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d  charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5  charset.scos2

    Anyone get anything different?


    I get the same (without the scos headers).

    One thing that is mildly annoying: if the encoded line of text happens
    to start with a "> ", then thunderbird thinks its quoted text.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From David Entwistle@21:1/5 to Richard Harnden on Mon Feb 24 11:50:19 2025
    On Mon, 24 Feb 2025 10:17:27 +0000, Richard Harnden wrote:

    I think you said uk.telecom. That is uk.test

    Apologies. That uk.telecom takes me back.

    --
    David Entwistle

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Richard Harnden on Mon Feb 24 11:18:35 2025
    On 24/02/2025 10:45, Richard Harnden wrote:
    On 24/02/2025 10:28, Richard Heathfield wrote:
    On 24/02/2025 10:17, Richard Harnden wrote:
    On 24/02/2025 09:51, David Entwistle wrote:
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current
    feed for that
    group.
    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>

    I think you said uk.telecom.  That is uk.test :)

    That would certainly explain it. Test groups tend to be huge.
    Let David post here if he wants us to read his examples.


    That one isn't.

    Okay. Found them.


    Anyway, in my thunderbird: x^2 superscripts, x ^ 2 does not.

    In mine, neither do.

    x^2 (x hat 2) is perfectly valid C code, so superscripting it
    wrongly changes its meaning.

    Given this input:

    32 ^ ^ *
    33 ! ^! ^!*
    34 " ^" ^"*
    35 # ^# ^#*
    36 $ ^$ ^$*
    37 % ^% ^%*
    38 & ^& ^&*
    39 ' ^' ^'*
    40 ( ^( ^(*
    41 ) ^) ^)*
    42 * ^* ^**
    43 + ^+ ^+*
    44 , ^, ^,*
    45 - ^- ^-*
    46 . ^. ^.*
    47 / ^/ ^/*
    48 0 ^0 ^0*
    49 1 ^1 ^1*
    50 2 ^2 ^2*
    51 3 ^3 ^3*
    52 4 ^4 ^4*
    53 5 ^5 ^5*
    54 6 ^6 ^6*
    55 7 ^7 ^7*
    56 8 ^8 ^8*
    57 9 ^9 ^9*
    58 : ^: ^:*
    59 ; ^; ^;*
    60 < ^< ^<*
    61 = ^= ^=*
    62 > ^> ^>*
    63 ? ^? ^?*
    64 @ ^@ ^@*
    65 A ^A ^A*
    66 B ^B ^B*
    67 C ^C ^C*
    68 D ^D ^D*
    69 E ^E ^E*
    70 F ^F ^F*
    71 G ^G ^G*
    72 H ^H ^H*
    73 I ^I ^I*
    74 J ^J ^J*
    75 K ^K ^K*
    76 L ^L ^L*
    77 M ^M ^M*
    78 N ^N ^N*
    79 O ^O ^O*
    80 P ^P ^P*
    81 Q ^Q ^Q*
    82 R ^R ^R*
    83 S ^S ^S*
    84 T ^T ^T*
    85 U ^U ^U*
    86 V ^V ^V*
    87 W ^W ^W*
    88 X ^X ^X*
    89 Y ^Y ^Y*
    90 Z ^Z ^Z*
    91 [ ^[ ^[*
    92 \ ^\ ^\*
    93 ] ^] ^]*
    94 ^ ^^ ^^*
    95 _ ^_ ^_*
    96 ` ^` ^`*
    97 a ^a ^a*
    98 b ^b ^b*
    99 c ^c ^c*
    100 d ^d ^d*
    101 e ^e ^e*
    102 f ^f ^f*
    103 g ^g ^g*
    104 h ^h ^h*
    105 i ^i ^i*
    106 j ^j ^j*
    107 k ^k ^k*
    108 l ^l ^l*
    109 m ^m ^m*
    110 n ^n ^n*
    111 o ^o ^o*
    112 p ^p ^p*
    113 q ^q ^q*
    114 r ^r ^r*
    115 s ^s ^s*
    116 t ^t ^t*
    117 u ^u ^u*
    118 v ^v ^v*
    119 w ^w ^w*
    120 x ^x ^x*
    121 y ^y ^y*
    122 z ^z ^z*
    123 { ^{ ^{*
    124 | ^| ^|*
    125 } ^} ^}*
    126 ~ ^~ ^~*

    scos2 e 98 76 gives this output:

    8q l U F
    *z p cH .^3
    ZJ | ]6 sYM
    {8 % jf B<!
    gS H /^ zgT
    @( 6 qY I;(
    nb R |} 6qa
    ,] ' xv PN}
    va a G/ %zh
    E[ [ 4r WJ;
    2j g N\ -5o
    L? . "1 dT<
    9s 8 Ua #/v
    SC A )7 kZE
    *1 % bf ?<2
    ZL U [# rtL
    {! l iD A!9
    gU > :2 yUS
    ;9 u pM H_*
    oT E \" 5dZ
    <) 3 wV O~{
    vc N F- $mg
    E# $ 3e V,@
    2l W M' _vn
    + !n cF,
    9u f T< ]4u
    SE L () jkD
    *3 8 aa //1
    aD Y }; qxK
    }2 6 hY |;8
    hM d ;> x2R
    ;" [ or GJ&
    oV e <, 43Y
    <- \ v5 NX=
    ve x EP "=f
    E' H 2^ Ug'
    2n 6 LY );m
    L< Q 9{ bp>
    !m & Sh [|t
    T, Z *: iyC
    (v } Zq :I0
    aF i {A p7J
    }4 / gz \R7
    hO r @J w*Q
    ;% B n8 Fa^
    oX 0 ,S 3[X
    <= K u( Mj+
    vg 9 Db !?e
    F+ T 1] Ts~
    3f ) Kk (Cl
    M@ c 8. a1.
    !o # Rt }Ls
    T\ l &D h!B
    (x > Y2 ;Uz
    aH u =M o_I
    }6 E f" <d6
    hQ 3 'V v~P
    ;& N m- Em%
    pP $ >e 2,W
    \^ + tn LF-
    wY s CK 9(d
    F{ ' 0v SN#
    3h a J/ *zk
    M: ' 7v ZN?
    !q ` h` Q`B
    )z B Y8 ;az
    bJ 0 =S o[I
    [8 K f( <j6
    aI: s $K e(\
    7pZ ] Es "Kf
    |=7 T l] Ds$
    gOA 1 )T k]E
    %vi . K1 (Tl
    F~* c r. J1)
    mUJ ! }c q.K
    _1r D Q! {cr
    L?[ l xD P!}
    saS _ @l wDQ
    [8q M W_ 'lx
    RA} u 3M V_@
    yhR ~ >u 2MW
    ;^z V c~ .u3
    XG/ 3 9V b~>
    4na , B3 8Vc
    ,-8 e i, A39
    dMB $ &e h,B
    !tj F H$ ^ei
    C]( n oF G$&
    jTA + +n nFH
    *0i O N+ -no
    I/* w uO M++
    pZJ M #_ tlN
    =6r ! Tc ].u
    O\[ # 0t SL#
    vfS X /@ zwT

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Anyone get anything different?

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From David Entwistle@21:1/5 to Richard Heathfield on Mon Feb 24 12:56:05 2025
    On Mon, 24 Feb 2025 10:28:05 +0000, Richard Heathfield wrote:

    That would certainly explain it. Test groups tend to be huge. Let David
    post here if he wants us to read his examples.

    Apologies. That was rather stupid of me.

    Posting the following four lines, which could I think could occur in a
    SCOS ciphertext:

    57 9)
    58 :)
    59 ;)
    60 <)

    The defaults settings of Thunderbird 128.6.Oesr (64-bit) downloads the
    correct character sequence, but displays:

    57 9)
    58 🙂
    59 😉
    60 <)

    Where the 58 and 59 lines include unicode characters and have the
    following hex values

    0x35 0x38 0x20 0xF0 0x9F 0x99 0x82 0x0A
    0x35 0x38 0x20 0xF0 0x9F 0x98 0x89 0x0A

    The default settings of Pan (0.149) displays something similar, but
    different representations of the smileys.

    57 9)
    58
    59
    60 <)

    I can turn this (dis)functionality off, but many users new to USENET and encryption will not be aware of what is happening.

    I thiunk an easy compromise is already in place. I'd suggest, as an introduction, SCOSc, for compatibility with any newsreader, is used
    without the punctuation characters. I don't see any issues with that, if
    no one finds the idea disagreeable.

    On another subject, reading "A12.1 Trigraph Sequences" of the ANSI
    standard version of "The C Programming Language", does that apply to
    Strings, at preprocessing - where '??=' gets replaced with '#' etc. during preprocessing? Sorry if this a stupid question - I'm no longer familiar
    with C.
    --
    David Entwistle

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Richard Harnden on Mon Feb 24 12:19:31 2025
    On 24/02/2025 11:44, Richard Harnden wrote:
    On 24/02/2025 11:18, Richard Heathfield wrote:
    On 24/02/2025 10:45, Richard Harnden wrote:
    On 24/02/2025 10:28, Richard Heathfield wrote:
    On 24/02/2025 10:17, Richard Harnden wrote:
    On 24/02/2025 09:51, David Entwistle wrote:
    On Sun, 23 Feb 2025 10:55:41 +0000, Richard Heathfield wrote:

    I did. I found no articles under your name in the current
    feed for that
    group.
    I have found nothing of the kind in that group.


    Oh dear. Stephan found them. Perhaps your reader. ;)

    they should start at:

    Subject: Character Test
    Message-ID: <vpeakb$cgsc$1@dont-email.me>

    I think you said uk.telecom.  That is uk.test :)

    That would certainly explain it. Test groups tend to be huge.
    Let David post here if he wants us to read his examples.


    That one isn't.

    Okay. Found them.


    Anyway, in my thunderbird: x^2 superscripts, x ^ 2 does not.

    In mine, neither do.

    x^2 (x hat 2) is perfectly valid C code, so superscripting it
    wrongly changes its meaning.

    Given this input:

    32   ^  ^ *
    ...
    126 ~ ^~ ^~*

    scos2 e 98 76 gives this output:

    You don't need the e/d flag, just negate the numbers.

    I prefer the flags, as they're easier to remember.

    One thing that is mildly annoying: if the encoded line of text
    happens to start with a "> ", then thunderbird thinks its quoted
    text.

    Which doesn't matter as long as Thunderbird correctly renders
    quoted text as > quoted text instead of playing silly buggers
    with coloured bars.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to David Entwistle on Mon Feb 24 13:08:49 2025
    On 24/02/2025 12:56, David Entwistle wrote:

    <snip>

    I can turn this (dis)functionality off,

    Or simply not enable it in the first place.

    but many users new to USENET and
    encryption will not be aware of what is happening.

    Then they have an opportunity to learn. Would you deprive them?

    I thiunk an easy compromise is already in place. I'd suggest, as an introduction, SCOSc, for compatibility with any newsreader, is used
    without the punctuation characters. I don't see any issues with that, if
    no one finds the idea disagreeable.

    I do, for reasons I've already given.

    On another subject, reading "A12.1 Trigraph Sequences" of the ANSI
    standard version of "The C Programming Language", does that apply to
    Strings, at preprocessing - where '??=' gets replaced with '#' etc. during preprocessing? Sorry if this a stupid question - I'm no longer familiar
    with C.

    Trigraphs have a very high priority. They are replaced in
    Translation Phase 1, even before line-splicing.

    Here's Translation Phase 1 in full:

    Physical source file multibyte characters are mapped, in an implementation-defined manner, to the source character set
    (introducing new-line characters for end-of-line indicators) if
    necessary. Trigraph sequences are replaced by corresponding
    single-character internal representations.


    That's except in Thunderbird C, of course. In Thunderbird C,
    trigraphs are handled just after smiley conversion and just
    before font serif removal.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Rich@21:1/5 to Richard Heathfield on Mon Feb 24 18:08:44 2025
    Richard Heathfield <rjh@cpax.org.uk> wrote:

    Given this input:

    [long input snipped]

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Anyone get anything different?

    Mouse copy/paste from Tin running inside a Urxvt terminal results in
    identical md5's to yours above:

    $ md5sum *
    d5c6d06587dbac07fed831293ff0580d charset.txt
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Rich@21:1/5 to Richard Harnden on Mon Feb 24 18:05:00 2025
    Richard Harnden <richard.nospam@gmail.invalid> wrote:
    One thing that is mildly annoying: if the encoded line of text
    happens to start with a "> ", then thunderbird thinks its quoted
    text.

    Thing is, starting with "> " *is* the Usenet convention for "quoted
    text".

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Rich on Mon Feb 24 19:30:28 2025
    On 24/02/2025 18:08, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:

    Given this input:

    [long input snipped]

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Anyone get anything different?

    Mouse copy/paste from Tin running inside a Urxvt terminal results in identical md5's to yours above:

    $ md5sum *
    d5c6d06587dbac07fed831293ff0580d charset.txt
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Thanks for that. So it is beginning to look like a fair lot of
    noise over not very much.

    4F 72 20 77 65 20 63 6F 75 6C 64 20 6A 75 73 74
    20 68 65 78 64 75 6D 70 20 74 65 78 74 20 69 6E
    73 74 65 61 64 2E 20 3A 2D 29 0A

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Rich@21:1/5 to Richard Heathfield on Mon Feb 24 21:36:43 2025
    Richard Heathfield <rjh@cpax.org.uk> wrote:
    On 24/02/2025 18:08, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:

    Given this input:

    [long input snipped]

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Anyone get anything different?

    Mouse copy/paste from Tin running inside a Urxvt terminal results in
    identical md5's to yours above:

    $ md5sum *
    d5c6d06587dbac07fed831293ff0580d charset.txt
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Thanks for that. So it is beginning to look like a fair lot of
    noise over not very much.

    I'd say if one is using a newsreader that /does/ perform such
    "transformations" -- and one is unaware such is happening, that for
    those "ones" it is more than noise. In fact, with Tin, if one
    surrounds words/strings with stars or forward slashes, Tin attempts to highlight those, and IIRC it hides the stars/slashes, so depending on
    just what character sequence is output, I might have had a different
    md5 from mouse copy/paste.

    I.e. the word bold below should end up bold in Tin but without stars:

    *bold*

    And the word italics below should be in italics (if my terminal 'did'
    italics, that is):

    /italics/

    I suspect if you send back a reply with a 'starred' or 'slashed' word,
    I won't see the stars or the slashes. But this is a lower likelyhood accidental pattern vs 2^2.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Rich on Tue Feb 25 04:36:24 2025
    On 24/02/2025 21:36, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:
    On 24/02/2025 18:08, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:

    Given this input:

    [long input snipped]

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Anyone get anything different?

    Mouse copy/paste from Tin running inside a Urxvt terminal results in
    identical md5's to yours above:

    $ md5sum *
    d5c6d06587dbac07fed831293ff0580d charset.txt
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Thanks for that. So it is beginning to look like a fair lot of
    noise over not very much.

    I'd say if one is using a newsreader that /does/ perform such "transformations" -- and one is unaware such is happening, that for
    those "ones" it is more than noise. In fact, with Tin, if one
    surrounds words/strings with stars or forward slashes, Tin attempts to highlight those, and IIRC it hides the stars/slashes, so depending on
    just what character sequence is output, I might have had a different
    md5 from mouse copy/paste.

    I.e. the word bold below should end up bold in Tin but without stars:

    *bold*

    My newsreader correctly echoed the asterisk characters, and your
    newsreader correctly sent the text without corrupting the text.
    Here's the article source (between the +rows):

    +++++++++++++++++++++++++++++++++++++++
    I.e. the word bold below should end up bold in Tin but without stars:

    *bold*

    And the word italics below should be in italics (if my terminal
    'did'
    italics, that is):

    /italics/

    I suspect if you send back a reply with a 'starred' or 'slashed'
    word,
    I won't see the stars or the slashes. But this is a lower
    likelyhood
    accidental pattern vs 2^2.
    +++++++++++++++++++++++++++++++++++++++

    So your reader works and so does mine, and it seems that neither
    is a good example of a reader that corrupts data.

    Yes, of course it's true that if you're using broken software you
    might have trouble with corrupted data, but frankly if my
    accounts software started turning 7s into 1s, I would achieve
    better results by replacing my accounts software than I would by
    asking my accountant to stop using 7s in his sums.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Harnden@21:1/5 to Richard Heathfield on Tue Feb 25 10:09:16 2025
    On 25/02/2025 04:36, Richard Heathfield wrote:
    On 24/02/2025 21:36, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:
    On 24/02/2025 18:08, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:

    Given this input:

    [long input snipped]

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d  charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5  charset.scos2

    Anyone get anything different?

    Mouse copy/paste from Tin running inside a Urxvt terminal results in
    identical md5's to yours above:

        $ md5sum *
        d5c6d06587dbac07fed831293ff0580d  charset.txt
        87ea4967605a5ba4d69ff6cf0fb541f5  charset.scos2

    Thanks for that. So it is beginning to look like a fair lot of
    noise over not very much.

    I'd say if one is using a newsreader that /does/ perform such
    "transformations" -- and one is unaware such is happening, that for
    those "ones" it is more than noise.  In fact, with Tin, if one
    surrounds words/strings with stars or forward slashes, Tin attempts to
    highlight those, and IIRC it hides the stars/slashes, so depending on
    just what character sequence is output, I might have had a different
    md5 from mouse copy/paste.

    I.e. the word bold below should end up bold in Tin but without stars:

    *bold*

    My newsreader correctly echoed the asterisk characters, and your
    newsreader correctly sent the text without corrupting the text. Here's
    the article source (between the +rows):


    If I cut and paste the below into vi, then it's fine: the 2-squared
    comes out as 2-hat-2.

    +++++++++++++++++++++++++++++++++++++++
    I.e. the word bold below should end up bold in Tin but without stars:

    *bold*

    And the word italics below should be in italics (if my terminal 'did' italics, that is):

    /italics/

    I suspect if you send back a reply with a 'starred' or 'slashed' word,
    I won't see the stars or the slashes.  But this is a lower likelyhood accidental pattern vs 2^2.
    +++++++++++++++++++++++++++++++++++++++

    So your reader works and so does mine, and it seems that neither is a
    good example of a reader that corrupts data.

    There's a difference between how a reader renders the text and what that
    text actually is.

    I've never had thunderbird corrupt valid C. I know nothing about pan,
    tin, gnus, etc.

    Worst is M$ word helpfully correcting en-dashes to em-dashes, or doing
    its smart-quotes thing. That certainly breaks things.


    Yes, of course it's true that if you're using broken software you might
    have trouble with corrupted data, but frankly if my accounts software
    started turning 7s into 1s, I would achieve better results by replacing
    my accounts software than I would by asking my accountant to stop using
    7s in his sums.


    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to Rich on Tue Feb 25 16:51:33 2025
    On 25/02/2025 16:32, Rich wrote:
    However, in this case, possibly suggesting that folks go looking for save/pipe/turn off options might be more fruitful than suggesting they
    switch away from X to Y when they presumably have some preference for X
    (or at least familarity with X).

    Absolutely - free country and all that. (Or perhaps they've been
    bitten by Y.)

    But people who choose to use newsreaders that can't handle
    printable ASCII non-destructively cannot reasonably expect
    software developers to amend their code as a workaround. I would
    reason that it's the people with the problem who need to own and
    address their problem. That's not to say they can't seek and
    receive help and advice, of course, but they can't expect to be
    able to bend other people around them.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Rich@21:1/5 to Richard Heathfield on Tue Feb 25 16:32:40 2025
    Richard Heathfield <rjh@cpax.org.uk> wrote:
    On 24/02/2025 21:36, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:
    On 24/02/2025 18:08, Rich wrote:
    Richard Heathfield <rjh@cpax.org.uk> wrote:

    Given this input:

    [long input snipped]

    $md5sum charset.txt
    d5c6d06587dbac07fed831293ff0580d charset.txt
    $ md5sum charset.scos2
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Anyone get anything different?

    Mouse copy/paste from Tin running inside a Urxvt terminal results in
    identical md5's to yours above:

    $ md5sum *
    d5c6d06587dbac07fed831293ff0580d charset.txt
    87ea4967605a5ba4d69ff6cf0fb541f5 charset.scos2

    Thanks for that. So it is beginning to look like a fair lot of
    noise over not very much.

    I'd say if one is using a newsreader that /does/ perform such
    "transformations" -- and one is unaware such is happening, that for
    those "ones" it is more than noise. In fact, with Tin, if one
    surrounds words/strings with stars or forward slashes, Tin attempts to
    highlight those, and IIRC it hides the stars/slashes, so depending on
    just what character sequence is output, I might have had a different
    md5 from mouse copy/paste.

    I.e. the word bold below should end up bold in Tin but without stars:

    *bold*

    My newsreader correctly echoed the asterisk characters, and your
    newsreader correctly sent the text without corrupting the text.
    Here's the article source (between the +rows):

    +++++++++++++++++++++++++++++++++++++++
    I.e. the word bold below should end up bold in Tin but without stars:

    *bold*

    And the word italics below should be in italics (if my terminal
    'did'
    italics, that is):

    /italics/

    Tin removed the stars and slashes (for display), and made the slashes
    be underlined in the terminal. I suspect the stars should have been
    bold, but a bold font face was not used for the display in any case.

    Copy and paste from the terminal display resulted in no stars or
    slashes around "bold" or "italics". So for those specific cases, if
    those patterns cropped up, Tin in a terminal would also corrupt the
    "text".

    However, both "saving" the article to a file, and "piping" the article
    to less, resulted in the stars and slashes being present in both. So
    the issue is only with display in the terminal (at least with Tin).

    So your reader works and so does mine, and it seems that neither
    is a good example of a reader that corrupts data.

    Mostly. For the particular 'patterns' that Tin recognizes, it too
    'corrupted'. It simply appears that other newsreaders recognize many
    more patterns, leading to more occurrences of "change" from what was
    originally created.

    Here is what Tin recognizes, and the 'highlight' it defaults to:

    60 Attr. of highlighting with *stars* : Bold
    61 Attr. of highlighting with _dash_ : Underline
    62 Attr. of highlighting with /slash/ : Half bright
    63 Attr. of highlighting with -stroke- : Reverse video

    There is also a singe globlal setting to turn off all of the above, and
    testing with it off, I get the stars and slashes without modification.
    So there is a way to avoid the issue, if it is causing trouble, with
    Tin.

    Yes, of course it's true that if you're using broken software you
    might have trouble with corrupted data, but frankly if my accounts
    software started turning 7s into 1s, I would achieve better results
    by replacing my accounts software than I would by asking my
    accountant to stop using 7s in his sums.

    Agreed. But it is also possible for the other newsreaders that there
    is one or more of:

    A save or pipe option that does not perform the visual character
    translations.

    A toggle to turn off the visual character translations.

    However, in this case, possibly suggesting that folks go looking for save/pipe/turn off options might be more fruitful than suggesting they
    switch away from X to Y when they presumably have some preference for X
    (or at least familarity with X).

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Rich@21:1/5 to Richard Heathfield on Tue Feb 25 20:53:49 2025
    Richard Heathfield <rjh@cpax.org.uk> wrote:
    On 25/02/2025 16:32, Rich wrote:
    However, in this case, possibly suggesting that folks go looking for
    save/pipe/turn off options might be more fruitful than suggesting they
    switch away from X to Y when they presumably have some preference for X
    (or at least familarity with X).

    Absolutely - free country and all that. (Or perhaps they've been
    bitten by Y.)

    But people who choose to use newsreaders that can't handle
    printable ASCII non-destructively cannot reasonably expect
    software developers to amend their code as a workaround. I would
    reason that it's the people with the problem who need to own and
    address their problem. That's not to say they can't seek and
    receive help and advice, of course, but they can't expect to be
    able to bend other people around them.

    Also agreed.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From David Entwistle@21:1/5 to Richard Heathfield on Fri Feb 21 02:03:28 2025
    On Mon, 10 Feb 2025 03:29:37 +0000, Richard Heathfield wrote:


    01000001011...

    I'm afraid I haven't been following the discussion on SCOS Message Format,
    but would offer a comment, which may have been covered. If not here it is:

    Due to some recent technical challenges I have been reading USENET using
    two newsreaders; Thunderbird and Pan. Both are excellent, but handle
    certain parts of the character set differently.

    Under some circumstances, Thunderbird removes the caret (character 94
    dec.) and makes the following character a superscript - indicating one character raised to the power of the second. This is excellent when that
    is what is intended. Not so good, when it is not what was intended. Pan
    doesn't do this.

    Pan uses the asterisk (character 42 dec.) to embolden characters. I think
    it looks for pairs and intends to act on anything between, but the rules concerning spaces and other characters may complicate the interpretation.
    As I recall, the asterisks are not removed. I'm not sure exactly what's
    going on, but a sequence of glyphs seems to result in the loss of
    character return and line feeds.

    There are probably some other more subtle consequences, but I don't think
    we can assume that all readers will reliably receive all the characters we intend them to receive, other than the basic alphanumeric ones, via their various newsreaders. Checking compatibility would be challenging and there isn't much you can do about it.

    Consequently, if SCOS introduces a lot of these lesser used characters,
    which may have different actions in different newsreaders, we're going to confuse all but the most technical and dedicated readers (this is possibly considered a good thing).

    Just a thought for consideration.

    --
    David Entwistle

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to David Entwistle on Fri Feb 21 02:54:21 2025
    On 21/02/2025 02:03, David Entwistle wrote:
    On Mon, 10 Feb 2025 03:29:37 +0000, Richard Heathfield wrote:


    01000001011...

    I'm afraid I haven't been following the discussion on SCOS Message Format, but would offer a comment, which may have been covered. If not here it is:

    Due to some recent technical challenges I have been reading USENET using
    two newsreaders; Thunderbird and Pan. Both are excellent, but handle
    certain parts of the character set differently.

    Under some circumstances, Thunderbird removes the caret (character 94
    dec.) and makes the following character a superscript - indicating one character raised to the power of the second. This is excellent when that
    is what is intended. Not so good, when it is not what was intended. Pan doesn't do this.

    Pan uses the asterisk (character 42 dec.) to embolden characters. I think
    it looks for pairs and intends to act on anything between, but the rules concerning spaces and other characters may complicate the interpretation.
    As I recall, the asterisks are not removed. I'm not sure exactly what's
    going on, but a sequence of glyphs seems to result in the loss of
    character return and line feeds.

    There are probably some other more subtle consequences, but I don't think
    we can assume that all readers will reliably receive all the characters we intend them to receive, other than the basic alphanumeric ones, via their various newsreaders. Checking compatibility would be challenging and there isn't much you can do about it.

    Consequently, if SCOS introduces a lot of these lesser used characters,
    which may have different actions in different newsreaders, we're going to confuse all but the most technical and dedicated readers (this is possibly considered a good thing).

    Just a thought for consideration.

    Well, a newsreader that can't correctly render ASCII is - um... -
    shall we say in urgent need of a bug report?

    SCOS does the best it can to help Pan out by leaving untouched
    anything it doesn't understand. Perhaps Pan could extend SCOS the
    same courtesy? ;-)

    Seriously, though, I don't see what can be done. We could remove
    caret from the alphabet... but then who's to say that the next
    Pan release won't interpret zz as a snooze emoji and 314 as a
    Greek pi symbol? We could end up with no alphabet at all!

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From David Entwistle@21:1/5 to Richard Heathfield on Fri Feb 21 22:23:02 2025
    The following is based on my partial and limited understanding of the
    matter. None of this is very important, nor causing any significant
    problems, but maybe should be considered. It could be wrong...

    On Fri, 21 Feb 2025 02:54:21 +0000, Richard Heathfield wrote:

    Well, a newsreader that can't correctly render ASCII is - um... - shall
    we say in urgent need of a bug report?

    Not really. The ASCII specification includes caret notation for control characters. In has been like that since the early days.

    <https://en.wikipedia.org/wiki/ASCII#Table_of_codes>

    The interpretation of the caret and the subsequent character will be
    platform and application dependant. Certainly a caret should be rendered
    within a integrated development environment, used for writing a program
    where the caret is an operator But a caret does not for part of general
    English language usage and possibly shouldn't be rendered in an interface intended to present plain text English language. In these cases it can be interpreted in one of several ways, as set out above, or to indicate superscript. Pan does render the caret, it is Thunderbird that removes it
    and treats the following character as superscript. I don't know what the
    other newreaders do.


    SCOS does the best it can to help Pan out by leaving untouched anything
    it doesn't understand. Perhaps Pan could extend SCOS the same courtesy?
    ;-)

    That would be ideal, but this is not the case where caret forms part of
    the encoded character set. Some versions of the SCOS encryption process do
    not leave caret untouched. More generally, some versions of SCOS take
    common alphanumeric characters, which will not cause any problems, and substitutes other symbols, like caret, which may be interpreted, quite correctly, as control characters by a platform. The version you introduced recently, which excludes the non-alphanumeric characters, is in my opinion better.


    Seriously, though, I don't see what can be done. We could remove caret
    from the alphabet... but then who's to say that the next Pan release
    won't interpret zz as a snooze emoji and 314 as a Greek pi symbol? We
    could end up with no alphabet at all!

    I'd go with removing all the non-alphanumeric characters. The caret escape sequence has been around and implemented since the 1960s.

    Of course this is only my view.

    Best wishes,

    PS I do like SCOS.

    --
    David Entwistle

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to David Entwistle on Sat Feb 22 06:36:18 2025
    On 21/02/2025 22:23, David Entwistle wrote:
    The following is based on my partial and limited understanding of the
    matter. None of this is very important, nor causing any significant
    problems, but maybe should be considered. It could be wrong...

    On Fri, 21 Feb 2025 02:54:21 +0000, Richard Heathfield wrote:

    Well, a newsreader that can't correctly render ASCII is - um... - shall
    we say in urgent need of a bug report?

    Not really. The ASCII specification includes caret notation for control characters. In has been like that since the early days.

    Irrelevant, I'm afraid, David.

    A Usenet article is a message from a human to other humans. We
    can't reasonably expect other people to conspire with us in
    communicating via control characters. One doesn't embed ASCII
    0000001s in Usenet articles.

    This is the use-mention distinction. We can talk about ^A till
    we're blue in the face without ever actually using one. We talk
    about it by mentioning it, using a ^ and an A like this: ^A

    No control characters were used in the construction of that
    paragraph.

    Newsreading software that wrongly interprets ^A as "control-A"
    will wrongly interpret C=B^A, and is in /desperate/ need of a bug
    report.

    <https://en.wikipedia.org/wiki/ASCII#Table_of_codes>

    The interpretation of the caret and the subsequent character will be
    platform and application dependant.

    Indeed. For example, the SCOS and SCOS2 applications interpret it
    as a printable character... which of course it is.

    PS I do like SCOS.

    Well, I'm glad... but a SCOS that can't shroud source code is
    only half a SCOS.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Richard Heathfield@21:1/5 to David Entwistle on Sat Feb 22 08:57:02 2025
    On 22/02/2025 08:19, David Entwistle wrote:
    On Sat, 22 Feb 2025 06:36:18 +0000, Richard Heathfield wrote:

    Irrelevant, I'm afraid, David.


    I don't believe so.

    Any newsreader will have at its heart a terminal emulation function, where the characteristics of the early electromechanical terminals are
    accurately reproduced via software.

    Why? That's not how I would write a newsreader. What possible
    value could it have?

    Although convenient to forget that, it
    remains true.

    Citation needed, as they say.

    The details of the implementation of the caret control set
    will have depended on the details of the emulation chosen.

    Assumes "facts" not in evidence.

    The newsreader software developers may, like us, may have forgotten about this detail but it will remain embedded in any software that has been with
    us for as long as USENET.

    Deeply unlikely in my opinion. Would anyone else care to comment?

    For more recently developed readers, the
    behaviour may be less well defined. In either case it would be unwise to assume that sending a caret followed by a capital letter A to the reader
    will result in those two characters being displayed.

    I don't see why; I really don't. A newsreader that screwed up so
    badly would quickly be condemned as unusable.

    A perfectly correct
    and compliant implementation will not. If the emulation says it should
    treat it as a control sequence, that is what it will do.

    Then the newsreader, if that's how it works, remains broken for
    the same reason I outlined in my last article. A newsreader that
    cannot correctly render the following code:

    out=in^key;

    is not fit for purpose in sci.crypt.

    --
    Richard Heathfield
    Email: rjh at cpax dot org dot uk
    "Usenet is a strange place" - dmr 29 July 1999
    Sig line 4 vacant - apply within

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From David Entwistle@21:1/5 to Richard Heathfield on Sat Feb 22 08:19:27 2025
    On Sat, 22 Feb 2025 06:36:18 +0000, Richard Heathfield wrote:

    Irrelevant, I'm afraid, David.


    I don't believe so.

    Any newsreader will have at its heart a terminal emulation function, where
    the characteristics of the early electromechanical terminals are
    accurately reproduced via software. Although convenient to forget that, it remains true. The details of the implementation of the caret control set
    will have depended on the details of the emulation chosen.

    The newsreader software developers may, like us, may have forgotten about
    this detail but it will remain embedded in any software that has been with
    us for as long as USENET. For more recently developed readers, the
    behaviour may be less well defined. In either case it would be unwise to
    assume that sending a caret followed by a capital letter A to the reader
    will result in those two characters being displayed. A perfectly correct
    and compliant implementation will not. If the emulation says it should
    treat it as a control sequence, that is what it will do.

    Best wishes,
    --
    David Entwistle

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)