Author Topic: Encodings and status bar  (Read 119 times)

jporkkahtc

  • Senior Community Member
  • Posts: 2616
  • Hero Points: 210
  • Text
Encodings and status bar
« on: November 09, 2022, 11:37:38 pm »
So when editing a BAT file I noticed some weird character.
Turns out that it is 0x96, a wide dash.

I noticed it because it displayed as some weird character - but I don't know what encoding the file was originally loaded as unfortunately.

In any case, with a little experimenting I found something I find odd.
How "Current character" is displayed in the status bar depends on the encoding.
When "SampleFile.txt" is loaded with latin1, then CurrentCharacter doesn't show the actual character value.
When loaded with Text SBCS, then it does show 0x96.

Why does it display as "U+00FB" in this case?

See the attached screen shots.



Clark

  • SlickEdit Team Member
  • Senior Community Member
  • *
  • Posts: 6436
  • Hero Points: 501
Re: Encodings and status bar
« Reply #1 on: November 09, 2022, 11:50:01 pm »
When you load with latin1, this code page is being converted to Utf-8.

The status line is showing the value of the Utf-8 character and not what will be there when you save the file. You can always switch to hex mode, you will see the actual binary data (not Utf-8).

Also, Latin1 is the wrong code page. Try "Western European (Windows-1252)"