Page 1 of 1

Character Encoding problem viewing non english subs from QT

Posted: 04 Jun 2007 14:25
by hudsonUK
Although I am using Windows XP, this has been tested on a MAC also so I thought it best to post in non specific OS.

I have a quicktime movie with 2 text tracks for subs, one english the other German (also happens with French).

They both show correctly in QT and if using SRT files outside of the QT movie they show OK in VLC.

But when playing the German or French ones from the embedded QT text tracks, special characters (non ASCII - Accented etc) show as '?'

Have tried to set the encoding in VLC but it appears to be overriden to QTF-8 according to the log.

Anyone any clues about this ?

VLC version is 8.6a and QT is 7.1.6

For info : when using the embedded subtitles, the message show this :

subsdec debug: using demux suggested character encoding: UTF-8
main debug: using decoder module "subsdec"

Thanks

Posted: 04 Jun 2007 14:49
by Jean-Baptiste Kempf
You can force the encoding in the preferences.

Posted: 04 Jun 2007 15:01
by hudsonUK
You can force the encoding in the preferences.
I tried that but it made no difference. Besides, I thought UTF-8 should be able to show those characters anyway.

Edit - Obviously should have read UTF not QTF

Posted: 05 Jun 2007 09:30
by hudsonUK
If anyone would like to try with the actual movie, you can download it from here :

http://intrepid.mhtrek.net/heavyliesthecrown-hi_ST.mov

It is about 360mb and is a Star Trek fan film.

I had it tested on a German system just in case and it still shows them wrong. To see the problem, select the German subtitle track.

Posted: 05 Jun 2007 19:46
by DJ
I did run across an oddity while trying this (low budget) production. In the beginning the file appears to be interlaced and I employed a deinterlace filter. As the video progresses it is NOT interlaced and the deinterlacing filter seems to interfere with the subtitles.

Posted: 06 Jun 2007 01:29
by hudsonUK
You are certainly right that it's low budget.

Regarding the interlaced/non interlaced, I don't have enough expertise to comment on that. I have not seen any other issues with the subs though besides this one.

The issue with the sub can be seen on the very first German one.

The very first word should be :

(Erzähler)

But is actually showing as :

(Erz?hler)

This occurs for all the special characters such as : äßöÄ

Posted: 06 Jun 2007 07:41
by DJ
Try changing the subtitle text encoding specifically to ISO-8859-1

I will admit also seeing Erz?hler, but I don't have German installed as my primary language within Windows. I do have a Unicode font set for subtitles though. I'm not exactly sure what the problem is here.

Posted: 06 Jun 2007 09:33
by hudsonUK
Try changing the subtitle text encoding specifically to ISO-8859-1

I will admit also seeing Erz?hler, but I don't have German installed as my primary language within Windows. I do have a Unicode font set for subtitles though. I'm not exactly sure what the problem is here.
I have had it tested with someone who has German as primary language with the same results.

My thinking was similar and I have tried to set the subtitle text encoding but that does not appear appear to make a difference. The log (extract in original post) shows that it is selecting UTF-8 anyway no matter what is set in VLC.

Bear in mind that QT is showing it correctly though.

Posted: 07 Jun 2007 06:11
by DJ
This is rather standard for subs, but may be changed with a text editor or within VLCs preferences, Input / Codecs, Other codecs, Subtitles.

Posted: 07 Jun 2007 20:24
by hudsonUK
This is rather standard for subs, but may be changed with a text editor or within VLCs preferences, Input / Codecs, Other codecs, Subtitles.
Not quite sure what you're suggesting here. I have tried to set the encoding using the VLC preferences but it makes no difference.

When viewed using an SRT file outside of the QT movie, VLC shows them correctly without having to change any settings.

I was quite excited by the thought of informing users they did not have to use quicktime to view the movie, seems like that might not be possible :(

Posted: 08 Jun 2007 08:05
by DJ
This is rather standard for subs, but may be changed with a text editor or within VLCs preferences, Input / Codecs, Other codecs, Subtitles.
I was referring to UTF-8 used for subtitles.

Posted: 08 Jun 2007 09:26
by hudsonUK
Thanks for the clarification. Am I right in thinking UTF-8 should show German/french characters ?

Posted: 08 Jun 2007 10:42
by DJ
Yes!

Posted: 08 Jun 2007 11:03
by hudsonUK
Thanks, I'm sort of a bit stumped now. I've run out of things to try.

Posted: 09 Jun 2007 07:08
by DJ

Posted: 09 Jun 2007 17:24
by hudsonUK
Thank you for that. In case it matters I am currently on 0.8.6b.

To clarify, this is occuring with an embedded text track in quicktime which is not english (eg german). Works ok when using an external SRT file.

Happens on both W/NT and MAC

Also, the messages logged when turning the subs on are :

main debug: control type=17
main debug: looking for decoder module: 28 candidates
subsdec debug: using demux suggested character encoding: UTF-8
main debug: using decoder module "subsdec"
main debug: thread 5028 (decoder) created at priority 0 (input/decoder.c:159)

Michael

Posted: 20 Jun 2007 17:26
by gat19_94
how can i encode subs in vlc???