
Re: character issues in w3m



w3 does some magic for some non-ASCII characters and replaces them
with a close ASCII equivalent. In the webpage you mention, the
apostrophe is actually a "right single quotation mark" (Unicode
code point U+2019), which w3 apparently replaces with the ASCII
apostrophe. w3m does no such magic by default.
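
To illustrate the kind of replacement meant here, a rough sketch in
Python follows. It is not the actual code from w3 or from my
emacspeak-w3m.el patch; the table ASCII_FALLBACKS and the function
to_close_ascii are hypothetical names, and the table entries are only
examples. The last lines show one plausible way a "?" can appear:
encoding U+2019 to Latin-1 with a replacement fallback. Whether that is
exactly what happens in emacs-w3m is an assumption.

```python
# Hypothetical mapping table; the entries are illustrative only and are
# not taken from w3 or emacspeak.
ASCII_FALLBACKS = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2026": "...",  # horizontal ellipsis
}


def to_close_ascii(text):
    """Replace selected non-ASCII characters with a close ASCII equivalent."""
    return text.translate(str.maketrans(ASCII_FALLBACKS))


sample = "Freedom\u2019s Watch"

# With the mapping applied, the typographic quote becomes a plain apostrophe.
print(to_close_ascii(sample))

# Without it, a character that Latin-1 cannot represent is substituted
# when the string is forced into that encoding.
print(sample.encode("latin-1", errors="replace"))
```

Characters missing from the table pass through unchanged, so the table
can be extended one character at a time.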

I have actually been using a somewhat hackish patch for
emacspeak-w3m.el for quite a while that does the same for w3m.
However, I am currently trying to resolve the issues with non-ASCII or Latin-1
characters in emacspeak in a more fundamental way, which will make my
hack for w3m obsolete.

I will hopefully soon be able to post a patch for adventurous people
to test this new feature. It will only affect buffers in multibyte
mode. I would recommend using emacs in multibyte mode anyway nowadays,
unless you have very good reasons not to. It is a UTF-8 world out there!

Best regards, Lukas

Robert D. Crawford writes ("character issues in w3m"):
> http://www.nytimes.com/2007/09/30/us/politics/30watch.html?ex=1348891200&en=02eb54b65d042599&ei=5124&partner=permalink&exprod=permalink
> 
> The first sentence  at the top of the story shows like this in emacs-w3m:
> 
> Freedom?s Watch, a deep-pocketed conservative group led by two former
> senior White House officials, made an audacious debut in late August
> when it began a $15 million advertising campaign designed to maintain
> Congressional support for President Bush?s troop increase in Iraq.
> 
> And in emacs/w3 it looks like this:
> 
> Freedomùs Watch, a deep-pocketed conservative group led by two former
> senior White House officials, made an audacious debut in late August
> when it began a $15 million advertising campaign designed to maintain
> Congressional support for President Bushùs troop increase in Iraq.
> 
> Note the question mark in the w3m sentence.  For some reason the octal
> characters are being rendered in w3m as question marks.  In w3, only the
> octals are shown.  
> 
> Is this a result of emacspeak needing unibyte?  For those using w3m,
> have you solved this issue?  Is this an issue only for me, a problem
> with my configuration?
> 
> As always, any help is appreciated,
> rdc
> -- 
> Robert D. Crawford                                      rdc1x@xxxxxxxxxxx
> 
> GOOD-NIGHT, everybody ... Now I have to go administer FIRST-AID to my
> pet LEISURE SUIT!!
> 
> -----------------------------------------------------------------------------
> To unsubscribe from the emacspeak list or change your address on the
> emacspeak list send mail to "emacspeak-request@xxxxxxxxxxx" with a
> subject of "unsubscribe" or "help"
> 



