#43850 - 03/06/17 11:53 AM Copy-pasting diacritical marks from Firefox
deniro Offline


Registered: 09/09/09
I copy and paste a lot of text from web pages via Firefox.

The diacritical marks don't always transfer, and I see many because what I'm doing involves a lot of foreign names. I'd like to figure out what's involved here. I'm thinking of factors like text encoding, fonts, html code, and how certain programs interpret the copied text.

I do most of my writing in Tom Bender's Tex-Edit Plus, which has some really nice features, especially its text cleanup, but it handles diacriticals poorly, if at all. This is what I'd like to correct.

Some other examples. TextWrangler 3.5.3 transfers the text with its diacritical marks intact but drops the other HTML formatting, which I like. Pages and TextEdit transfer all the HTML formatting, which of course includes the diacriticals. So these programs are obviously doing something that Tex-Edit Plus isn't or can't. Maybe there's a preference, tweak, or action that will let me paste into TE Plus without losing all the formatting. I don't mind losing some. I don't want the HTML code to transfer, just the diacriticals.

Interestingly, I can copy-paste into TextWrangler, then copy-paste the same text into TE Plus, and many, but not all, of the diacriticals transfer properly. Of course, I wouldn't want to add that step to my workflow, which is already full of tedious, repetitive tasks: copy, paste, text cleanup, search and replace. I've been trying to figure out grep searches but haven't managed to. There ought to be a way to use AppleScript or Automator to handle those tasks. A way that I can figure out, that's another matter.
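
For what it's worth, the repetitive cleanup pass described above could also be scripted outside the editor. The following is only a rough Python sketch, not anything Tex-Edit Plus, AppleScript, or Automator provides: the function name `clean_pasted_text` and the particular cleanup rules are assumptions chosen for illustration.

```python
import re
import unicodedata


def clean_pasted_text(text: str) -> str:
    """Tidy text copied from a web page (hypothetical helper, rules are assumptions)."""
    # Combine "letter + combining mark" pairs into single precomposed characters.
    text = unicodedata.normalize("NFC", text)
    # Turn non-breaking spaces into ordinary spaces.
    text = text.replace("\u00a0", " ")
    # Collapse runs of spaces and tabs into one space.
    text = re.sub(r"[ \t]+", " ", text)
    # Trim leading and trailing whitespace on every line.
    text = "\n".join(line.strip() for line in text.split("\n"))
    # Reduce three or more consecutive newlines to a single blank line.
    text = re.sub(r"\n{3,}", "\n\n", text)
    return text.strip()


if __name__ == "__main__":
    sample = "  Bjo\u0308rn\u00a0 Borg \n\n\n\nAntoni\u0301n  "
    print(clean_pasted_text(sample))
```

The NFC step matters for diacriticals: some sources deliver an accented letter as two code points, and composing them first gives older software a better chance of recognizing the character.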

Regardless, this is a subject I've been wondering about for a long time. It isn't typing diacritical marks that's a problem. It's transferring them from one place to another.
_________________________
OS X 10.11.6
iMac 21.5", Mid 2011
2.8 GHz Intel Core i7, 24 GB
AMD Radeon HD 6770M
Using Apple computers since 1980

#43852 - 03/06/17 12:15 PM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
grelber Offline


Registered: 08/05/09
Loc: North of 49th ||
If your diacritical marks/characters are Unicode (check your Mac's Character Viewer), then they should transfer over. I've rarely had the problem when transferring text from web pages/sources in Firefox; for me it usually involves trying to transfer it via Gmail.
(A similar thing happens in FTM and Gmail: when I try to separate words or sentences with 2 spaces, FTM and Gmail change them to a single space.)

I think you're screwed, but others more knowledgeable about the underpinnings can probably be more helpful.

#43855 - 03/06/17 07:01 PM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
Bob_00001 Offline


Registered: 01/03/10
Virtually everything that you find on the internet is UTF-8 encoded, and unfortunately, Tom Bender's Tex-Edit Plus has almost no ability to deal with Unicode. It's been a great text editing program for many years, but it's not really up to the task anymore. I've pretty much given up on it. I've been hoping for a major update to it, but nothing so far.

The TextEdit program that's bundled with the Mac has no problem with diacriticals, because it works with Unicode.
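
To illustrate what can go wrong when UTF-8 text reaches a program that expects an older one-byte encoding, here is a minimal Python sketch. It assumes the older encoding is Mac OS Roman, which nothing in this thread confirms for Tex-Edit Plus, and the function name is made up for the example.

```python
def misread_utf8_as_mac_roman(text: str) -> str:
    """Show how UTF-8 bytes look when each byte is read as a Mac OS Roman character."""
    return text.encode("utf-8").decode("mac_roman")


name = "Bjørn Dvořák"
garbled = misread_utf8_as_mac_roman(name)
print(garbled)  # each accented letter turns into two unrelated symbols

# The damage is reversible as long as the bytes themselves were not altered.
restored = garbled.encode("mac_roman").decode("utf-8")
print(restored == name)
```

Plain ASCII letters come through unchanged because both encodings agree on them; only the accented letters, which UTF-8 stores as two or more bytes, get garbled.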


Edited by Bob_00001 (03/06/17 07:02 PM)
_________________________
MacBook Pro 15" (2015)
OSX 10.10.5

#43861 - 03/07/17 11:01 AM Re: Copy-pasting diacritical marks from Firefox [Re: Bob_00001]
deniro Offline


Registered: 09/09/09
I like TE Plus most of all for its text cleanup and search and replace, which I use several times a day almost every day.

Typing is faster than in Pages, which bogs down. I have to wait for Pages to catch up to my typing, and I'm not a speedy typist. Pages files are bigger and take up more drive space. They load more slowly and are proprietary. TE Plus lets me get to work right now and stays out of my way. The word count, line numbering, and display of invisibles are better, and so on. I've meant to tackle its scripting ability but haven't yet. I find it a better program than Pages for what I need to do, though I use both. I use Pages for printing, for pasting HTML text, for documents with columns, tables, and outlining, for letters and serious documents. The spell checker is better than in TE Plus, though both are supposed to draw from the same system-wide database.

I used NeoOffice for a while, before I got Pages, but now almost never. It, too, is slow, loaded with far too many features, with a learning curve for just about anything you want to do. I don't need to do everything and I don't want to learn how to do everything. Occasionally it will read documents that my other programs can't. That might be its only virtue.

Other possibilities. A text cleanup program? Textsoap, for example, though it can't do everything TE Plus does. I don't want to add another program into my workflow.

I would like to like TextWrangler. It's faster than TE Plus and does a better job of handling large documents, but it's more complicated, doesn't quite do what TE Plus does, offers little in text formatting, and is more of a programmer's tool. Like TextWrangler, TE Plus handles grep searches, and its search/replace in general seems better for what I need. The interface is better organized. The program is simply easier to use. It allows me to get work done with the smallest learning curve.

What else to say? It's a fine program. I can't even recall how many years ago I started using it. Long time.

#43862 - 03/07/17 11:04 AM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
deniro Offline


Registered: 09/09/09
Certain diacritical marks seem to paste easier than others. Programs seem to have the hardest time with marks in Scandinavian and eastern European names.
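
One possible reason some marks survive and others don't is that older one-byte encodings only contain a fixed set of accented letters. The Python sketch below checks which characters Mac OS Roman can represent. Whether Tex-Edit Plus actually converts to Mac OS Roman is an assumption, not something established here, and `lost_in_mac_roman` is a hypothetical helper name.

```python
import unicodedata


def lost_in_mac_roman(text: str) -> list:
    """Return the characters of `text` that Mac OS Roman cannot represent."""
    lost = []
    for ch in text:
        try:
            ch.encode("mac_roman")
        except UnicodeEncodeError:
            lost.append(ch)
    return lost


for name in ["Bjørn", "Åse", "Dvořák", "Wałęsa", "Erdős"]:
    composed = unicodedata.normalize("NFC", name)    # one code point per accented letter
    decomposed = unicodedata.normalize("NFD", name)  # base letter + combining mark
    print(name, lost_in_mac_roman(composed), len(lost_in_mac_roman(decomposed)))
```

Under that assumption, precomposed Scandinavian letters such as ø and Å are representable, while letters such as ř, ł, ę, and ő are not. A decomposed Å (a plain A followed by a combining ring) also fails, which could be one explanation for inconsistent results with the same name from different sources.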

#43863 - 03/07/17 11:07 AM Re: Copy-pasting diacritical marks from Firefox [Re: grelber]
deniro Offline


Registered: 09/09/09
grelber: Character Viewer doesn't tell me much. Nothing about Unicode.

#43864 - 03/07/17 11:14 AM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
deniro Offline


Registered: 09/09/09
What about the Text Encoding under View in Firefox? What should that be?

#43865 - 03/07/17 12:14 PM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
deniro Offline


Registered: 09/09/09
Well, this is interesting.

I found I can paste text with Norwegian diacritical marks into TE Plus if I do it in smaller portions. A page-long list will have garbled characters, but copying and pasting part of the list, then another part, will retain the formatting.

#43866 - 03/07/17 12:54 PM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
joemikeb Offline
Moderator

Registered: 08/04/09
Loc: Fort Worth, Texas
There is a beta version of Tex-Edit Plus dated Sept 2016; you might give that a try.
_________________________
joemikeb • moderator

#43867 - 03/07/17 03:41 PM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
grelber Offline


Registered: 08/05/09
Loc: North of 49th ||
Originally Posted By: deniro
Character Viewer doesn't tell me much. Nothing about Unicode.

Character Viewer contains a section of all Unicode characters — at least it does in Lion.
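
If Character Viewer isn't informative enough, a few lines of Python can list the Unicode code point and official name of every character in a copied string. This is just a sketch; `describe` is a made-up helper name, and the sample string is arbitrary.

```python
import unicodedata


def describe(text: str) -> list:
    """List each character with its code point and official Unicode name."""
    return [
        (ch, f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


for row in describe("Åse"):
    print(*row)
```

Pasting a problem name into the sample string shows whether an accented letter arrived as a single precomposed character or as a base letter followed by a separate combining mark.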

#43868 - 03/07/17 03:45 PM Re: Copy-pasting diacritical marks from Firefox [Re: deniro]
grelber Offline


Registered: 08/05/09
Loc: North of 49th ||
Originally Posted By: deniro
What about the Text Encoding under View in Firefox? What should that be?

Default setting is usually UTF-8.
Check the info icon in the URL bar and/or the View menu for the pages you are accessing in Firefox.
