Bug 74503

Summary: Other: Spellcheck should ignore URLs
Product: LibreOffice
Reporter: Alexander Wilms <f.alexander.wilms>
Component: Writer
Assignee: Not Assigned <libreoffice-bugs>
Status: NEW ---    
Severity: enhancement    
Priority: medium    
Version: 4.2.0.4 release   
Hardware: Other   
OS: All   
Whiteboard: BSA
Crash report or crash signature:
Regression By:
Bug Depends on:    
Bug Blocks: 96000    

Description Alexander Wilms 2014-02-04 12:29:50 UTC
Problem description: 
URLs are marked as incorrect because their words may be in a different language, periods are not placed as they would be in a sentence, and words might not be capitalized.

Steps to reproduce:
1. Enter a URL
2. Run the spellcheck

Current behavior:
The URL is marked as incorrect.

Expected behavior:
The URL should not be marked as incorrect.
              
Operating System: Windows 7
Version: 4.2.0.4 release
Comment 1 Dominique Boutry 2014-02-05 12:12:03 UTC
Hi. It sounds logical, but we must be aware that the spellcheck deals with the text accompanying a URL, not the URL itself. One finds both cases:
- "For help on LibreOffice, click here": the hyperlink is on "here"; if it is misspelled as "heer", the spellcheck is welcome,
- "news on LibO at www.libreoffice.org": the hyperlink is on "www.libreoffice.org"; the spellcheck should ignore it.

How to distinguish the cases?
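
One possible way to tell the two cases apart, sketched in Python: spellcheck the visible text of a hyperlink unless that visible text itself looks like a URL. The function name `should_spellcheck_link_text` and the regex heuristic are hypothetical illustrations, not LibreOffice code.

```python
import re

# Hypothetical heuristic: visible text "looks like a URL" if it starts with
# a scheme (e.g. http://) or with "www." and contains no whitespace.
_URL_LIKE = re.compile(r"^(?:[a-z][a-z0-9+.-]*://|www\.)\S+$", re.IGNORECASE)


def should_spellcheck_link_text(visible_text: str) -> bool:
    """Return True if the visible hyperlink text should be spellchecked."""
    return _URL_LIKE.match(visible_text.strip()) is None
```

With this sketch, a link whose visible text is "here" (or the misspelled "heer") would still be checked, while a link whose visible text is "www.libreoffice.org" would be skipped.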
Comment 2 retired 2014-02-05 20:00:27 UTC
IMO a valid request.

I think we all agree that the spellchecker should check the text but not the URL. That's possible and very useful, I think, so let's set this to NEW and Enhancement, since it's a feature request, and set OS to All.
Comment 3 Dominique Boutry 2014-02-08 08:00:31 UTC
OK... but which request is valid?
No spell check for the underlying URL, but what about the visible text in the two examples in comment 1?

Thanks
Comment 4 retired 2014-02-08 08:16:24 UTC
Good point. So this makes it a two-part request?

1) Do not check the underlying URL
2) Do not check URLs in text (the spellchecker should ignore anything after www."xxxxxxxxxx")

Does that make sense?
Comment 5 Dominique Boutry 2014-02-19 09:30:13 UTC
Yes for me.
Comment 6 Alexander Wilms 2014-02-19 15:03:51 UTC
In addition to the pattern www., I'd also ignore everything after *://, so that any kind of URL gets ignored.
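
The two patterns mentioned in this thread (`www.` and `*://`) could be sketched as a token filter. This is a minimal Python illustration under stated assumptions: whitespace tokenization, the names `is_url_token` and `words_to_check`, and the regex are all hypothetical and do not reflect how the LibreOffice spellchecker is implemented.

```python
import re
from typing import List

# Assumption: a token is URL-like if it begins with "www." or with a
# "scheme://" prefix such as http://, ftp://, or file://.
URL_TOKEN = re.compile(r"^(?:www\.|[a-z][a-z0-9+.-]*://)", re.IGNORECASE)


def is_url_token(token: str) -> bool:
    """Return True if the token starts like a URL."""
    return URL_TOKEN.match(token) is not None


def words_to_check(text: str) -> List[str]:
    """Split on whitespace and drop URL-like tokens before spellchecking."""
    return [tok for tok in text.split() if not is_url_token(tok)]
```

For the example from comment 1, `words_to_check("news on LibO at www.libreoffice.org")` would keep only the ordinary words and leave the URL out of the spellcheck.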