Our CMS does not support Unicode text thus when we need to search-replace for characters like the ™ and © symbols. This takes a fair amount of time and is easy to miss so I wrote a script in AutoHotKey to automatically handle the Unicode character encoding.
Now I can highlight the word and click a button and Whamo! Instant replacement with HTML equivalents! No more trying to scan text and find illegal characters.
Unicode character encoding
Here is the AutoHotKey code I use. My code first grabs the highlighted text and copies it to the clipboard, where it manipulates it, then sends it back to the active program I was working in. No more need for Unicode character encoding!
Transform,Clipboard,html,%Clipboard%,3 ;3=numbered expressions used where named expression not available
Clipboard:= RegExReplace(Clipboard, "mUs)•\s(.*).<br>", " <li>$1</li>") ;convert bullet & br to ul
Clipboard:= RegExReplace(Clipboard, " <li>(.*)</li>", " <ul>`r`n <li>$1</li>`r`n </ul>") ;convert bullet & br to ul
Clipboard:= RegExReplace(Clipboard, "mUs).<br>", "<br>") ;convert bullet & br to ul
Example of how to Web Scraping multiple pages with some simple URL manipulation. This is where being able to make sure you’ve loaded a page fully, scrape it, then navigate to the next is critical. Looking for patterns in the URL will help you understand how you’ll be able to navigate to the next page.
A copy of the AutoHotKey syntax writer can be found here. Remember patience is a virtue and Happy Scraping!
This is the second video in this series. Here we practice setting values on a page (kind of reverse of Web Scraping with AutoHotkey however I don’t believe anybody has coined a decent term yet) and clicking links.
Word of warning- some pages want you to fire an “event”. Sometimes this is tricky. Given this video is set to an introductory level I only touch a little on the subject.
Web Scraping with AutoHotKey 102-Setting values and clicking links
The syntax for writing the writing the code can be found on my first post here. There is also an AutoHotKey forum thread you might wish to review here.
(web harvesting or web data extraction) is a computer software technique of extracting information from websites. Usually, such software programs
Do you frequently access a Web page / then have to write an email regarding the data you gathered? I had to look at our SharePoint and then email clients based on what I found. The email is sent via Outlook and is tailored, specifically, to my client. Saves me a ton of time!
It is easy to use AutoHotKey to read various aspects of a SharePoint site and write custom emails tailored to the respondents.
Web Scraping and emailing data video demonstration