Public Notes by chase_ats Tagged #parsing
Notes publicly shared by our members.
"Feedbag is a Ruby library for the auto-discovery of syndicated feeds (RSS/Atom)."
_Give it a url and it'll try finding the feed for the site_
#gems #ruby #open_source #parsing #scraping #automation #feeds #pub
"Massages HTML how you want to: sanitize tags, remove headers and footers, convert to plain text."
"Summary
Remove headers and footers and navigation, and strip to only the "content" part of the HTML
Sanitize tags, removing javascript and styling
Convert HTML to markdown, plain text, or sanitized HTML"
#Ruby #repos #starred #parsing #html #html_parsing #parsers #pub
RKelly gem for parsing Javascript. Not sure how well it actually works.
#javascript #ruby #nokogiri #parsing #dev #%stack_overflow #bookmarked_on_site #gems #pub