Web Scrape Improvements #33
Loading…
x
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
** Try to isolate article bodies
** Try to isolate recipe, targeting keywords like tablespoon, tbsp, etc
Web Scrap Improvementsto Web Scrape ImprovementsScrape Improvement Notes
Seperate out bodies of logic that scrape certain sections
Only prepend URL to images that don't have a full URL
Don't try to process ico files
If a small image is scraped, don't try to resize it
Don't pass URL params to scrape like ?v=htu