Web Scrape Improvements #33
Labels
No Label
Bug
Enhancement
Idea
In Progress
New Feature
Security
No Milestone
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: Max/SolidScribe#33
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
** Try to isolate article bodies
** Try to isolate recipe, targeting keywords like tablespoon, tbsp, etc
Web Scrap Improvementsto Web Scrape ImprovementsScrape Improvement Notes
Seperate out bodies of logic that scrape certain sections
Only prepend URL to images that don't have a full URL
Don't try to process ico files
If a small image is scraped, don't try to resize it
Don't pass URL params to scrape like ?v=htu