- XPath selectors
- code_html.xpath('//img/@src')
- links = code_html.xpath('//a/@href')
- bs.find_all("<html_tag_label>")
- items.py, pipelines.py, settings.py, and spiders
- spiders/my_sypder.py.
- pipelines.py
- CrawlSpider
- process_item (item, spider)
- Scrapinghub