A web browser (Google Chrome or Mozilla Firefox) is required for this chapter. We will be using the following Python libraries:
- requests
- re
- bs4
If these libraries are not installed in your current Python setup, refer to the Setting things up section of Chapter 2, Python and the Web – Using urllib and Requests, for installation and setup instructions. Note that re is part of the Python standard library, so only requests and bs4 may need to be installed.
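As a quick way to confirm that the libraries listed above are available, the following minimal sketch checks whether each one can be located for import. The helper name `find_missing` and the `REQUIRED` list are illustrative assumptions, not part of the book's code files.

```python
import importlib.util

# Libraries listed for this chapter; `re` ships with the standard library.
REQUIRED = ["requests", "re", "bs4"]


def find_missing(modules):
    """Return the names in `modules` that cannot be located for import.

    Hypothetical helper for illustration only.
    """
    return [name for name in modules if importlib.util.find_spec(name) is None]


missing = find_missing(REQUIRED)
if missing:
    print("Missing libraries:", ", ".join(missing))
else:
    print("All required libraries are available.")
```

If anything is reported as missing, the installation steps referenced in Chapter 2 apply.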
The code files for this chapter are available in this book's GitHub repository: https://github.com/PacktPublishing/Hands-On-Web-Scraping-with-Python/tree/master/Chapter09.
If you are already familiar with re, you can go directly to the Using regular expressions to extract data section.