Product Review

If programming is magic, then web scraping is surely a form of wizardry. By writing a simple automated program, you can query web servers, request data, and parse it to extract the information you need. The expanded edition of this practical book not only introduces you to web scraping, but also serves as a comprehensive guide to scraping almost every type of data from the modern web.

Part I focuses on web scraping mechanics: using Python to request information from a web server, performing basic handling of the server's response, and interacting with sites in an automated fashion. Part II explores a variety of more specific tools and applications to fit any web scraping scenario you're likely to encounter.

  • Parse complicated HTML pages
  • Develop crawlers with the Scrapy framework
  • Learn methods to store data you scrape
  • Read and extract data from documents
  • Clean and normalize badly formatted data
  • Read and write natural languages
  • Crawl through forms and logins
  • Scrape JavaScript and crawl through APIs
  • Use and write image-to-text software
  • Avoid scraping traps and bot blockers
  • Use scrapers to test your website


Similar Products

  • Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Instagram, GitHub, and More
  • Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning
  • Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython
  • Flask Web Development: Developing Web Applications with Python
  • Python Data Science Handbook: Essential Tools for Working with Data
  • Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit
  • Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems
  • Automate the Boring Stuff with Python: Practical Programming for Total Beginners
  • Python for Finance: Mastering Data-Driven Finance
  • Serious Python: Black-Belt Advice on Deployment, Scalability, Testing, and More