Table of Contents
this is a data extractor that reads out content from "extraction.booking.html" and extracts the below listed information.
Html content taken from extraction.booking.html
output format: JSON string
file name: Hotel.json
Hotel Model:
- Hotel name
- Address
- Classification
- Review points
- Number of reviews
- Description
- Room categories
- Alternative hotels
- download .Net 6 runtime from this address: https://dotnet.microsoft.com/download/dotnet/6.0/
- Clone the repo
git clone https://github.com/CodingBrushUp/Booking_com_Serializer.git
- copy
Resources
folder to your C:\ (our feed is in this folder)
go to the project root folder, run cmd
on url bar to open command prompt
run these commands:
dotnet restore
dotnet build
dotnet run --project ./Task1/Task1.csproj
dotnet test
- we have a variety of libraries for Web Crawling such as HttpAgility, Selenium, etc. Selenium is a better choice for real projects.
- ...
contact[at]alihaghighi[dot]pro