
ThatsCookedBy is a leading community based website dedicated to foodies where they can view and submit their favourite recipes, post comments, rate recipes, communicate with other users, participate in discussions and forums and more.
Situation
ThatscookedBy wanted N-Tech to develop an effective and manageable data parsing Python script that can pars recipe data from various websites with specific formats.Client has Drupal 7 website based on food recipes and wanted to develop Python script for data parsing.
Challenge
A python script was supposed to be compatible with Drupal 7 language, and multiple other language websites from which the data parsing was to be done.Client required Python script that supports the following formats:
Schema.org
Microdata
Mircoformat
RDFa
It was difficult to manage multiple formats using python script.
Our developers have closely worked with the client and drupal website developers to understand the database structure that assisted us to create effective Python script for food recipe data parsing.
Solutions
An effective data parsing script that helps to parse data in multiple formats. We developed Python script that supports the following formats:
Schema.org | Microdata | Mircoformat | RDFa
- System auto-detects the format type while it is scanning and accordingly it parses the rich snippets and generates the content in Drupal.
- Data parsing script allows to link back to the source website.
- The script allows to parse all supported items for the rich snippet
- The input can be a form (or you can suggest something) that supports:
Loading a text file containing links to the actual content that the parser needs to read for. e.g A link to a website that contains links that supports one of the data formats mentioned above e.g. www.Site1.com/
Benefits
“ Again delivered as promised and quality work. Very helpful guys "