Morning | 09:00 - 13:30
·Some programming experience in Python (we’ll use Python 3) – for example, be comfortable with the material in Introduction to Python
·Basic understanding of HTML and the DOM structure for web pages – for example, be comfortable with the material in Modern Web Development with HTML5 and CSS
·Access to a computer with a browser, where you can install Python packages and develop code at a command line; in some cases, it may help to use virtualenv
·Know how to install Python libraries with pip or a similar tool
·Basic familiarity with Git and use of GitHub – see Introducing GitHub if needed
Downloads required in advance of the course:
·Install BeautifulSoup4, TextBlob, spaCy, datasketch, gensim, networkx, PyTextRank
We will provide a GitHub link to everyone who registers for this course, which includes detailed instructions for setup, plus Jupyter notebooks for each of the course exercises and a Docker container with all of the required libraries and data sets pre-loaded.
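Before the course, it may help to confirm that the libraries listed above are visible to your Python 3 environment. The following is only a minimal sketch, not part of the official setup instructions: the `COURSE_PACKAGES` mapping and the `missing_packages` helper are hypothetical names introduced here, and the pip and module names are assumed to be the usual ones for each library.

```python
import importlib.util

# Assumed mapping from pip package name to importable module name.
# These names are illustrative; the course repository may pin other versions.
COURSE_PACKAGES = {
    "beautifulsoup4": "bs4",
    "textblob": "textblob",
    "spacy": "spacy",
    "datasketch": "datasketch",
    "gensim": "gensim",
    "networkx": "networkx",
    "pytextrank": "pytextrank",
}


def missing_packages(packages=COURSE_PACKAGES):
    """Return the pip names whose modules cannot be found in this environment."""
    return [
        pip_name
        for pip_name, module in packages.items()
        # find_spec returns None when a top-level module is not installed
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing libraries; try: pip install " + " ".join(missing))
    else:
        print("All course libraries were found.")
```

Running the script inside a virtualenv checks only that environment, which is usually what you want if you keep the course libraries isolated from your system Python.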
Big Data Spain will issue the certificate for this course.
·You are a Python programmer and need to learn how to use available NLP packages
·You are a data scientist with some Python experience and need to leverage NLP and text mining
·You are interested in deep learning, chatbots, knowledge graphs, and related AI work, and want to understand the basics for preparing text data for those kinds of use cases
Bio of the instructor:
Director, Learning Group @ O'Reilly Media. Known as a "player/coach" data scientist, he has led innovative data teams building large-scale apps for several years. As a recognized expert in distributed systems, machine learning, and enterprise data workflows, Paco is also an advisor for Amplify Partners. He has 30+ years of technology industry experience, ranging from Bell Labs to early-stage start-ups.
Bio of the instructor:
Daniel Vila is co-founder of recogn.ai, a Madrid-based startup and spin-off from the Technical University of Madrid, building next-generation solutions for text analytics and content management using the latest AI techniques. Daniel holds a PhD in Artificial Intelligence from the Technical University of Madrid (2016) and has built one of the largest knowledge graphs in Spain, combining NLP and semantic technologies to power the datos.bne.es data service from the National Library of Spain. He also received the Fujitsu Laboratories of Europe Innovation Award in 2014 and is a regular contributor to spaCy, one of the most advanced industrial libraries for NLP in Python.