·Some programming in Python (we’ll use Python 3)
·Basic understanding of HTML and the DOM structure for web pages
·Access to a computer with a browser, where you can install Python packages and develop code at a command line; we recommend using `virtualenv`
·Know how to install Python libraries using `pip`, etc.
·Basic familiarity with `git` and use of GitHub
Downloads required in advance of the course:
·Install Python 3.5 (or later), git, virtualenv
·Install BeautifulSoup4, spaCy, datasketch, gensim, networkx, PyTorch
We will provide a GitHub link to everyone who registers for this course, including detailed instructions for setup, plus Jupyter notebooks for each of the course exercises and a Docker container with the required libraries and datasets pre-loaded.
Big Data Spain will issue the certificate for this course to prove subject matter competency
·You are a Python programmer and need to learn how to use available packages for NLP and deep learning
·You are a data scientist with some Python experience and need to leverage NLP, text mining, and deep learning
·You are interested in deep learning, chatbots, knowledge graphs, and related AI work, and want to understand the basics for preparing text data for those kinds of use cases
Bio of the instructor - Paco Nathan
Paco Nathan is known as a "player/coach", with core expertise in data science, natural language processing, machine learning, cloud computing; 35+ years tech industry experience, ranging from Bell Labs to early-stage start-ups. Co-chair JupyterCon, host of Executive Briefings at The AI Conf and Strata Data. Evangelist for Computable. Advisor for Amplify Partners, Deep Learning Analytics, Recognai. Recent roles: Director, Learning Group @ O'Reilly Media; Director, Community Evangelism @ Databricks and Apache Spark. Cited in 2015 as one of the Top 30 People in Big Data and Analytics by Innovation Enterprise.
Bio of the instructor - Daniel Vila Suero
Daniel Vila is co-founder of recogn.ai, a Madrid-based startup and spin-off from the Technical University of Madrid, building next generation solutions for text analytics and content management using the latest AI techniques. Daniel holds a PhD in Artificial Intelligence by the Technical University of Madrid (2016) and has built one the largest knowledge graphs in Spain combining NLP and semantic technologies: powering the datos.bne.es data service from the National Library of Spain. He also received the Fujitsu Laboratories of Europe Innovation Award in 2014 and is a stable contributor to spaCy, one of the most advanced industrial libraries for NLP in Python.