NATURAL LANGUAGE PROCESSING (NLP)

Synopsis: 

  • Working with text data 
  • Operations with text data 
  • Text vectorization 
  • Tokenization 
  • Lemmatization and Stemming 
  • Working with text libraries such as Spacy and NLTK 
  • Named Entity Recognition[Text Wrapping Break]Assertion 

Resources: 

Introduction to NLP  using Spacy   

NLTK 

Introduction to NLP 

NLP  

 

 

Big Data 

Synopsis: 

  • Working with pyspark 
  • Use join/filter/select/withColumn/groupBy and other spark operations 
  • File Load and Save in Parquet(or Delta) format 

Resources: