项目作者: anjanatiha
项目描述 :
Generates Bi-gram Grammar with Add One Smoothing and shows Zip-f's Law on Brown Corpus
高级语言: Jupyter Notebook
项目地址: git://github.com/anjanatiha/Big-gram-Grammer-Generator-with-Smoothing.git
Bigram Grammer Generator with Add One Smoothing
Domain : Natural Language Processing
Sub-Domain : Language Processing, Text Processing, Language Modeling
Techniques : Language Modeling, Bigram, Add-one Smoothing, Zip Law
Application Domain : Language Generation
Description
- From the SnapshotBROWN.pos.all.txt file extracted all word types and their frequencies.
- Sorted the list of word types in decreasing order based on their frequency.
Drew a chart showing the relationship between the rank in the ordered list and the frequency (Zipf’s Law). (Do not stem but do ignore punctuation.) - Generated a Bigram Grammar from the above file.
- Performed add-one smoothing.
- Showed the grammar before and after smoothing for the sentence “A similar resolution passed in the Senate”.
Languages : Python
Libraries :
Duration :
Current Version : v1.0.0.3
Last Update : 04.07.2018 (Time : 05:45 P.M)