vintasoftware/deduplication-slides

1 + 1 = 1 or Record Deduplication with Python

Jupyter Notebook from the talk "1 + 1 = 1 or Record Deduplication with Python", presented at PyBay 2018 and PyGotham 2018. The slides.ipynb version was presented at PyBay, while the slides-reduced.ipynb version was presented at PyGotham.

Running (Binder)

It's possible to run the slides-reduced.ipynb version online! Click here: Binder

Running (Local)

Install libpostal (instructions here), then run pip install -r requirements.txt. Finally, run jupyter notebook.
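Before launching the notebook, it can help to confirm that the Python side of the install worked. The following is a minimal sketch, not part of the repository: `missing_modules` is a hypothetical helper, and the module names `notebook` and `postal` are assumptions about what requirements.txt provides, so check that file for the actual list.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed module names (not taken from the repository): adjust them to
    # match the packages actually listed in requirements.txt.
    required = ["notebook", "postal"]
    missing = missing_modules(required)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules were found; jupyter notebook should start.")
```

This only checks that the modules can be located by the import system. It does not verify that the libpostal C library itself is installed or that its data files are present.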
