A collection of Livebook notebooks.
Livebook is also a great way to get your feet wet with Elixir concepts; it works like a powerful language scratchpad.
- Install a compatible Elixir and Erlang version. You may wish to use asdf.
  - We have included a `.tool-versions` file to support local Elixir versions.
- Install Livebook via escript (preferred) or via the Desktop app.
- From this project folder, run `livebook server index.livemd`. This opens the main navigation page where you can access all the other Livebook examples.
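The steps above can be sketched as a short shell session, assuming the asdf and escript routes (adjust if you use the Desktop app instead):

```shell
# Install the Elixir/Erlang versions pinned in .tool-versions (assumes asdf)
asdf install

# Install Livebook as an escript (the preferred route above)
mix escript.install hex livebook

# From this project folder, start Livebook on the index notebook
livebook server index.livemd
```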
I'm always on the lookout for Elixir job posts, so when I stumbled on SpiderMan as a crawler and its Livebook example, I was intrigued. The example crawls <https://elixirjobs.net/> to create a CSV of jobs with the columns `link`, `title`, `sub_title`, `date`, `workplace`, and `type`.
I wanted to take the example a few steps further:
- Crawl the newest 25 pages instead of all 63 at the time of writing. We don't want to crawl the entire site, and ~25 pages gives us about the last year's worth of posts.
- Reorder and change the columns to `date`, `title`, `company`, `location`, `workplace`, `type`, `link`, and `page_number`.
- Convert the date to `yyyy-mm-dd` format, my ugliest Elixir code so far.
- Sort the CSV by date descending to see the latest jobs first.
- Add sections to make navigation a little easier.
- In the section marked Sorting the Results, I left in the code that evaluates to `** (SyntaxError) nofile:5:1: unexpected token: "" (column 1, code point U+200B)`, since U+200B is a zero-width space, cleverly hidden in a paste job.
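As a rough sketch of the date cleanup described above (the site's actual date format may differ, so the input string here is a hypothetical example): once dates are in `yyyy-mm-dd`, a plain string sort already gives the descending order we want.

```elixir
defmodule DateNormalizer do
  # Map of abbreviated month names; assumes English month abbreviations.
  @months %{
    "Jan" => 1, "Feb" => 2, "Mar" => 3, "Apr" => 4,
    "May" => 5, "Jun" => 6, "Jul" => 7, "Aug" => 8,
    "Sep" => 9, "Oct" => 10, "Nov" => 11, "Dec" => 12
  }

  # Normalize a scraped date like "12 Jan, 2023" into ISO yyyy-mm-dd.
  def to_iso(raw) do
    [day, month, year] =
      raw
      |> String.replace(",", "")
      |> String.split()

    {:ok, date} =
      Date.new(String.to_integer(year), @months[month], String.to_integer(day))

    Date.to_iso8601(date)
  end
end

DateNormalizer.to_iso("12 Jan, 2023")
# => "2023-01-12"
```

Because ISO dates sort lexicographically, `Enum.sort_by(rows, & &1.date, :desc)` then puts the latest jobs first.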
This largely builds on the Elixir Jobs base to crawl <https://elixir-radar.com/jobs> to create a CSV of jobs, with some notable exceptions:
- I hardcoded the page numbers as I'm not sure of the pagination style. Pages `1-6` seem to follow a pattern so far, but we can address this later.
- There's no date, so we sort by page number descending.
- There's a somewhat larger `description` field that we could've pushed to the end.
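The hardcoded pagination above can be sketched like this, assuming the same `page=number` query-string pattern the other sites use:

```elixir
# Base URL from the notebook; pages 1-6 are hardcoded for now since the
# pagination style isn't confirmed.
base = "https://elixir-radar.com/jobs"

urls = Enum.map(1..6, fn n -> "#{base}?page=#{n}" end)
# First entry: "https://elixir-radar.com/jobs?page=1"
```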
This builds on the Elixir Radar jobs base to crawl <https://elixir-companies.com/en/companies> to create a CSV of companies.
- Due to the way the DOM is structured, fields aren't in independent elements. There's text with `<br>` tags that translate to `\n` when parsing.
- This involved pulling the last 1 or 2 elements from the end of the list, as the first element was always one piece of information and the remaining portions covered one or more fields.
- The site feels so different to parse that it almost felt like starting from scratch.
- While Elixir Companies uses infinite scroll in the browser to fetch pages, it follows what I presume is a standard `page=number` query-string format that is identical across the 3 sites.

To me, these notebooks showcase how quickly I got up and running with `spider_man` compared to other web crawling techniques. I'm a huge fan now.
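The `<br>`-splitting approach described above can be sketched roughly like this; the sample text and field names are invented for illustration:

```elixir
# Hypothetical company entry whose fields arrived as <br>-separated text;
# the HTML parser renders <br> as "\n", so we split and peel fields off
# the end of the list.
raw = "Acme Inc\nAustin, TX\nFintech"

parts = String.split(raw, "\n", trim: true)

# The first element is always one piece of information...
company = List.first(parts)

# ...and the last 1-2 elements carry the remaining fields.
[location, industry] = Enum.take(parts, -2)

{company, location, industry}
# => {"Acme Inc", "Austin, TX", "Fintech"}
```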
Using the excellent `req` library, we want to get the HTML of the job post URL and convert the contents to markdown.
Job Application Fields to Markdown
Using the excellent `req` library, we want to get the HTML of the job post application URL and convert all form fields to markdown.
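A minimal sketch of the fetch-and-extract step, assuming `req` for the HTTP request and `floki` for parsing; the markdown conversion is reduced here to labeling form fields, and the HTML snippet, URL, and selectors are illustrative:

```elixir
Mix.install([{:req, "~> 0.4"}, {:floki, "~> 0.36"}])

# In the notebook this would come from Req.get!(application_url).body,
# where application_url is the job post application URL. We use a
# literal snippet here to keep the sketch self-contained.
html = """
<form>
  <label>Name</label><input name="name">
  <label>Resume</label><input type="file" name="resume">
</form>
"""

{:ok, doc} = Floki.parse_document(html)

# Render each labeled form field as a markdown bullet (illustrative).
markdown =
  doc
  |> Floki.find("form label")
  |> Enum.map(fn label -> "- **#{Floki.text(label)}**" end)
  |> Enum.join("\n")

# => "- **Name**\n- **Resume**"
```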
A scratchpad for various code doodles.
Odd behavior and awkward things I've run into in my experiences with Livebook. I'm by no means an Elixir expert, though I'm getting more up to speed all the time.