ExtractLinks.jl

Extract links from a webpage

Author LilithHafner

Github

Popularity: 1 Star

Updated Last: 1 Year Ago

Started In: January 2024

ExtractLinks

Extract links from a web-page

julia> extract_links("https://julialang.org")
210-element Vector{String}:
 "https://julialang.org/libs/bootstrap/bootstrap.min.css"
 "https://julialang.org/css/app.css"
 "https://julialang.org/css/franklin.css"
 ⋮
 "https://julialang.org/libs/bootstrap/bootstrap.min.js"
 "https://www.youtube.com/iframe_api"

That's all.

Features

Resolves relative links
Heuristics to determine what is and is not a link
Handles malformed web-pages gracefully
Low compile and precompile times
Option to avoid web query if you provide the body of the page as a keyword argument
Filters out duplicates

Required Packages

Used By Packages

No packages found.