ExtractLinks.jl

Extract links from a webpage
Author LilithHafner
Popularity
1 Star
Updated Last
8 Months Ago
Started In
January 2024

ExtractLinks

Build Status Coverage PkgEval Aqua

Extract links from a web-page

julia> extract_links("https://julialang.org")
210-element Vector{String}:
 "https://julialang.org/libs/bootstrap/bootstrap.min.css"
 "https://julialang.org/css/app.css"
 "https://julialang.org/css/franklin.css"
 ⋮
 "https://julialang.org/libs/bootstrap/bootstrap.min.js"
 "https://www.youtube.com/iframe_api"

That's all.

Features

  • Resolves relative links
  • Heuristics to determine what is and is not a link
  • Handles malformed web-pages gracefully
  • Low compile and precompile times
  • Option to avoid web query if you provide the body of the page as a keyword argument
  • Filters out duplicates

Used By Packages

No packages found.