Extract links from a web page
julia> extract_links("https://julialang.org")
210-element Vector{String}:
"https://julialang.org/libs/bootstrap/bootstrap.min.css"
"https://julialang.org/css/app.css"
"https://julialang.org/css/franklin.css"
⋮
"https://julialang.org/libs/bootstrap/bootstrap.min.js"
"https://www.youtube.com/iframe_api"
That's all.
Features
- Resolves relative links
- Uses heuristics to decide what is and is not a link
- Handles malformed web pages gracefully
- Low compile and precompile times
- Option to skip the web request by passing the body of the page as a keyword argument
- Filters out duplicates
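
To make a few of these features concrete, here is a minimal sketch of how relative links could be resolved and duplicates filtered when the page body is already in hand. It is not this package's implementation: `resolve_link` and `sketch_extract_links` are hypothetical names, the regex is a crude stand-in for the real heuristics, and the resolver is simplified (it ignores `..` segments, queries, and fragments).

```julia
# Illustrative sketch only; both function names are hypothetical.

# Resolve `link` against the absolute URL `base` (simplified rules).
function resolve_link(base::AbstractString, link::AbstractString)
    # Already absolute, e.g. "https://example.org/x"
    occursin(r"^[a-zA-Z][a-zA-Z0-9+.-]*://", link) && return String(link)

    m = match(r"^([a-zA-Z][a-zA-Z0-9+.-]*:)//([^/]+)(/.*)?$", base)
    m === nothing && throw(ArgumentError("base must be an absolute URL: $base"))
    scheme, host = m.captures[1], m.captures[2]
    path = something(m.captures[3], "/")  # a bare host has the path "/"

    # Protocol-relative link, e.g. "//cdn.example.org/x.js"
    startswith(link, "//") && return string(scheme, link)
    # Root-relative link, e.g. "/css/app.css"
    startswith(link, "/") && return string(scheme, "//", host, link)

    # Path-relative link: append to the directory part of the base path
    dir = path[1:findlast('/', path)]
    return string(scheme, "//", host, dir, link)
end

# Collect href/src attribute values from `body`, resolve them against `base`,
# and drop duplicates while keeping first-seen order.
function sketch_extract_links(base::AbstractString, body::AbstractString)
    pattern = r"(?:href|src)\s*=\s*[\"']([^\"']+)[\"']"i
    raw = [String(m.captures[1]) for m in eachmatch(pattern, body)]
    return unique(resolve_link(base, l) for l in raw)
end

body = """
<a href="/downloads/">Download</a>
<a href="/downloads/">Download again</a>
<script src="https://example.org/app.js"></script>
<a href="blog/post.html">Post</a>
"""

links = sketch_extract_links("https://julialang.org", body)
foreach(println, links)
```

Because the body is passed in directly, the sketch performs no web request, and the repeated `/downloads/` link appears only once in the result.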