Package: rurl 1.4.0

Bart Turczynski
rurl: Parse, Clean, and Normalize URLs
A lightweight toolkit for extracting structured information from URLs. Includes functions for parsing, normalizing protocols, extracting domains, and constructing clean URLs. Domain and public-suffix extraction is delegated to the 'pslr' package, which implements the Public Suffix List from <https://publicsuffix.org>. Punycode and IDNA encoding is handled by the 'punycoder' package.
Authors:
rurl_1.4.0.tar.gz
rurl_1.4.0.zip(r-4.7)rurl_1.4.0.zip(r-4.6)rurl_1.4.0.zip(r-4.5)
rurl_1.4.0.tgz(r-4.6-any)rurl_1.4.0.tgz(r-4.5-any)
rurl_1.4.0.tar.gz(r-4.7-any)rurl_1.4.0.tar.gz(r-4.6-any)
rurl_1.4.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
rurl/json (API)
NEWS
| # Install 'rurl' in R: |
| install.packages('rurl', repos = c('https://bart-turczynski.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/bart-turczynski/rurl/issues
Pkgdown/docs site:https://bart-turczynski.github.io
domain-name-checkerdomain-name-detectionpunycodeseotldtld-checkertld-verificationtldsurl-checkerurl-cleanerurl-cleaningurl-parserurl-parsing
Last updated from:efafa424fe. Checks:9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 150 | ||
| source / vignettes | OK | 177 | ||
| linux-release-x86_64 | OK | 183 | ||
| macos-release-arm64 | OK | 83 | ||
| macos-oldrel-arm64 | OK | 115 | ||
| windows-devel | OK | 96 | ||
| windows-release | OK | 84 | ||
| windows-oldrel | OK | 99 | ||
| wasm-release | OK | 110 |
Exports:canonical_joinget_clean_urlget_domainget_fragmentget_hostget_parse_statusget_passwordget_pathget_portget_queryget_schemeget_subdomainget_tldget_userget_userinforurl_cache_configrurl_cache_inforurl_clear_cachessafe_parse_urlsafe_parse_urls
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Canonical Join of Two URL Sets (Base R Version) | canonical_join |
| Get cleaned URLs | get_clean_url |
| Get domain names | get_domain |
| Get URL fragments | get_fragment |
| Get URL hosts | get_host |
| Get the parse status of URLs | get_parse_status |
| Get URL passwords | get_password |
| Get URL paths | get_path |
| Get URL ports | get_port |
| Get URL query strings | get_query |
| Get URL schemes | get_scheme |
| Get URL subdomains | get_subdomain |
| Extract the top-level domain (TLD) from a URL | get_tld |
| Get URL user names | get_user |
| Get URL userinfo | get_userinfo |
| Configure the rurl memoization caches | rurl_cache_config |
| Inspect the rurl memoization caches | rurl_cache_info |
| Clear all rurl caches | rurl_clear_caches |
| Parse multiple URLs and return a data.frame of components | safe_parse_urls |