R Package Scholar
240,356

janitor: Simple Tools for Examining and Cleaning Dirty Data

Sam Firke  Bill Denney  Chris Haid  Ryan Knight  Malte Grosser  Jonathan Zadra   View description and downloadsView dependenciesGitHub project

2016 Published
0 Citations
6 Authors
13 Revisions
2.2.1 Version
MIT License
Referenced by ⇅ Year
BARIS: Access and Import Data from the French Open Data Portal (Version 1.1.3)

2020
DAMisc: Dave Armstrong"s Miscellaneous Functions (Version 1.7.2)

2010
parlitools: Tools for Analysing UK Politics (Version 0.4.1)

2017
toRvik: Extensive and Tidy NCAA Men"s College Basketball Data (Version 1.1.1)

2022
GNGTools: Tools for Go/No-Go Decision-Making Framework (Version 1.0.0)

2022
worldriskpollr: Aggregated Survey Data from the World Risk Poll (Version 0.7.2)

2023
LDLcalc: Calculate and Predict the Low Density Lipoprotein Values (Version 2.1)

2021
CGPfunctions: Powell Miscellaneous Functions for Teaching and Learning Statistics (Version 0.6.3)

2018
HMDHFDplus: Read Human Mortality Database and Human Fertility Database Data from the Web (Version 2.0.8)

2015
LightLogR: Process Data from Wearable Light Loggers and Optical Radiation Dosimeters (Version 0.10.0)

2024
ThermalSampleR: Calculate Sample Sizes Required for Critical Thermal Limits Experiments (Version 0.1.2)

2022
dams: Dams in the United States from the National Inventory of Dams (NID) (Version 0.3.0)

2014
ech: Downloading and Processing Microdata from ECH-INE (Uruguay) (Version 0.1.3)

2020
epe4md: EPE's 4MD Model to Forecast the Adoption of Distributed Generation (Version 0.1.4)

2023
epitweetr: Early Detection of Public Health Threats from 'Twitter' Data (Version 2.2.16)

2020
extractox: Extract Tox Info from Various Databases (Version 1.2.0)

2024
fastRhockey: Functions to Access Premier Hockey Federation and National Hockey League Play by Play Data (Version 0.4.0)

2021
hudr: Providing Data from the US Department of Housing and Urban Development (Version 1.2.0)

2022
industRial: Data, Functions and Support Materials from the Book "industRial Data Science" (Version 0.1.0)

2021
ispdata: Access Data from the Public Security Institute of the State of Rio De Janeiro (Version 1.1.2)

2023
lterdatasampler: Educational Dataset Examples from the Long Term Ecological Research Program (Version 0.1.1)

2023
madshapR: Functions to Support Data Management and Processing Using the Maelstrom Research Approach (Version 2.0.0)

2023
mnis: Easy Downloading Capabilities for the Members' Name Information Service (Version 0.3.1)

2016
mtsta: Accessing the Red List of Montane Tree Species of the Tropical Andes (Version 0.0.0.1)

2023
nfl4th: Functions to Calculate Optimal Fourth Down Decisions in the National Football League (Version 1.0.4)

2021
protti: Bottom-Up Proteomics and LiP-MS Quality Control and Data Analysis Tools (Version 1.0.0)

2021
quicR: RT-QuIC Data Formatting and Analysis (Version 2.1.0)

2024
readapra: Download and Tidy Data from the Australian Prudential Regulation Authority (Version 0.2.1)

2025
scCustomize: Custom Visualizations & Functions for Streamlined Analyses of Single Cell Sequencing (Version 3.2.4)

2022
speechbr: Access the Speechs and Speaker's Informations of House of Representatives of Brazil (Version 2.0.0)

2022
tidyDenovix: Cleans Spectrophotometry Data Obtained from the Denovix DS-11 Instrument (Version 2.1.0)

2024
tntpr: Data Analysis Tools Customized for TNTP (Version 1.2.1)

2024
CGPfunctions Statistics: (Version )

0
bndesr: Access Data from the Brazilian Development Bank (BNDES) (Version 1.0.4)

2023
dams (NID): (Version )

0
epiCleanr: A Tidy Solution for Epidemiological Data (Version 0.2.0)

2023
gimap: Calculate Genetic Interactions for Paired CRISPR Targets (Version 1.1.2)

2025
fastml: Guarded Resampling Workflows for Safe and Automated Machine Learning in R (Version 0.7.6)

2024
rmdl: Language to Manage Many Models (Version 0.1.0)

2024
tidybins: Make Tidy Bins (Version 0.1.1)

2021
autostats: Auto Stats (Version 0.4.1)

2021
ConsRankClass: Classification and Clustering of Preference Rankings (Version 1.0.2)

2021
DCPO: Dynamic Comparative Public Opinion (Version 0.5.3)

2020
REDCapDM: 'REDCap' Data Management (Version 1.0.0)

2022
f1dataR: Access Formula 1 Data (Version 2.0.1)

2023
fabR: Wrapper Functions Collection Used in Data Pipelines (Version 2.1.1)

2023
APCalign: Resolving Plant Taxon Names Using the Australian Plant Census (Version 1.1.3)

2023
AdverseEvents: 'shiny' Application for Adverse Event Analysis of 'OnCore' Data (Version 0.0.4)

2024
BAwiR: Analysis of Basketball Data (Version 1.4.3)

2018
BFS: Get Data from the Swiss Federal Statistical Office (Version 0.7.0)

2019
BIGr Species: (Version )

0
BeeBDC: Occurrence Data Cleaning (Version 1.3.2)

2023
EDCimport: Import Data from EDC Software (Version 0.7.0)

2022
FlickrAPI: Access to Flickr API (Version 0.1.0.1)

2019
HLMdiag: Diagnostic Tools for Hierarchical (Multilevel) Linear Models (Version 0.5.1)

2011
IMD: Index of Multiple Deprivation Data for the UK (Version 1.2.2)

2021
MAGMA.R: MAny-Group MAtching (Version 1.0.4)

2024
MEAanalysis: Analyse and Visualise Multi Electrode Array Burst Data (Version 0.1.0)

2025
NHSRdatasets: NHS and Healthcare-Related Data for Education and Training (Version 0.3.0)

2019
OxSR: Soil Iron Oxides via Diffuse Reflectance (Version 1.0.1)

2025
PKbioanalysis: Pharmacokinetic Bioanalysis Experiments Design and Exploration (Version 0.4.0)

2024
Rmonize: Tools for Data Harmonization (Version 2.0.0)

2023
SangerTools: Tools for Population Health Management Analytics (Version 1.0.2)

2022
SingleCaseES: A Calculator for Single-Case Effect Sizes (Version 0.7.3)

2018
TangledFeatures: Feature Selection in Highly Correlated Spaces (Version 0.1.1)

2023
ThermalSampleR Experiments: (Version )

0
WaterBalanceR: Calculate High Resolution Water Balance of Starch Potatoes (Version 0.1.19)

2025
acledR: Manipulate ACLED Data (Version 1.0.1)

2025
aggreCAT: Mathematically Aggregating Expert Judgments (Version 1.0.0)

2025
babyTimeR: Parse Output from 'BabyTime' Application (Version 0.1.0)

2025
bambooHR: A Wrapper to the 'BambooHR' API (Version 0.1.1)

2022
baseballr: Acquiring and Analyzing Baseball Data (Version 1.6.0)

2022
bayesrules: Datasets and Supplemental Functions from Bayes Rules! Book (Version 0.0.3)

2021
bluebike: Blue Bike Comprehensive Data (Version 0.0.3)

2022
brfinance: Simplified Access to Brazilian Financial and Macroeconomic Data (Version 0.6.0)

2025
cepumd: Calculate Consumer Expenditure Survey (CE) Annual Estimates (Version 2.1.0)

2024
cfbfastR: Access College Football Play by Play Data (Version 2.0.0)

2021
cleanepi: Clean and Standardize Epidemiological Data (Version 1.1.2)

2024
clockify: A Wrapper for the 'Clockify' API (Version 0.1.7)

2021
connected: Visualize and Improve Connectedness of Factors in Tables (Version 1.1)

2025
covid19india: Pulling Clean Data from Covid19india.org (Version 0.1.4)

2021
crypto2: Download Crypto Currency Data from 'CoinMarketCap' without 'API' (Version 2.0.5)

2021
dacc: Detection and Attribution Analysis of Climate Change (Version 0.0-7)

2023
dail: Data from Access to Information Law (Version 1.5.2)

2022
datacult: Exploratory Data Analysis for Public Policy Applied to Culture (Version 0.1.0)

2026
datazoom.amazonia: Simplify Access to Data from the Amazon Region (Version 1.1.0)

2021
debiasedTrialEmulation: Pipeline for Debiased Target Trial Emulation (Version 0.1.0)

2025
diffdfs: Compute the Difference Between Data Frames (Version 0.9.0)

2022
discord: Functions for Discordant Kinship Modeling (Version 1.3)

2017
drpop: Efficient and Doubly Robust Population Size Estimation (Version 0.0.3)

2021
dySEM: Dyadic Structural Equation Modeling (Version 1.4.1)

2024
eatRep: Educational Assessment Tools for Replication Methods (Version 0.15.2)

2021
edar: Convenient Functions for Exploratory Data Analysis (Version 0.0.6)

2025
epe4md Generation: (Version )

0
etl: Extract-Transform-Load Framework for Medium Data (Version 0.4.2)

2016
evanverse: Utility Functions for Data Analysis and Visualization (Version 0.3.7)

2025
excluder: Checks for Exclusion Criteria in Online Data (Version 0.5.2)

2021
fastrep: Time-Saving Package for Creating Reports (Version 0.7)

2022
filebin: Wrapper for the Filebin File Sharing API (Version 0.0.6)

2021
fitbitr: Interface with the 'Fitbit' API (Version 0.3.0)

2021
fitzRoy: Easily Scrape and Process AFL Data (Version 1.6.0)

2019
flightsbr: Download Flight and Airport Data from Brazil (Version 1.1.1)

2022
formods: 'Shiny' Modules for General Tasks (Version 0.2.2)

2023
framecleaner: Clean Data Frames (Version 0.2.1)

2021
galaxias: Describe, Package, and Share Biodiversity Data (Version 0.1.2)

2025
geofi: Access Finnish Geospatial Data (Version 1.1.0)

2021
getLattes: Import and Process Data from the 'Lattes' Curriculum Platform (Version 1.0.0)

2020
ggdiagram: Object-Oriented Diagram Plots with 'ggplot2' (Version 0.1.1)

2025
gooseR: R Integration for 'Goose' AI (Version 0.1.1)

2025
govinfoR: A 'GovInfo' API Wrapper (Version 0.0.3)

2024
healthbR: Access Brazilian Public Health Data (Version 0.1.1)

2026
healthyR.data: Data Only Package to 'healthyR' (Version 1.2.0)

2020
healthyR.ai: The Machine Learning and AI Modeling Companion to 'healthyR' (Version 0.1.1)

2021
hlaR: Tools for HLA Data (Version 1.0.0)

2021
hmsidwR: Health Metrics and the Spread of Infectious Diseases (Version 1.1.2)

2024
hoopR: Access Men's Basketball Play by Play Data (Version 2.1.0)

2021
hudr Development: (Version )

0
influential: Identification and Classification of the Most Influential Nodes (Version 2.2.9)

2020
irtQ: Unidimensional Item Response Theory Modeling (Version 1.0.0)

2023
itraxR: Itrax Data Analysis Tools (Version 1.13.2)

2021
k5: Kiernan Nicholls Miscellaneous (Version 0.2.1)

2023
khisr: An R Client to Retrieve Data from DHIS2 (Version 1.0.6)

2024
lacrmr: Connect to the 'Less Annoying CRM' API (Version 1.0.5)

2020
logolink: An Interface for Running 'NetLogo' Simulations (Version 1.0.0)

2025
metricminer: Mine Metrics from Common Places on the Web (Version 1.0.1)

2024
modeltime: The Tidymodels Extension for Time Series Modeling (Version 1.3.3)

2020
moderndive: Tidyverse-Friendly Introductory Linear Regression (Version 0.7.0)

2018
motherduck: Utilities for Managing a 'Motherduck' Database (Version 0.2.1)

2025
mtsta Andes: (Version )

0
mxnorm: Apply Normalization Methods to Multiplexed Images (Version 1.1.0)

2022
nflfastR: Functions to Efficiently Access NFL Play by Play Data (Version 5.1.0)

2020
nichetools: Complementary Package to 'nicheROVER' and 'SIBER' (Version 0.3.3)

2024
nomisdata: Access 'Nomis' UK Labour Market Data and Statistics (Version 0.1.1)

2025
octopus: A Database Management Tool (Version 0.4.2)

2023
oddsapiR: Access Live Sports Odds from the Odds API (Version 0.0.3)

2022
ohsome: An 'ohsome API' Client (Version 0.2.2)

2023
omicsTools: Omics Data Process Toolbox (Version 1.1.7)

2023
opinAr: Argentina's Public Opinion Toolbox (Version 1.0.0)

2024
otpr: An R Wrapper for the 'OpenTripPlanner' REST API (Version 0.5.1)

2019
plotor: Odds Ratio Tools for Logistic Regression (Version 0.8.0)

2024
presenter: Present Data with Style (Version 0.1.2)

2021
provolleyballr: Extract Data from US Women's Professional Volleyball Websites (Version 0.1.0)

2026
public.ctn0094data: De-Identified Data from CTN-0094 (Version 1.0.6)

2023
qualitycontrol: Unified Framework for Data Quality Control (Version 0.1.0)

2022
questionr: Functions to Make Surveys Processing Easier (Version 0.8.2)

2013
quid: Bayesian Mixed Models for Qualitative Individual Differences (Version 0.0.1)

2021
r4ds.tutorials: Tutorials for "R for Data Science" (Version 0.3.3)

2023
rSRD: Sum of Ranking Differences Statistical Test (Version 0.1.8)

2022
rattle: Graphical User Interface for Data Science in R (Version 5.5.1)

2013
readapra Authority: (Version )

0
redatam: Import 'REDATAM' Files (Version 2.1.2)

2024
repoRter.nih: R Interface to the 'NIH RePORTER Project' API (Version 0.1.4)

2022
reservoirnet: Reservoir Computing and Echo State Networks (Version 0.3.0)

2023
rfars: Download and Analyze Crash Data (Version 2.0.2)

2022
scdhlm: Estimating Hierarchical Linear Models for Single-Case Designs (Version 0.7.4)

2016
sitrep: Report Templates and Helper Functions for Applied Epidemiology (Version 0.4.0)

2025
spotifyr: R Wrapper for the 'Spotify' Web API (Version 2.2.5)

2017
starling Records: (Version )

0
tabbycat: Tabulate and Summarise Categorical Data (Version 0.18.0)

2021
textrecipes: Extra 'Recipes' for Text Processing (Version 1.1.0)

2018
theftdlc: Analyse and Interpret Time Series Features (Version 0.2.1)

2024
threesixtygiving: Download Charitable Grants from the '360Giving' Platform (Version 0.2.2)

2020
tidyDenovix Instrument: (Version )

0
tidyREDCap: Helper Functions for Working with 'REDCap' Data (Version 1.1.3)

2020
tidycensuskr: Easy Access for South Korea Census Data and Boundaries (Version 0.2.7)

2025
tidyquant: Tidy Quantitative Financial Analysis (Version 1.0.11)

2016
unmconf: Modeling with Unmeasured Confounding (Version 1.0.0)

2023
validata: Validate Data Frames (Version 0.1.0)

2021
vvauditor: Creates Assertion Tests (Version 0.8.0)

2023
weathR: Interact with the U.S. National Weather Service API (Version 0.1.0)

2025
wehoop: Access Women's Basketball Play by Play Data (Version 2.1.0)

2021
worldfootballR: Extract and Clean World Football (Soccer) Data (Version 0.6.2)

2021
yfR: Downloads and Organizes Financial Data from Yahoo Finance (Version 1.1.2)

2022
zoomr: Connect to Your 'Zoom' Data (Version 0.4.0)

2023

RPKG Scholar presents a tabulation of an author's contribution in the development of R packages stored in the Comprehensive R Archive Network (CRAN). Within this site, we consider package dependencies (suggests,imports,depends,enhances) as citations because we believe that using one's package to develop another is tantamount to citing the author of the package being imported, suggested or enhanced.

rpkg.net © 2022 - 2026 Obi Obianom