Description
read_csv() and read_tsv() are special cases of the more general read_delim(). They're useful for reading the most common types of flat file data: comma separated values and tab separated values, respectively. read_csv2() uses ; for the field separator and , for the decimal point. This format is common in some European countries.
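As a minimal sketch (the literal strings below are made up for illustration), the same values read differently under the two conventions:

```r
library(readr)

# Semicolon-separated fields, comma as the decimal mark
df <- read_csv2(I("x;y\n1,5;2,25"), show_col_types = FALSE)
df$x  # 1.5
df$y  # 2.25
```

With read_csv(), the same bytes would parse as a single column, since commas are then decimal points rather than separators only in the read_csv2() convention.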
Usage
read_delim(
  file, delim = NULL, quote = "\"", escape_backslash = FALSE,
  escape_double = TRUE, col_names = TRUE, col_types = NULL,
  col_select = NULL, id = NULL, locale = default_locale(),
  na = c("", "NA"), quoted_na = TRUE, comment = "", trim_ws = FALSE,
  skip = 0, n_max = Inf, guess_max = min(1000, n_max),
  name_repair = "unique", num_threads = readr_threads(),
  progress = show_progress(), show_col_types = should_show_types(),
  skip_empty_rows = TRUE, lazy = should_read_lazy()
)

read_csv(
  file, col_names = TRUE, col_types = NULL, col_select = NULL,
  id = NULL, locale = default_locale(), na = c("", "NA"),
  quoted_na = TRUE, quote = "\"", comment = "", trim_ws = TRUE,
  skip = 0, n_max = Inf, guess_max = min(1000, n_max),
  name_repair = "unique", num_threads = readr_threads(),
  progress = show_progress(), show_col_types = should_show_types(),
  skip_empty_rows = TRUE, lazy = should_read_lazy()
)

read_csv2(
  file, col_names = TRUE, col_types = NULL, col_select = NULL,
  id = NULL, locale = default_locale(), na = c("", "NA"),
  quoted_na = TRUE, quote = "\"", comment = "", trim_ws = TRUE,
  skip = 0, n_max = Inf, guess_max = min(1000, n_max),
  progress = show_progress(), name_repair = "unique",
  num_threads = readr_threads(), show_col_types = should_show_types(),
  skip_empty_rows = TRUE, lazy = should_read_lazy()
)

read_tsv(
  file, col_names = TRUE, col_types = NULL, col_select = NULL,
  id = NULL, locale = default_locale(), na = c("", "NA"),
  quoted_na = TRUE, quote = "\"", comment = "", trim_ws = TRUE,
  skip = 0, n_max = Inf, guess_max = min(1000, n_max),
  progress = show_progress(), name_repair = "unique",
  num_threads = readr_threads(), show_col_types = should_show_types(),
  skip_empty_rows = TRUE, lazy = should_read_lazy()
)
Value
A tibble(). If there are parsing problems, a warning will alert you. You can retrieve the full details by calling problems() on your dataset.
Arguments
file
Either a path to a file, a connection, or literal data (either a single string or a raw vector). Files ending in .gz, .bz2, .xz, or .zip will be automatically uncompressed. Files starting with http://, https://, ftp://, or ftps:// will be automatically downloaded. Remote gz files can also be automatically downloaded and decompressed. Literal data is most useful for examples and tests. To be recognised as literal data, the input must be either wrapped with I(), be a string containing at least one new line, or be a vector containing at least one string with a new line. Using a value of clipboard() will read from the system clipboard.

delim
Single character used to separate fields within a record.

quote
Single character used to quote strings.

escape_backslash
Does the file use backslashes to escape special characters? This is more general than escape_double as backslashes can be used to escape the delimiter character, the quote character, or to add special characters like \\n.

escape_double
Does the file escape quotes by doubling them? i.e. If this option is TRUE, the value """" represents a single quote, \".

col_names
Either TRUE, FALSE or a character vector of column names.
If TRUE, the first row of the input will be used as the column names, and will not be included in the data frame. If FALSE, column names will be generated automatically: X1, X2, X3 etc.

If col_names is a character vector, the values will be used as the names of the columns, and the first row of the input will be read into the first row of the output data frame.

Missing (NA) column names will generate a warning, and be filled in with dummy names ...1, ...2 etc. Duplicate column names will generate a warning and be made unique; see name_repair to control how this is done.
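The three forms of col_names can be sketched on a small literal input (the names "a" and "b" are made up for illustration):

```r
library(readr)

# FALSE: no header row; names are generated automatically as X1, X2, ...
df1 <- read_csv(I("1,2\n3,4"), col_names = FALSE, show_col_types = FALSE)
names(df1)  # "X1" "X2"

# A character vector: supply the names yourself; the first input row stays data
df2 <- read_csv(I("1,2\n3,4"), col_names = c("a", "b"), show_col_types = FALSE)
nrow(df2)  # 2
```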
col_types
One of NULL, a cols() specification, or a string. See vignette("readr") for more details.

If NULL, all column types will be inferred from guess_max rows of the input, interspersed throughout the file. This is convenient (and fast), but not robust. If the guessed types are wrong, you'll need to increase guess_max or supply the correct types yourself.

Column specifications created by list() or cols() must contain one column specification for each column. If you only want to read a subset of the columns, use cols_only().
Alternatively, you can use a compact string representation where each character represents one column:
c = character
i = integer
n = number
d = double
l = logical
f = factor
D = date
T = date time
t = time
? = guess
_ or - = skip
By default, reading a file without a column specification will print a message showing what readr guessed they were. To remove this message, set show_col_types = FALSE or set options(readr.show_col_types = FALSE).
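The compact string form listed above can be sketched as follows (the column names and values are made up for illustration):

```r
library(readr)

# "di_": first column double, second integer, third skipped entirely
df <- read_csv(I("x,y,z\n1.5,2,drop-me\n3.5,4,drop-me"), col_types = "di_")
names(df)  # "x" "y"
```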
col_select
Columns to include in the results. You can use the same mini-language as dplyr::select() to refer to the columns by name. Use c() to use more than one selection expression. Although this usage is less common, col_select also accepts a numeric column index. See ?tidyselect::language for full details on the selection language.
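For instance, selecting by name and by numeric index on a small literal input (a sketch; the column names are made up):

```r
library(readr)

txt <- I("a,b,c\n1,2,3")

# Tidyselect by name, or by numeric column index
by_name  <- read_csv(txt, col_select = c(a, c), show_col_types = FALSE)
by_index <- read_csv(txt, col_select = c(1, 3), show_col_types = FALSE)
names(by_name)  # "a" "c"
```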
id
The name of a column in which to store the file path. This is useful when reading multiple input files and there is data in the file paths, such as the data collection date. If NULL (the default) no extra column is created.
locale
The locale controls defaults that vary from place to place. The default locale is US-centric (like R), but you can use locale() to create your own locale that controls things like the default time zone, encoding, decimal mark, big mark, and day/month names.
na
Character vector of strings to interpret as missing values. Set this option to character() to indicate no missing values.
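A short sketch of a custom na marker (the "." convention and the values are made up for illustration):

```r
library(readr)

# Treat "." as missing instead of the defaults c("", "NA")
df <- read_csv(I("x,y\n1,.\n.,2"), na = ".", show_col_types = FALSE)
df$x  # 1 NA
```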
quoted_na
Should missing values inside quotes be treated as missing values (the default) or strings? This parameter is soft deprecated as of readr 2.0.0.
comment
A string used to identify comments. Any text after the comment characters will be silently ignored.
trim_ws
Should leading and trailing whitespace (ASCII spaces and tabs) be trimmed from each field before parsing it?
skip
Number of lines to skip before reading data. If comment is supplied, any commented lines are ignored after skipping.
n_max
Maximum number of lines to read.
guess_max
Maximum number of lines to use for guessing column types. Will never use more than the number of lines read. See vignette("column-types", package = "readr") for more details.
name_repair
Handling of column names. The default behaviour is to ensure column names are "unique". Various repair strategies are supported:

"minimal": No name repair or checks, beyond basic existence of names.
"unique" (default value): Make sure names are unique and not empty.
"check_unique": No name repair, but check they are unique.
"unique_quiet": Repair with the unique strategy, quietly.
"universal": Make the names unique and syntactic.
"universal_quiet": Repair with the universal strategy, quietly.
A function: Apply custom name repair (e.g., name_repair = make.names for names in the style of base R).
A purrr-style anonymous function, see rlang::as_function().

This argument is passed on as repair to vctrs::vec_as_names(). See there for more details on these terms and the strategies used to enforce them.
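Two of the strategies above can be compared on a small duplicated-name input (a sketch using made-up literal data):

```r
library(readr)

dup <- I("x,x\n1,2")

# "unique_quiet": deduplicate with ...1/...2 suffixes, without a message
u <- read_csv(dup, name_repair = "unique_quiet", show_col_types = FALSE)
names(u)  # "x...1" "x...2"

# "minimal": leave the duplicate names exactly as they are
m <- read_csv(dup, name_repair = "minimal", show_col_types = FALSE)
names(m)  # "x" "x"
```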
num_threads
The number of processing threads to use for initial parsing and lazy reading of data. If your data contains newlines within fields the parser should automatically detect this and fall back to using one thread only. However if you know your file has newlines within quoted fields it is safest to set num_threads = 1 explicitly.
progress
Display a progress bar? By default it will only display in an interactive session and not while knitting a document. The automatic progress bar can be disabled by setting option readr.show_progress to FALSE.
show_col_types
If FALSE, do not show the guessed column types. If TRUE always show the column types, even if they are supplied. If NULL (the default) only show the column types if they are not explicitly supplied by the col_types argument.
skip_empty_rows
Should blank rows be ignored altogether? i.e. If this option is TRUE then blank rows will not be represented at all. If it is FALSE then they will be represented by NA values in all the columns.
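The two behaviours can be sketched on a file with one blank row (made-up literal data):

```r
library(readr)

txt <- I("x,y\n1,2\n\n3,4")

# Default: the blank row is dropped entirely
n_default <- nrow(read_csv(txt, show_col_types = FALSE))

# skip_empty_rows = FALSE: the blank row becomes a row of NAs
n_kept <- nrow(read_csv(txt, skip_empty_rows = FALSE, show_col_types = FALSE))
```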
lazy
Read values lazily? By default, this is FALSE, because there are special considerations when reading a file lazily that have tripped up some users. Specifically, things get tricky when reading and then writing back into the same file. But, in general, lazy reading (lazy = TRUE) has many benefits, especially for interactive use and when your downstream work only involves a subset of the rows or columns.

Learn more in should_read_lazy() and in the documentation for the altrep argument of vroom::vroom().
Examples
# Input sources -------------------------------------------------------------
# Read from a path
read_csv(readr_example("mtcars.csv"))
read_csv(readr_example("mtcars.csv.zip"))
read_csv(readr_example("mtcars.csv.bz2"))
if (FALSE) {
  # Including remote paths
  read_csv("https://github.com/tidyverse/readr/raw/main/inst/extdata/mtcars.csv")
}

# Read from multiple file paths at once
continents <- c("africa", "americas", "asia", "europe", "oceania")
filepaths <- vapply(
  paste0("mini-gapminder-", continents, ".csv"),
  FUN = readr_example,
  FUN.VALUE = character(1)
)
read_csv(filepaths, id = "file")

# Or directly from a string with `I()`
read_csv(I("x,y\n1,2\n3,4"))

# Column selection ----------------------------------------------------------
# Pass column names or indexes directly to select them
read_csv(readr_example("chickens.csv"), col_select = c(chicken, eggs_laid))
read_csv(readr_example("chickens.csv"), col_select = c(1, 3:4))

# Or use the selection helpers
read_csv(
  readr_example("chickens.csv"),
  col_select = c(starts_with("c"), last_col())
)

# You can also rename specific columns
read_csv(
  readr_example("chickens.csv"),
  col_select = c(egg_yield = eggs_laid, everything())
)

# Column types --------------------------------------------------------------
# By default, readr guesses the columns types, looking at `guess_max` rows.
# You can override with a compact specification:
read_csv(I("x,y\n1,2\n3,4"), col_types = "dc")

# Or with a list of column types:
read_csv(I("x,y\n1,2\n3,4"), col_types = list(col_double(), col_character()))

# If there are parsing problems, you get a warning, and can extract
# more details with problems()
y <- read_csv(I("x\n1\n2\nb"), col_types = list(col_double()))
y
problems(y)

# Column names --------------------------------------------------------------
# By default, readr duplicate name repair is noisy
read_csv(I("x,x\n1,2\n3,4"))

# Same default repair strategy, but quiet
read_csv(I("x,x\n1,2\n3,4"), name_repair = "unique_quiet")

# There's also a global option that controls verbosity of name repair
withr::with_options(
  list(rlib_name_repair_verbosity = "quiet"),
  read_csv(I("x,x\n1,2\n3,4"))
)

# Or use "minimal" to turn off name repair
read_csv(I("x,x\n1,2\n3,4"), name_repair = "minimal")

# File types ----------------------------------------------------------------
read_csv(I("a,b\n1.0,2.0"))
read_csv2(I("a;b\n1,0;2,0"))
read_tsv(I("a\tb\n1.0\t2.0"))
read_delim(I("a|b\n1.0|2.0"), delim = "|")