mirror/duckdb: DuckDB is an analytical in-process SQL database management system

mirror of https://github.com/duckdb/duckdb.git synced 2026-07-27 03:54:16 +00:00

DuckDB is an analytical in-process SQL database management system http://www.duckdb.org

analytics database embedded-database olap sql

C++ 77.7%
sql 16.9%
Python 2.7%
C 1.5%
Swift 0.5%
Other 0.5%

Find a file

Mark 7316202df1 Arena allocate the memory for each aggregate state in TDigest (#24125 ) As previously memory was not tracked in APPROX_QUANTILE, which, in the case of exceeding memory on the host kills the process, instead of failing gracefully. This is solved by allocating the memory using an arena.		2026-07-24 18:53:02 +00:00
.github	Make benchmark runner output/regressions easier to read (#24134 )	2026-07-24 18:51:54 +00:00
api_spec	Replace the C API spec and code generation	2026-07-24 16:10:58 +02:00
benchmark	fixed typo	2026-07-24 11:01:06 +02:00
data	Storage: validate deprecated FSST metadata	2026-07-21 16:35:10 +08:00
examples	tidy	2026-06-19 15:09:34 +02:00
extension	Arena allocate the memory for each aggregate state in TDigest (#24125 )	2026-07-24 18:53:02 +00:00
logo	README: Display different logo for light/dark mode	2024-03-26 18:03:50 +01:00
scripts	Make benchmark runner output/regressions easier to read (#24134 )	2026-07-24 18:51:54 +00:00
src	Replace the C API spec and code generation	2026-07-24 16:10:58 +02:00
test	Arena allocate the memory for each aggregate state in TDigest (#24125 )	2026-07-24 18:53:02 +00:00
third_party	change std::vectors to arena_vectors in tdigest	2026-07-24 13:48:49 +02:00
tools	Remove Julia client in favor of duckdb/DuckDB.jl	2026-07-24 14:07:12 +02:00
.clang-format	remove empty lines at the start of blocks	2025-10-28 09:00:51 +01:00
.clang-tidy	Make clang-tidy checks easier to read	2026-04-20 10:49:45 +02:00
.clangd	Disable clang-tidy checks for some excluded dirs	2026-04-20 10:49:19 +02:00
.codecov.yml	second round of renames	2023-08-21 15:42:32 +02:00
.editorconfig	removed some more references to r client	2023-09-05 07:42:12 +02:00
.gitattributes	Let GitHub render *.test files as SQL	2025-03-06 10:10:24 +01:00
.gitignore	rework task scheduler, split into multiple pools and queues	2026-05-28 17:03:48 +02:00
.sanitizer-leak-suppressions.txt	And text pool	2025-01-27 12:57:06 +01:00
.sanitizer-thread-suppressions.txt	remove AddToEvictionQueue tsan suppression	2026-06-15 13:57:19 +02:00
AGENTS.md	AGENTS.md: Drop in-process	2026-06-18 06:57:01 +02:00
AI_POLICY.md	add ai policy	2026-06-03 11:21:13 +02:00
CITATION.cff	Update CITATION.cff	2021-07-28 11:31:36 +02:00
CLAUDE.md	create AGENTS.md and link CLAUDE.md	2026-06-04 22:41:26 -04:00
CMakeLists.txt	Get tag from git describe in python script	2026-07-23 11:51:07 +02:00
CODE_OF_CONDUCT.md	fix typos and spelling errors	2023-07-13 17:41:59 -04:00
CONTRIBUTING.md	Add note on enums	2026-06-26 08:34:29 +02:00
Doxyfile	Fix typos in errors and comments	2026-04-22 08:53:31 +02:00
DuckDBConfig.cmake.in	Don't require icu unless extension is enabled	2025-11-27 07:49:14 +01:00
DuckDBConfigVersion.cmake.in	Normalize version number in CMake version config	2026-05-28 23:18:43 +08:00
LICENSE	Update copyright year	2026-01-26 18:01:34 +01:00
Makefile	Replace the C API spec and code generation	2026-07-24 16:10:58 +02:00
README.md	Update Labs URL	2026-05-27 13:28:57 +02:00
SECURITY.md	Update Labs URL	2026-05-27 13:28:57 +02:00

README.md

DuckDB

DuckDB is a high-performance analytical database system. It is designed to be fast, reliable, portable, and easy to use. DuckDB provides a rich SQL dialect with support far beyond basic SQL. DuckDB supports arbitrary and nested correlated subqueries, window functions, collations, complex types (arrays, structs, maps), and several extensions designed to make SQL easier to use.

DuckDB is available as a standalone CLI application and has clients for Python, R, Java, Wasm, etc., with deep integrations with packages such as pandas and dplyr.

For more information on using DuckDB, please refer to the DuckDB documentation.

Installation

If you want to install DuckDB, please see our installation page for instructions.

Data Import

For CSV files and Parquet files, data import is as simple as referencing the file in the FROM clause:

SELECT * FROM 'myfile.csv';
SELECT * FROM 'myfile.parquet';

Refer to our Data Import section for more information.

SQL Reference

The documentation contains a SQL introduction and reference.

Development

For development, DuckDB requires CMake, Python 3 and a C++17 compliant compiler. In the root directory, run make to compile the sources. For development, use make debug to build a non-optimized debug version. You should run make unit and make allunit to verify that your version works properly after making changes. To test performance, you can run BUILD_BENCHMARK=1 BUILD_TPCH=1 make and then perform several standard benchmarks from the root directory by executing ./build/release/benchmark/benchmark_runner. The details of benchmarks are in our Benchmark Guide.

Please also refer to our Build Guide and Contribution Guide.

Support

See the Support Options page and the dedicated endoflife.date page.