JSON
arro3.io.infer_json_schema
Infer the schema of a JSON file by reading records from the start of the buffer, with max_records controlling the maximum number of records to read.
Parameters:
- file (IO[bytes] | Path | str) – The input JSON path or buffer.
- max_records (int | None, default: None) – The maximum number of records to read to infer the schema. If not provided, the entire file is read to deduce field types.
Returns:
- Schema – The inferred Arrow schema.
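To make the effect of max_records concrete, here is a toy model of inference over the first n records, written with the standard library only. It is not how arro3 implements inference: the helper infer_field_types is hypothetical, it reports Python type names rather than Arrow types, and it assumes newline-delimited input, which the text above does not state.

```python
import io
import json


def infer_field_types(buffer, max_records=None):
    """Toy model: map each field name to the set of Python type names seen.

    Hypothetical helper for illustration only; not part of arro3.
    """
    seen = {}
    for index, line in enumerate(buffer):
        # Stop once the record cap is reached; None means read everything.
        if max_records is not None and index >= max_records:
            break
        record = json.loads(line)
        for name, value in record.items():
            seen.setdefault(name, set()).add(type(value).__name__)
    return seen


data = b'{"id": 1}\n{"id": 2, "tag": "x"}\n{"id": 3.5}\n'

# Only the first record is inspected, so "tag" and the float "id" are missed.
partial = infer_field_types(io.BytesIO(data), max_records=1)

# With no cap, every record contributes to the result.
full = infer_field_types(io.BytesIO(data))
```

The sketch suggests the trade-off: a small max_records is cheaper, but fields or value types that first appear later in the file will not be reflected in the result.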
arro3.io.read_json
read_json(
file: IO[bytes] | Path | str,
schema: ArrowSchemaExportable,
*,
batch_size: int | None = None
) -> RecordBatchReader
Read JSON data with a known schema into Arrow.
Parameters:
- file (IO[bytes] | Path | str) – The JSON file or buffer to read from.
- schema (ArrowSchemaExportable) – The Arrow schema representing the JSON data.
- batch_size (int | None, default: None) – The batch size (number of records to load at one time).
Returns:
- RecordBatchReader – An Arrow RecordBatchReader.
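The idea behind batch_size, loading a bounded number of records at a time instead of the whole input, can be sketched with the standard library. This is only an illustration: iter_batches is a hypothetical helper, it yields plain Python lists rather than Arrow record batches, and it assumes newline-delimited input.

```python
import io
import json
from itertools import islice


def iter_batches(buffer, batch_size):
    """Yield lists of at most batch_size parsed records.

    Hypothetical helper for illustration only; not part of arro3.
    """
    lines = iter(buffer)
    while True:
        # Pull up to batch_size lines without reading the rest of the input.
        batch = [json.loads(line) for line in islice(lines, batch_size)]
        if not batch:
            return
        yield batch


data = b'{"id": 1}\n{"id": 2}\n{"id": 3}\n'
batch_lengths = [len(batch) for batch in iter_batches(io.BytesIO(data), batch_size=2)]
```

A reader built this way keeps at most batch_size records in memory per step; the last batch may be shorter than the others.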
arro3.io.write_json
write_json(
data: ArrowStreamExportable | ArrowArrayExportable,
file: IO[bytes] | Path | str,
*,
explicit_nulls: bool | None = None
) -> None
Write Arrow data to JSON.
By default, the writer omits keys whose values are null, for backward compatibility.
Parameters:
- data (ArrowStreamExportable | ArrowArrayExportable) – The Arrow Table, RecordBatchReader, or RecordBatch to write.
- file (IO[bytes] | Path | str) – The output file or buffer to write to.
- explicit_nulls (bool | None, default: None) – Whether to write keys with null values or omit them. By default, keys with null values are omitted.
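The difference between the two explicit_nulls behaviours described above can be illustrated with the standard json module. This is a sketch of the output shape only; to_json_rows is a hypothetical helper and does not call arro3, and the exact whitespace arro3 writes may differ.

```python
import json

rows = [{"id": 1, "name": "a"}, {"id": 2, "name": None}]


def to_json_rows(rows, explicit_nulls=False):
    """Serialize each row, optionally dropping keys whose value is None.

    Hypothetical helper for illustration only; not part of arro3.
    """
    if explicit_nulls:
        # Keep every key; None is written as JSON null.
        return [json.dumps(row) for row in rows]
    # Default behaviour in this sketch: omit keys with null values.
    return [
        json.dumps({key: value for key, value in row.items() if value is not None})
        for row in rows
    ]


skipped = to_json_rows(rows)
kept = to_json_rows(rows, explicit_nulls=True)
```

With nulls skipped, a consumer cannot distinguish a missing key from a null value, which matters if the output is later read back without a known schema.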
arro3.io.write_ndjson
write_ndjson(
data: ArrowStreamExportable | ArrowArrayExportable,
file: IO[bytes] | Path | str,
*,
explicit_nulls: bool | None = None
) -> None
Write Arrow data to newline-delimited JSON.
By default, the writer omits keys whose values are null, for backward compatibility.
Parameters:
- data (ArrowStreamExportable | ArrowArrayExportable) – The Arrow Table, RecordBatchReader, or RecordBatch to write.
- file (IO[bytes] | Path | str) – The output file or buffer to write to.
- explicit_nulls (bool | None, default: None) – Whether to write keys with null values or omit them. By default, keys with null values are omitted.
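For readers unfamiliar with the format, newline-delimited JSON stores one JSON value per line. The sketch below builds and re-parses such text with the standard library; it does not call arro3, and the byte-level output of write_ndjson (for example, its whitespace) is not specified here.

```python
import json

rows = [{"id": 1}, {"id": 2}]

# One JSON object per line, each terminated by a newline.
ndjson_text = "".join(json.dumps(row) + "\n" for row in rows)

# Each line is a complete JSON document, so it can be parsed on its own.
parsed = [json.loads(line) for line in ndjson_text.splitlines()]
```

Because every line stands alone, a consumer can process the file incrementally instead of loading a single large document.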