@@ -8,10 +8,12 @@ to Kafka is created by transforming the record by means of
8
8
`Confluent avro-converter <https://github.com/confluentinc/schema-registry/tree/master/avro-converter>`__
9
9
API.
10
10
11
+ More information about properties of this file reader :ref:`here<config_options-filereaders-avro>`.
12
+
11
13
Parquet
12
14
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
13
15
14
- Read files with `Parquet <https://parquet.apache.org/>`__ format.
16
+ Reads files with `Parquet <https://parquet.apache.org/>`__ format.
15
17
16
18
The reader takes advantage of the Parquet-Avro API and uses the Parquet file
17
19
as if it were an Avro file, so the message sent to Kafka is built in the same
@@ -22,6 +24,8 @@ way as the Avro file reader does.
22
24
over and over again and has to seek the file, the performance
23
25
can be affected.
24
26
27
+ More information about properties of this file reader :ref:`here<config_options-filereaders-parquet>`.
28
+
25
29
SequenceFile
26
30
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
27
31
@@ -32,8 +36,7 @@ This reader can process this file format and build a Kafka message with the
32
36
key/value pair. These two values are named ``key`` and ``value`` in the message
33
37
by default but you can customize these field names.
34
38
35
- More information about properties of this file reader
36
- :ref:`here<config_options-filereaders-sequencefile>`.
39
+ More information about properties of this file reader :ref:`here<config_options-filereaders-sequencefile>`.
37
40
38
41
Text
39
42
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
@@ -44,6 +47,8 @@ Each line represents one record which will be in a field
44
47
named ``value`` in the message sent to Kafka by default but you can
45
48
customize these field names.
46
49
50
+ More information about properties of this file reader :ref:`here<config_options-filereaders-text>`.
51
+
47
52
Delimited text
48
53
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
49
54
@@ -56,3 +61,20 @@ Also, the token delimiter for columns is configurable.
56
61
57
62
More information about properties of this file reader :ref:`here<config_options-filereaders-delimited>`.
58
63
64
+ Agnostic
65
+ ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
66
+
67
+ This reader is a wrapper around the readers listed above.
68
+
69
+ It tries to read any kind of file format using an internal reader based on the file extension,
70
+ applying the proper one (Parquet, Avro, SequenceFile, Text or Delimited text). If no
71
+ extension matches, the Text file reader is applied.
72
+
73
+ Default extensions for each format:
74
+ * Parquet: .parquet
75
+ * Avro: .avro
76
+ * SequenceFile: .seq
77
+ * Delimited text: .tsv, .csv
78
+ * Text: any other file extension.
79
+
80
+ More information about properties of this file reader :ref:`here<config_options-filereaders-agnostic>`.
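
The extension-based selection described for the Agnostic reader can be illustrated with a minimal sketch. This is not the connector's implementation: the names ``DEFAULT_EXTENSIONS`` and ``select_reader`` are hypothetical, and the case-insensitive matching is an assumption made for the example. Only the extension-to-format mapping and the Text fallback come from the description above.

```python
import os

# Hypothetical mapping from file extension to reader name, mirroring the
# default extensions listed in the Agnostic section. Illustration only.
DEFAULT_EXTENSIONS = {
    "parquet": "Parquet",
    "avro": "Avro",
    "seq": "SequenceFile",
    "tsv": "Delimited text",
    "csv": "Delimited text",
}


def select_reader(path, extensions=DEFAULT_EXTENSIONS):
    """Return the name of the reader that would handle the given file path."""
    # splitext returns (root, ".ext"); drop the dot and normalise the case
    # (case-insensitive matching is an assumption of this sketch).
    extension = os.path.splitext(path)[1].lstrip(".").lower()
    # Fall back to the Text reader when the extension is unknown or missing.
    return extensions.get(extension, "Text")
```

Passing a different ``extensions`` dictionary to ``select_reader`` models the idea that the extensions associated with each format are defaults that can be overridden through the reader's configuration.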