luigi.contrib.hdfs.format module¶
- class luigi.contrib.hdfs.format.HdfsReadPipe(path)[source]¶
Bases:
InputPipeProcessWrapper
Initializes a InputPipeProcessWrapper instance.
- Parameters:
command – a subprocess.Popen instance with stdin=input_pipe and stdout=subprocess.PIPE. Alternatively, just its args argument as a convenience.
- class luigi.contrib.hdfs.format.HdfsAtomicWritePipe(path)[source]¶
Bases:
OutputPipeProcessWrapper
File like object for writing to HDFS
The referenced file is first written to a temporary location and then renamed to final location on close(). If close() isn’t called the temporary file will be cleaned up when this object is garbage collected
TODO: if this is buggy, change it so it first writes to a local temporary file and then uploads it on completion
- class luigi.contrib.hdfs.format.HdfsAtomicWriteDirPipe(path, data_extension='')[source]¶
Bases:
OutputPipeProcessWrapper
Writes a data<data_extension> file to a directory at <path>.
- class luigi.contrib.hdfs.format.PlainFormat[source]¶
Bases:
Format
- input = 'bytes'¶
- output = 'hdfs'¶