Package com.linkedin.venice.hadoop
Interface InputDataInfoProvider
-
- All Superinterfaces:
java.lang.AutoCloseable,java.io.Closeable
- All Known Implementing Classes:
DefaultInputDataInfoProvider,KafkaInputDataInfoProvider
public interface InputDataInfoProvider extends java.io.CloseableThis interface lets users get input data information
-
-
Nested Class Summary
Nested Classes Modifier and Type Interface Description static classInputDataInfoProvider.InputDataInfoA POJO that contains input data information (schema information and input data file size)
-
Method Summary
All Methods Static Methods Instance Methods Abstract Methods Modifier and Type Method Description org.apache.avro.SchemaextractAvroSubSchema(org.apache.avro.Schema origin, java.lang.String fieldName)longgetInputLastModificationTime(java.lang.String inputUri)voidinitZstdConfig(int numFiles)static voidloadZstdTrainingSamples(VeniceRecordIterator recordIterator, PushJobZstdConfig pushJobZstdConfig)This function loads training samples from recordReader abstraction for building the Zstd dictionary.byte[]trainZstdDictionary()InputDataInfoProvider.InputDataInfovalidateInputAndGetInfo(java.lang.String inputUri)
-
-
-
Method Detail
-
validateInputAndGetInfo
InputDataInfoProvider.InputDataInfo validateInputAndGetInfo(java.lang.String inputUri) throws java.lang.Exception
- Throws:
java.lang.Exception
-
initZstdConfig
void initZstdConfig(int numFiles)
-
loadZstdTrainingSamples
static void loadZstdTrainingSamples(VeniceRecordIterator recordIterator, PushJobZstdConfig pushJobZstdConfig)
This function loads training samples from recordReader abstraction for building the Zstd dictionary.- Parameters:
recordIterator- The data accessor of input records.
-
trainZstdDictionary
byte[] trainZstdDictionary()
-
extractAvroSubSchema
org.apache.avro.Schema extractAvroSubSchema(org.apache.avro.Schema origin, java.lang.String fieldName)
-
getInputLastModificationTime
long getInputLastModificationTime(java.lang.String inputUri) throws java.io.IOException- Throws:
java.io.IOException
-
-