Package org.deidentifier.arx.aggregates
Class StatisticsBuilder
java.lang.Object
org.deidentifier.arx.aggregates.StatisticsBuilder
A class offering basic descriptive statistics about data handles.
-
Constructor Summary
Constructors
-
Method Summary
Modifier and Type, Method, Description
StatisticsClassification getClassificationPerformance(String[] features, String clazz, ARXClassificationConfiguration<?> config) - Creates a new set of statistics for the given classification task.
StatisticsClassification getClassificationPerformance(String[] features, String clazz, ARXClassificationConfiguration<?> config, ARXFeatureScaling scaling) - Creates a new set of statistics for the given classification task.
StatisticsClassification getClassificationPerformance(String clazz, ARXClassificationConfiguration<?> config) - Creates a new set of statistics for the given classification task.
StatisticsContingencyTable getContingencyTable(int column1, boolean orderFromDefinition1, int column2, boolean orderFromDefinition2) - Returns a contingency table for the given columns.
StatisticsContingencyTable getContingencyTable(int column1, int column2) - Returns a contingency table for the given columns.
StatisticsContingencyTable getContingencyTable(int column1, int size1, boolean orderFromDefinition1, int column2, int size2, boolean orderFromDefinition2) - Returns a contingency table for the given columns.
StatisticsContingencyTable getContingencyTable(int column1, int size1, int column2, int size2) - Returns a contingency table for the given columns.
StatisticsContingencyTable getContingencyTable(int column1, int size1, String[][] hierarchy1, int column2, int size2, String[][] hierarchy2) - Returns a contingency table for the given columns.
StatisticsContingencyTable getContingencyTable(int column1, String[][] hierarchy1, int column2, String[][] hierarchy2) - Returns a contingency table for the given columns.
String[] getDistinctValues(int column) - Returns the distinct set of data items from the given column.
String[] getDistinctValuesOrdered(int column) - Returns an ordered list of the distinct set of data items from the given column.
String[] getDistinctValuesOrdered(int column, boolean orderFromDefinition) - Returns an ordered list of the distinct set of data items from the given column.
String[] getDistinctValuesOrdered(int column, String[][] hierarchy) - Returns an ordered list of the distinct set of data items from the given column.
getEquivalenceClassStatistics() - Returns statistics about the equivalence classes.
StatisticsFrequencyDistribution getFrequencyDistribution(int column) - Returns a frequency distribution for the values in the given column.
StatisticsFrequencyDistribution getFrequencyDistribution(int column, boolean orderFromDefinition) - Returns a frequency distribution for the values in the given column.
StatisticsFrequencyDistribution getFrequencyDistribution(int column, String[][] hierarchy) - Returns a frequency distribution for the values in the given column.
getInterruptibleInstance() - Returns an interruptible instance of this object.
getQualityStatistics() - Returns data quality according to various models.
getQualityStatistics(Set<String> qis) - Returns data quality according to various models.
getQualityStatistics(DataHandle output) - Returns data quality according to various models.
getQualityStatistics(DataHandle output, Set<String> qis) - Returns data quality according to various models.
<T> Map<String, StatisticsSummary<?>> getSummaryStatistics(boolean listwiseDeletion) - Returns summary statistics for all attributes.
-
Constructor Details
-
StatisticsBuilder
Creates a new instance.
Parameters:
handle
-
-
Method Details
-
getClassificationPerformance
public StatisticsClassification getClassificationPerformance(String clazz, ARXClassificationConfiguration<?> config) throws ParseException
Creates a new set of statistics for the given classification task.
Parameters:
clazz - The class attribute
config - The configuration
Throws:
ParseException
-
getClassificationPerformance
public StatisticsClassification getClassificationPerformance(String[] features, String clazz, ARXClassificationConfiguration<?> config) throws ParseException
Creates a new set of statistics for the given classification task.
Parameters:
features - The feature attributes
clazz - The class attribute
config - The configuration
Throws:
ParseException
-
getClassificationPerformance
public StatisticsClassification getClassificationPerformance(String[] features, String clazz, ARXClassificationConfiguration<?> config, ARXFeatureScaling scaling) throws ParseException
Creates a new set of statistics for the given classification task.
Parameters:
features - The feature attributes
clazz - The class attribute
config - The configuration
scaling - The feature scaling
Throws:
ParseException
-
getContingencyTable
public StatisticsContingencyTable getContingencyTable(int column1, boolean orderFromDefinition1, int column2, boolean orderFromDefinition2)
Returns a contingency table for the given columns.
Parameters:
column1 - The first column
orderFromDefinition1 - Indicates whether the order that should be assumed for string data items can (and should) be derived from the hierarchy provided in the data definition (if any)
column2 - The second column
orderFromDefinition2 - Indicates whether the order that should be assumed for string data items can (and should) be derived from the hierarchy provided in the data definition (if any)
-
getContingencyTable
public StatisticsContingencyTable getContingencyTable(int column1, int column2)
Returns a contingency table for the given columns. This method assumes that the order of string data items will be derived from the hierarchies provided in the data definition (if any).
Parameters:
column1 - The first column
column2 - The second column
-
getContingencyTable
public StatisticsContingencyTable getContingencyTable(int column1, int size1, boolean orderFromDefinition1, int column2, int size2, boolean orderFromDefinition2)
Returns a contingency table for the given columns.
Parameters:
column1 - The first column
size1 - The maximal size in this dimension
orderFromDefinition1 - Indicates whether the order that should be assumed for string data items can (and should) be derived from the hierarchy provided in the data definition (if any)
column2 - The second column
size2 - The maximal size in this dimension
orderFromDefinition2 - Indicates whether the order that should be assumed for string data items can (and should) be derived from the hierarchy provided in the data definition (if any)
-
getContingencyTable
public StatisticsContingencyTable getContingencyTable(int column1, int size1, int column2, int size2)
Returns a contingency table for the given columns. This method assumes that the order of string data items can (and should) be derived from the hierarchies provided in the data definition (if any).
Parameters:
column1 - The first column
size1 - The maximal size in this dimension
column2 - The second column
size2 - The maximal size in this dimension
-
getContingencyTable
public StatisticsContingencyTable getContingencyTable(int column1, int size1, String[][] hierarchy1, int column2, int size2, String[][] hierarchy2)
Returns a contingency table for the given columns. The order of string data items is derived from the provided hierarchies.
Parameters:
column1 - The first column
size1 - The maximal size in this dimension
hierarchy1 - The hierarchy for the first column, may be null
column2 - The second column
size2 - The maximal size in this dimension
hierarchy2 - The hierarchy for the second column, may be null
-
getContingencyTable
public StatisticsContingencyTable getContingencyTable(int column1, String[][] hierarchy1, int column2, String[][] hierarchy2)
Returns a contingency table for the given columns. The order of string data items is derived from the provided hierarchies.
Parameters:
column1 - The first column
hierarchy1 - The hierarchy for the first column, may be null
column2 - The second column
hierarchy2 - The hierarchy for the second column, may be null
-
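As background for the getContingencyTable overloads, the following is a minimal standalone sketch of what a contingency table is in the general statistical sense: a count of how often each pair of values co-occurs in two columns. It does not call ARX and is not ARX's implementation. The class name ContingencySketch, the method count, and the use of a two-element list as the map key are all hypothetical choices made for illustration.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Standalone illustration only; no ARX classes are used here.
class ContingencySketch {

    /**
     * Counts how often each (value1, value2) pair occurs in the two given
     * columns. The key is a two-element list [value1, value2].
     */
    static Map<List<String>, Integer> count(String[][] rows, int column1, int column2) {
        Map<List<String>, Integer> counts = new LinkedHashMap<>();
        for (String[] row : rows) {
            List<String> key = Arrays.asList(row[column1], row[column2]);
            counts.merge(key, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up example rows: age, gender
        String[][] rows = {
            {"34", "male"},
            {"45", "female"},
            {"34", "male"},
            {"70", "female"},
            {"34", "female"},
        };
        Map<List<String>, Integer> counts = count(rows, 1, 0);
        // Print each pair with its share of all rows
        for (Map.Entry<List<String>, Integer> e : counts.entrySet()) {
            double share = e.getValue() / (double) rows.length;
            System.out.println(e.getKey() + " -> " + share);
        }
    }
}
```

The size1 and size2 parameters documented above cap each dimension of the table; the sketch leaves the dimensions uncapped.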
getDistinctValues
public String[] getDistinctValues(int column)
Returns the distinct set of data items from the given column.
Parameters:
column - The column
-
getDistinctValuesOrdered
public String[] getDistinctValuesOrdered(int column)
Returns an ordered list of the distinct set of data items from the given column. This method assumes that the order of string data items can (and should) be derived from the hierarchy provided in the data definition (if any).
Parameters:
column - The column
-
getDistinctValuesOrdered
public String[] getDistinctValuesOrdered(int column, boolean orderFromDefinition)
Returns an ordered list of the distinct set of data items from the given column.
Parameters:
column - The column
orderFromDefinition - Indicates whether the order that should be assumed for string data items can (and should) be derived from the hierarchy provided in the data definition (if any)
-
getDistinctValuesOrdered
public String[] getDistinctValuesOrdered(int column, String[][] hierarchy)
Returns an ordered list of the distinct set of data items from the given column. This method assumes that the order of string data items can (and should) be derived from the provided hierarchy.
Parameters:
column - The column
hierarchy - The hierarchy, may be null
-
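To make the idea of deriving an order from a hierarchy concrete, here is a minimal standalone sketch. It does not call ARX, and the rules it uses are assumptions rather than documented behavior: it assumes that each hierarchy row starts with the original value in position 0, that the row index defines the rank of that value, that values missing from the hierarchy sort last, and that a null hierarchy falls back to natural string order. The class name OrderedDistinctSketch and the method distinctOrdered are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Comparator;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;

// Standalone illustration only; no ARX classes are used here.
class OrderedDistinctSketch {

    static String[] distinctOrdered(String[] column, String[][] hierarchy) {
        // Collect distinct values, keeping first-seen order
        List<String> values = new ArrayList<>(new LinkedHashSet<>(Arrays.asList(column)));
        if (hierarchy == null) {
            // Assumption: without a hierarchy, use natural string order
            Collections.sort(values);
        } else {
            // Assumption: row i of the hierarchy describes the value in hierarchy[i][0]
            Map<String, Integer> rank = new HashMap<>();
            for (int i = 0; i < hierarchy.length; i++) {
                rank.putIfAbsent(hierarchy[i][0], i);
            }
            // Stable sort: values not in the hierarchy keep their relative order at the end
            values.sort(Comparator.comparingInt(
                    (String v) -> rank.getOrDefault(v, Integer.MAX_VALUE)));
        }
        return values.toArray(new String[0]);
    }

    public static void main(String[] args) {
        String[] column = {"high", "low", "mid", "low"};
        String[][] hierarchy = {{"low", "*"}, {"mid", "*"}, {"high", "*"}};
        System.out.println(Arrays.toString(distinctOrdered(column, hierarchy))); // [low, mid, high]
        System.out.println(Arrays.toString(distinctOrdered(column, null)));      // [high, low, mid]
    }
}
```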
getEquivalenceClassStatistics
Returns statistics about the equivalence classes.
-
getFrequencyDistribution
public StatisticsFrequencyDistribution getFrequencyDistribution(int column)
Returns a frequency distribution for the values in the given column. This method assumes that the order of string data items can (and should) be derived from the hierarchy provided in the data definition (if any).
Parameters:
column - The column
-
getFrequencyDistribution
public StatisticsFrequencyDistribution getFrequencyDistribution(int column, boolean orderFromDefinition)
Returns a frequency distribution for the values in the given column.
Parameters:
column - The column
orderFromDefinition - Indicates whether the order that should be assumed for string data items should be derived from the hierarchy provided in the data definition (if any)
-
getFrequencyDistribution
public StatisticsFrequencyDistribution getFrequencyDistribution(int column, String[][] hierarchy)
Returns a frequency distribution for the values in the given column. The order of string data items is derived from the provided hierarchy.
Parameters:
column - The column
hierarchy - The hierarchy, may be null
-
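For readers unfamiliar with the term, a frequency distribution maps each distinct value of a column to its share of all entries. The sketch below shows that computation in plain Java. It does not call ARX and says nothing about the fields of StatisticsFrequencyDistribution. The class name FrequencySketch and the method relativeFrequencies are hypothetical, and the sorted TreeMap stands in for whatever ordering the real method applies.

```java
import java.util.Map;
import java.util.TreeMap;

// Standalone illustration only; no ARX classes are used here.
class FrequencySketch {

    /** Maps each distinct value to (occurrences / total number of entries). */
    static Map<String, Double> relativeFrequencies(String[] column) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String value : column) {
            counts.merge(value, 1, Integer::sum);
        }
        Map<String, Double> frequencies = new TreeMap<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            frequencies.put(e.getKey(), e.getValue() / (double) column.length);
        }
        return frequencies;
    }

    public static void main(String[] args) {
        String[] column = {"a", "b", "a", "a"};
        System.out.println(relativeFrequencies(column)); // {a=0.75, b=0.25}
    }
}
```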
getInterruptibleInstance
Returns an interruptible instance of this object.
-
getQualityStatistics
Returns data quality according to various models.
-
getQualityStatistics
Returns data quality according to various models. This is a special variant of the method that supports arbitrary user-defined outputs.
Parameters:
output
-
getQualityStatistics
Returns data quality according to various models. This is a special variant of the method that supports arbitrary user-defined outputs.
Parameters:
output
qis
-
getQualityStatistics
Returns data quality according to various models.
Parameters:
qis
-
getSummaryStatistics
public <T> Map<String, StatisticsSummary<?>> getSummaryStatistics(boolean listwiseDeletion)
Returns summary statistics for all attributes.
Parameters:
listwiseDeletion - A flag enabling list-wise deletion
-
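The listwiseDeletion flag refers to a general statistical technique: a record is excluded from every attribute's summary if any of its values is missing. The standalone sketch below contrasts that with skipping only the missing cell of the attribute being summarized. It does not call ARX. Modeling missing values as Java null, and the behavior shown when the flag is false, are assumptions of the sketch. The class name ListwiseDeletionSketch and its methods are hypothetical.

```java
// Standalone illustration only; no ARX classes are used here.
class ListwiseDeletionSketch {

    /** Returns true if any cell of the row is missing (null in this sketch). */
    static boolean hasMissing(Double[] row) {
        for (Double cell : row) {
            if (cell == null) {
                return true;
            }
        }
        return false;
    }

    /**
     * Mean of one column. With listwise deletion, a row is skipped if ANY of
     * its cells is missing; otherwise only if the cell in this column is missing.
     */
    static double mean(Double[][] rows, int column, boolean listwiseDeletion) {
        double sum = 0;
        int n = 0;
        for (Double[] row : rows) {
            boolean skip = listwiseDeletion ? hasMissing(row) : row[column] == null;
            if (skip) {
                continue;
            }
            sum += row[column];
            n++;
        }
        return n == 0 ? Double.NaN : sum / n;
    }

    public static void main(String[] args) {
        Double[][] rows = {
            {1.0, 10.0},
            {5.0, null}, // second attribute is missing
            {3.0, 30.0},
        };
        System.out.println(mean(rows, 0, true));  // 2.0: the middle row is dropped entirely
        System.out.println(mean(rows, 0, false)); // 3.0: the middle row still counts for column 0
    }
}
```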