Class DataManager

java.lang.Object
org.deidentifier.arx.framework.data.DataManager
Direct Known Subclasses:
DataManagerSubset

public class DataManager extends Object
Holds all data needed for the anonymization process.
  • Constructor Details

    • DataManager

      public DataManager(String[] header, DataMatrix data, Dictionary dictionary, DataDefinition definition, Map<String,DistributionAggregateFunction> functions, ARXConfiguration config)
      Creates a new data manager from pre-encoded data.
      Parameters:
      header -
      data -
      dictionary -
      definition -
      functions -
      config -
    • DataManager

      protected DataManager(Data dataAnalyzed, Data dataGeneralized, Data dataInput, DataDefinition definition, DomainShare[] shares, String[] header, GeneralizationHierarchy[] hierarchiesGeneralized, GeneralizationHierarchy[] hierarchiesAnalyzed, int[] generalizationLevelsMinimum, int[] generalizationLevelsMaximum, DataAggregationInformation microaggregationData)
      Creates a projected instance.
      Parameters:
      dataAnalyzed -
      dataGeneralized -
      dataInput -
      definition -
      shares -
      header -
      hierarchiesGeneralized -
      hierarchiesAnalyzed -
      generalizationLevelsMinimum -
      generalizationLevelsMaximum -
      microaggregationData -
  • Method Details

    • getDataAnalyzed

      public Data getDataAnalyzed()
      Returns the input data that will be analyzed.
      Returns:
      the data
    • getDataGeneralized

      public Data getDataGeneralized()
      Returns the input data that will be generalized.
      Returns:
      the data
    • getDataInput

      public Data getDataInput()
      Returns the input data.
      Returns:
      the data
    • getDistribution

      public double[] getDistribution(DataMatrix dataMatrix, int index, int distinctValues)
      Returns the distribution of the attribute at the given index in the data matrix.
      Parameters:
      dataMatrix -
      index -
      distinctValues -
      Returns:
    • getDistribution

      public double[] getDistribution(String attribute)
      Returns the distribution of the given sensitive attribute in the original dataset. Required for t-closeness.
      Parameters:
      attribute -
      Returns:
      distribution
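
      A minimal sketch of what such a distribution might look like, assuming it is the relative frequency of each dictionary-encoded value in one column. The class DistributionSketch and its method are hypothetical illustrations that work on a plain int[] column. They are not part of ARX and do not use DataMatrix.

```java
// Hypothetical illustration, not ARX code: relative frequencies of encoded values.
final class DistributionSketch {

    /**
     * Assumption: values in the column are dictionary codes in the range
     * 0..distinctValues-1, mirroring the "distinctValues" parameter above.
     */
    static double[] distribution(int[] column, int distinctValues) {
        double[] dist = new double[distinctValues];
        if (column.length == 0) {
            return dist; // no rows: all frequencies stay at zero
        }
        // Count occurrences of each code
        for (int code : column) {
            dist[code]++;
        }
        // Normalize counts to relative frequencies
        for (int i = 0; i < dist.length; i++) {
            dist[i] /= column.length;
        }
        return dist;
    }

    public static void main(String[] args) {
        double[] dist = distribution(new int[] {0, 1, 1, 2}, 3);
        System.out.println(java.util.Arrays.toString(dist));
    }
}
```

      A distribution of this kind over the whole dataset is the reference against which t-closeness compares the distribution within each equivalence class.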
    • getDomainShares

      public DomainShare[] getDomainShares()
      Returns the domain shares for all generalized quasi-identifiers.
      Returns:
    • getDomainSharesReliable

      public DomainShareReliable[] getDomainSharesReliable()
      Returns the reliable domain shares for all generalized quasi-identifiers.
      Returns:
    • getHeader

      public String[] getHeader()
      Returns the original data header.
      Returns:
    • getHierachiesHeights

      public int[] getHierachiesHeights()
      Returns the heights of the hierarchies used for generalizing quasi-identifiers.
      Returns:
    • getHierarchies

      public GeneralizationHierarchy[] getHierarchies()
      Returns the generalization hierarchies for the quasi-identifiers.
      Returns:
      the hierarchies
    • getHierarchiesMaxLevels

      public int[] getHierarchiesMaxLevels()
      Returns the maximum generalization levels.
      Returns:
      the maximum level for each QI
    • getHierarchiesMinLevels

      public int[] getHierarchiesMinLevels()
      Returns the minimum generalization levels.
      Returns:
    • getAggregationInformation

      public DataAggregationInformation getAggregationInformation()
      Returns the data that configures microaggregation.
      Returns:
    • getOrder

      public int[] getOrder(String attribute)
      Returns the order of the given sensitive attribute in the original dataset. Required for t-closeness.
      Parameters:
      attribute -
      Returns:
      the order
    • getSubsetInstance

      public DataManager getSubsetInstance(RowSet rowset)
      Returns an instance of this data manager that is projected onto the given row set.
      Parameters:
      rowset -
      Returns:
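
      The idea of projecting data onto a row set can be sketched as follows. This is an illustration only: it models the row set as a java.util.BitSet and the data as an int[][], both of which are assumptions made for this example. The class SubsetProjectionSketch is hypothetical and does not reflect how RowSet or DataManagerSubset are implemented.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

// Hypothetical illustration, not ARX code: keep only the rows selected by a bit set.
final class SubsetProjectionSketch {

    /** Returns copies of the rows whose index is set in "selected", in original order. */
    static int[][] project(int[][] rows, BitSet selected) {
        List<int[]> kept = new ArrayList<>();
        for (int row = 0; row < rows.length; row++) {
            if (selected.get(row)) {
                kept.add(rows[row].clone()); // copy so the projection is independent
            }
        }
        return kept.toArray(new int[0][]);
    }

    public static void main(String[] args) {
        int[][] data = { {1, 10}, {2, 20}, {3, 30} };
        BitSet subset = new BitSet();
        subset.set(0);
        subset.set(2);
        System.out.println(java.util.Arrays.deepToString(project(data, subset)));
    }
}
```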
    • getTree

      public int[] getTree(DataMatrix data, int index, int[][] hierarchy)
      Returns a tree for the attribute at the given index in the data matrix, using the given hierarchy. The resulting tree can be used to calculate the earth mover's distance with hierarchical ground distance.
      Parameters:
      data -
      index -
      hierarchy -
      Returns:
      tree
    • getTree

      public int[] getTree(String attribute)
      Returns the tree for the given sensitive attribute, if a generalization hierarchy is associated with it. The resulting tree can be used to calculate the earth mover's distance with hierarchical ground distance.
      Parameters:
      attribute -
      Returns:
      tree
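
      To illustrate what an earth mover's distance with hierarchical ground distance computes, here is a minimal sketch. It does not use the int[] encoding returned by getTree, whose layout is not described on this page. Instead it assumes a simple parent array in which the leaves come first and every child has a smaller index than its parent. The class HierarchicalEmdSketch is hypothetical, and the cost rule (node height divided by tree height, times the mass that has to be exchanged among a node's children) follows the usual definition from the t-closeness literature.

```java
// Hypothetical illustration, not ARX code and not ARX's tree encoding.
final class HierarchicalEmdSketch {

    /**
     * parent[i] is the parent of node i, or -1 for the root.
     * Assumptions: nodes 0..p.length-1 are the leaves, and every child
     * has a smaller index than its parent.
     */
    static double distance(int[] parent, double[] p, double[] q) {
        if (p.length != q.length) {
            throw new IllegalArgumentException("p and q must have the same length");
        }
        int n = parent.length;
        double[] extra = new double[n];    // surplus (p - q) accumulated below each node
        double[] posExtra = new double[n]; // total surplus among a node's children
        double[] negExtra = new double[n]; // total deficit among a node's children
        int[] height = new int[n];         // leaves have height 0
        for (int leaf = 0; leaf < p.length; leaf++) {
            extra[leaf] = p[leaf] - q[leaf];
        }
        int treeHeight = 0;
        // Children precede parents, so a node's values are final when it is visited
        for (int node = 0; node < n; node++) {
            treeHeight = Math.max(treeHeight, height[node]);
            int par = parent[node];
            if (par < 0) {
                continue; // root has no parent to update
            }
            extra[par] += extra[node];
            if (extra[node] > 0) {
                posExtra[par] += extra[node];
            } else {
                negExtra[par] -= extra[node];
            }
            height[par] = Math.max(height[par], height[node] + 1);
        }
        if (treeHeight == 0) {
            return 0d; // a single node: nothing to move
        }
        double cost = 0d;
        for (int node = 0; node < n; node++) {
            // Mass exchanged among the children of this node travels height/treeHeight
            cost += ((double) height[node] / treeHeight)
                    * Math.min(posExtra[node], negExtra[node]);
        }
        return cost;
    }

    public static void main(String[] args) {
        // Leaves 0..3, inner nodes 4 = {0,1}, 5 = {2,3}, root 6 = {4,5}
        int[] parent = {4, 4, 5, 5, 6, 6, -1};
        double[] p = {0.5, 0.5, 0.0, 0.0};
        double[] q = {0.25, 0.25, 0.25, 0.25};
        System.out.println(distance(parent, p, q));
    }
}
```

      In this sketch, mass moved between siblings is cheaper than mass moved across the root, which is the property that distinguishes the hierarchical ground distance from an equal-distance one.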
    • getDataDefinition

      protected DataDefinition getDataDefinition()
      Returns the data definition.
      Returns: