This item is available under a Creative Commons License for non-commercial use only
Applied mathematics, Electrical and electronic engineering
While many existing formal concept analysis algorithms are efficient, they are typically unsuitable for distributed implementation. Taking the MapReduce (MR) framework as our inspiration we introduce a distributed approach for performing formal concept mining. Our method has its novelty in that we use a light-weight MapReduce runtime called Twister which is better suited to iterative algorithms than recent distributed approaches. First, we describe the theoretical foundations underpinning our distributed formal concept analysis approach. Second, we provide a representative exemplar of how a classic centralized algorithm can be implemented in a distributed fashion using our methodology: we modify Ganter’s classic algorithm by introducing a family of MR⋆ algorithms, namely MRGanter and MRGanter+ where the prefix denotes the algorithm’s lineage. To evaluate the factors that impact distributed algorithm performance, we compare our MR ∗ algorithms with the state-of-the-art. Experiments conducted on real datasets demonstrate that MRGanter+ is efficient, scalable and an appealing algorithm for distributed problems.
de Fréin, R., Xu, B., Robson, E. and O'Foghlú, Mícheál (2012) Distributed formal concept analysis algorithms based on an iterative mapreduce framework. In: 10th International Conference on Formal Concept Analysis. Lecture Notes in Computer Science, 10 (10). Springer Verlag, Leuven , Belgium, pp. 292-308. doi:10.1007/978-3-642-29892-9_26