Zhijing George Mou
Bibliography

incomplete, with related other works

On Mou
on Divide and Conquer

Every solution is either trivial or false.
-- Zhijing George Mou.

Ordered by date of original research or of publication.

  • C. F. Gauss, "Nachlass, Theoria Interpolationis Methodo Nova Tractata," in Carl Friedrich Gauss Werke, Band 3, Koniglichen Gesellschaft der Wissenschaften: Gottingen, pp. 265-330, 1866.

  • H. BURKHARDT,"Trigonometrische Interpolation," in Chapter 9, Encyklopiidie der Mathematischen Wissenschaften, vol. 2, part 1, 1st half, 1899-1916.

  • H. BURKHARDT,"Trigonometrische Reihenund Integrale(bis etwa 1850)," in Chapter 12, Encyklopiidie der Mathematischen Wissenschaften,vol. 2, part 1, 2nd half, 1904-1916.

  • J. W. COOLEY & J. W. TUKEY, "An Algorithm for the Machine Calculation of Complex Fourier Series," Math. Comput., Vol. 19, no. 2, pp. 297-301, Reprinted in Digital Signal Processing, ed. Larry R. RABINER & C. M. RADER, pp. 223-227, New York: IEEE Press, 1972, Apr. 1965.

  • Alan Perlis, The Synthesis of Algorithmic Systems, The 1st Turing Lecture. (ACM V 14 No 1, 1967).(cached copy here)

  • Donald Fraser. Array permutation by index-digit permutation. Journal of ACM, 23(2):298—309, April 1976.

  • The 1977 Turing Lecture. Can Programming Be Liberated from the von Neumann Style? A Functional Style and Its Algebra of Programs, by John Backus, IBM Research Laboratory, San Jose CA

  • MICHAEL T. HEIDEMAN, DON H. JOHNSON, C. SIDNEY BURRUS "Gauss and the History of the Fast Fourier Transform", 1985. Rice U Eng. Dept. cached copy here.

  • Z. G. Mou and P. Hudak. An algebraic model for divide-and-conquer algorithms and its parallelism. The Journal of Supercomputiug, 2(3):257-278, November 1988. cached copy.

  • The design and analysis of parallel algorithms / by Selim G. Akl. 1989, Prentice Hall textbook, cached copy here. This was the state of the art before George Mou.

  • Zhijing George Mou, A Formal Model for Divide-and-Conquer and Its Parallel Realization, May 1990, Research Report YALEU/DCS/RR-795. (Yale PhD Thesis)

  • Z. G. Mou. Divacon: A parallel language for scientific computing based on divide-and-conquer. In Proceedings of the Third Symposium on the Frontiers of Massively Parallel Computation, pages 451-461. IEEE, October 1990. Link.

    Abstract: An overview of the language, covering Divacon primitives and simple programming constructs that are referred to as functional forms, is given. Two divide-and-conquer programming constructs are discussed. Divacon style programming is demonstrated for a number of scientific applications. Some interesting equivalences and transformations between Divacon programs are examined. Implementation and performance are briefly considered.

  • Z. G. Mou and M. Goodman, "A Comparison of Communication Costs for Three Parallel Programming Paradigms on Hypercube and Mesh Architectures". Proc. 5th SIAM Conf. on Parallel Processing p491-500, held in Houston TX March 1991. ACM link here.

  • B. Carpentieri, Z. G. Mou, Compile-time transformations and optimization of parallel Divide-and-Conquer algorithms Published in SIGP 1 October 1991. ACM SIGPLAN Notices. Link.
    Abstract: The goal is to make Divide-and-Conquer algorithms suitable for implementation on hypercube-like parallel architectures, such as the Connection Machine, even if the original algorithm implies a division function that is not the left-right division and/or communication pattern that cannot be implemented by direct-neighbor communication
  • Marc Goodman, Z. G. Mou, DC Transpose: a method for reducing communication in divide-and-conquer algorithms on mesh-based computers. Published in Proceedings of the Third IEEE Symposium on Parallel and Distributed Processing 2 December 1991.
    Abstract: The paper introduces the notion of a DC Transpose, which is a remapping of data on a mesh, using a global routing mechanism such as the global router on the Mas-Par MP-1, to solve divide-and-conquer algorithms.
  • Xiaojing Wang, Z. G. Mou. "A divide-and-conquer method of solving tridiagonal systems on hypercube massively parallel computers" Proceedings of the Third IEEE Symposium on Parallel and Distributed Processing. 2 December 1991. Link here.
    Abstract: The authors present a new parallel algorithm, based on the divide-and-conquer (DC) strategy, for solving tridiagonal systems, and show that for the binary hypercube architecture, the communication complexity of their DC method is the lowest among all, and therefore the most efficient tridiagonal solver for communication-expensive massively parallel hypercube computers.
  • Z. G. Mou, C. Constantinescu, and T. Hickey. Divide-and-conquer on a 3-dimensional mesh. In Proceedings of the European Workshops on Parallel Computing, pages 344-355, Barcelona, Spain, March 1992.

  • Z.G.Mou, Cornel Constantinescu, and T. Hickey. Optimal mappings of divide-and-conquer algorithms to mesh connected parallel architectures. In Proceedings of International Computer Symposium, pages 273-284, Taiwan, December 1992.

  • Z. G. Mou, Xiaojing Wang, T. Hickey. Divide-and-Conquer Algorithms with Recursive Broadcast Communication on Reconfigurable Arbitrary Dimensional Mesh. Published in SIAM Conference on Parallel Processing for Scientific Computing. 1993.
  • Z. George Mou and Xiaojing Wang, Optimal Mappings of m Dimensional FFT Communication to k Dimensional Mesh for Arbitrary m and k. pp 104-119. in PARLE '93, Parallel Architectures and Languages Europe, Lecture Notes in Computer Science #694, Springer-Verlag. 5th International PARLE Conference, Munich, Germany, June 1993 Proceedings. cached copy here.

  • Z.G. Mou. A Divide-and-Conquer Parallel Algorithm for Banded Linear Systems. PP, 1995.
  • Jayadev Misra, Powerlist: A Structure for Parallel Recursion. ACM Transactions. on Programming Languages and Systems, Vol 16, N. 6 November 1994, Pages 1737-1767.
  • Z.G. Mou. "Spectrum analysis and min-cut transformation of communication networks in parallel computers" Published in: Proceedings Frontiers '95. The Fifth Symposium on the Frontiers of Massively Parallel Computation (IEEE) Conference in McLean, VA, USA on 6-9 February 1995. Link here.
    Abstract: We present a formal model for the analysis of communication networks in parallel computers. Unlike most others, our model focuses on the transmission delays as opposed to the propagation delays of communication patterns. The model allows all symmetric communication networks to be examined by their spectrums and characterized by their transmission dimensions. A min-cut transformation is introduced as a tool for the spectrum analysis, which reduces any symmetric network to the same canonical form. Parallel architectures with different topologies can then be easily compared and evaluated.
  • Z. G. Mou. The elements, structure, and taxonomy of divide-and-conquer. In Theory and Practice of Higher-Order Parallel Programming, Dagstul Seminar Report 169, pages 13–14, February 1997.
  • Jeffrey Yepez, Guy P. Seeley, and George Mou. Lattice-gas automata on parallel architectures. In Proceedings of International Symposium on Parallel Computation, pages 522–525, 1993. Beijing, China. (For review on lattice gas automata see The Classical Lattice-Gas Method by Jeffrey Yepez 1999, DTIC/ADA455835.pdf).

  • Mou, Z.G. (1999). Recursive individually distributed object. In: Rolim, J., et al. Parallel and Distributed Processing. IPPS 1999. Lecture Notes in Computer Science, vol 1586. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0097888. Paper presented at IPPS/SPDP Workshops 12 April 1999
    TLDR: This paper proposes an alternative model called Individually Distributed Object (IDO) which allows a single large object to be distributed over a network, thus providing a high level interface for the exploitation of parallelism inside the computation of each object which was left out of the distributed objects model.

    Abstract: Distributed Objects (DO) as defined by OMG’s CORBA architecture provide a model for object-oriented parallel distributed computing. The parallelism in this model however is limited in that the distribution refers to the mappings of different objects to different hosts, and not to the distribution of any individual object. We propose in this paper an alternative model called Individually Distributed Object (IDO) which allows a single large object to be distributed over a network, thus providing a high level interface for the exploitation of parallelism inside the computation of each object which was left out of the distributed objects model. Moreover, we propose a set of functionally orthogonal operations for the objects which allow the objects to be recursively divided, combined, and communicate over recursively divided address space. Programming by divide-and-conquer is therefore effectively supported under this framework. The Recursive Individually Distributed Object (RIDO) has been adopted as the primary parallel programming model in the Brokered Objects for Ragged-network Gigaflops (BORG) project at the Applied Physics Laboratory of Johns Hopkins University, and applied to large-scale real-world problems.

  • Google's MapReduce paper MapReduce: Simplified Data Processing on Large Clusters Jeffrey Dean and Sanjay Ghemawat, 2004.
  • Zhijing G. Mou, Hai Liu, Paul Hudak, Compress-and-Conquer for Optimal Multicore Computing, In Proceedings of the POPL 2010 Workshop on Declarative Aspects of Multicore Programming, DAMP 2010, Madrid, Spain, January 19, 2010.
    ACM link here, ResearchGate link here, cached copy here.

    Abstract: The CC paradigm is implemented in Haskell as a modular, higher-order function, compiled into low-level Haskell threads that run on a multicore machine, and performance benchmarks confirm the theoretical analysis.
  • Z. George Mou, Black-White Array: A New Data Structure for Dynamic Data Sets. April 2020, arXiv preprint here. cached copy here.

  • Orian Leitersdorf, Yahav Boneh, Gonen Gazit, Ronny Ronen, Shahar Kvatinsky, FourierPIM: High-Throughput In-Memory Fast Fourier Transform and Polynomial Multiplication Viterbi Faculty of Electrical and Computer Engineering, Technion – Israel Institute of Technology, Haifa, 3200003, Israel. ArXiv 2304.02336v1 [cs.AR] 5 Apr 2023.

  • Patent US20230351547A1, Methods and control systems that use dimensional-transform-based three-dimensional searching and voxel mapping. Published 02-November-2023.

  • Your thoughts?
    (will not be shared or abused)
    Comment:
                                              Feedback is welcome.
    Copyright © 2024 Thomas C. Veatch. All rights reserved.
    Created: June 13, 2024; Modified June 17, 2024
    Abstract: