Euro-Par 2015 Accepted Papers

Alain Tchana, Noel de Palma, Ibrahim Safieddine and Daniel Hagimont. Software consolidation as an efficient energy and cost saving solution for a SaaS/PaaS cloud model
Spiros N. Agathos, Alexandros Papadogiannakis and Vassilios V. Dimakopoulos. Targeting the Parallella
Tiziano De Matteis, Salvatore Di Girolamo and Gabriele Mencagli. A Multicore Parallelization of Continuous Skyline Queries on Data Streams
Klaus Kofler, Biagio Cosenza and Thomas Fahringer. Automatic Data Layout Optimizations for GPUs
Leyuan Wang, Sean Baxter and John Owens. Fast Parallel Suffix Array on the GPU
Huan Zhou, Kamran Idrees and José Gracia. Leveraging MPI-3 shared-memory extensions for efficient PGAS runtime systems
Jose I. Aliaga, Joaquin Pérez and Enrique S. Quintana-Orti. Systematic Fusion of CUDA Kernels for Iterative Sparse Linear System Solvers
Dominique Lasalle and George Karypis. Efficient Nested Dissection for Multicore Architectures
Shahar Timnat, Maurice Herlihy and Erez Petrank. A Practical Transactional Memory Interface
Hao-Ping Kang and Che-Rung Lee. Improving Performance of Convolutional Neural Networks by Separable Filters on GPU
Andrey Rodchenko, Andy Nisbet, Antoniu Pop and Mikel Lujan. Effective Barrier Synchronization on Intel Xeon Phi Coprocessor
Qi Shen, Craig Sharp, William Blewitt, Gary Ushaw and Graham Morgan. PR-STM: Priority Rule Based Software Transactions on the GPU
Foad Lotfifar and Matthew Johnson. Feature Extraction Multi-Level Hypergraph Partitioning Algorithm
Eric Angel, Evripidis Bampis, Vincent Chau and Kim Thang Nguyen. Non-preemptive Throughput Maximization for Speed-Scaling with Power-Down
Enrico Calore, Jiri Kraus, Sebastiano F. Schifano and Raffaele Tripiccione. Accelerating Lattice Boltzmann Applications with OpenACC
Ramon Nou, Alberto Miranda and Toni Cortes. Performance impacts with Reliable Parallel File Systems at Exascale level
Ahmad Abdelfattah, Hatem Ltaief and David Keyes. High Performance Multi-GPU SpMV for Multi-component PDE-based Applications
Nikola Markovic, Daniel Nemirovsky, Veljko Milutinovic, Osman Unsal, Mateo Valero and Adrian Cristal. Hardware Round-Robin Scheduler for Single-ISA Asymmetric Multi-Core
Kamal Sharma, Ian Karlin, Jeff Keasler, James McGraw and Vivek Sarkar. Data Layout Optimization for Portable Performance
Joyce Whang, Andrew Lenharth and Inderjit Dhillon. Scalable Data-driven PageRank: Algorithms, System Issues & Lessons Learned
Alvaro Estebanez, Diego R. Llanos, David Orden and Belen Palop. Moody Scheduling for Speculative Parallelization
Stephen Moore, Devi Sudheer Chunduri, Sergiy Zhuk, Tigran Tchrakian, Ewout van den Berg, Albert Akhriev, Alberto Costa Nogueira Junior, Andrew Rawlinson and Lior Horesh. Semi-Discrete Matrix-Free Formulation of 3D Elastic Full Waveform Inversion Modeling
Michael Orr and Oliver Sinnen. A Duplicate-Free State-Space Model for Optimal Task Scheduling
Taichiro Suzuki, Akira Nukada and Satoshi Matsuoka. Efficient Execution of Multiple CUDA Applications using Transparent Suspend, Resume and Migration
Xavier Aguilar, Karl Fuerlinger and Erwin Laure. Automatic On-line Detection of MPI Application Structure with Event Flow Graphs
Christian Iwainsky, Sergei Shudler, Alexandru Calotoiu, Alexandre Strube, Michael Knobloch, Christian Bischof and Felix Wolf. How many threads will be too many? On the scalability of OpenMP implementations
Matthieu Kuhn, Guillaume Latu, Nicolas Crouseilles and Stéphane Genaud. Parallelization of an advection-diffusion problem arising in edge plasma physics using hybrid MPI/OpenMP programming
Abdou Guermouche, Loris Marchal, Bertrand Simon and Frédéric Vivien. Scheduling Trees of Malleable Tasks for Sparse Linear Algebra
Carlos Gómez-Calzado, Arnaud Casteigts, Alberto Lafuente and Mikel Larrea. A Connectivity Model for Agreement in Dynamic Systems
Emmanuelle Saillard, Patrick Carribault and Denis Barthou. MPI Thread-level Checking for MPI+OpenMP Applications
Adrien Lèbre, Jonathan Pastor and Mario Südholt. VMPlaceS A Generic Tool to Investigate and Compare VM Placement Algorithms
Peter Thoman, Stefan Moosbrugger and Thomas Fahringer. Optimizing Task Parallelism with Library-Semantics-Aware Compilation
Harald Servat, German Llort, Juan Gonzalez Garcia, Judit Giménez and Jesús Labarta. Low-overhead detection of memory access patterns and their time evolution
Alessio Guerrieri and Alberto Montresor. DFEP: Distributed Funding-based Edge Partitioning
Tobias Hilbrich, Martin Schulz, Holger Brunst, Joachim Protze, Bronis R. de Supinski and Matthias S. Mueller. Event-Action Mappings for Parallel Tools Infrastructures
Hartwig Anzt, Edmond Chow and Jack Dongarra. Iterative Sparse Triangular Solves for Preconditioning
Olivier Beaumont, Ikbel Belaid, Lionel Eyraud-Dubois and Juan-Angel Lorenzo-Del-Castillo. Allocating jobs with periodic demands
Panagiotis Hadjidoukas, Panagiotis Angelikopoulos, Lina Kulakova, Costas Papadimitriou and Petros Koumoutsakos. Exploiting Task-Based Parallelism in Bayesian Uncertainty Quantification
Matthias Diener, Eduardo Cruz, Marco Antonio Zanata Alves, Mohammad Alhakeem, Philippe Navaux and Hans-Ulrich Heiss. Locality and Balance for Communication-Aware Thread Mapping in Multicore Systems
Mingzhe Li, Khaled Hamidouche, Xiaoyi Lu, Jian Lin and Dhabaleswar Panda. High-Performance and Scalable Design of MPI-3 RMA on Xeon Phi Clusters
Andrew Lenharth, Donald Nguyen and Keshav Pingali. Concurrent Priority Queues are not Good Priority Schedulers
Louis-Claude Canon and Laurent Philippe. On the Heterogeneity Bias of Cost Matrices when Assessing Scheduling Algorithms
Andreas Vogel, Alexandru Calotoiu, Alexandre Strube, Sebastian Reiter, Arne Nägel, Felix Wolf and Gabriel Wittum. 10,000 performance models per minute - scalability of the UG4 simulation framework
Georgios Rokos, Gerard Gorman and Paul Kelly. A Fast and Scalable Graph Coloring Algorithm for Multi-core and Many-core Architectures
Johanne Cohen and Fanny Pascual. Scheduling tasks from selfish multi-tasks agents
Yijia Gu, Thomas Wahl, Mahsa Bayati and Miriam Leeser. Behavioral Non-Portability in Scientific Numeric Computing
Shams Imam and Vivek Sarkar. Load Balancing Prioritized Tasks via Work-Stealing
Shams Imam, Jisheng Zhao and Vivek Sarkar. A Composable Deadlock-free Approach to Object-based Isolation
Jonathan Beard, Cooper Epstein and Roger Chamberlain. Online Automated Reliability Classification of Queueing Models for Streaming Processing using Support Vector Machines
Tekin Bicer, Doga Gursoy, Rajkumar Kettimuthu, Francesco De Carlo, Gagan Agrawal and Ian T. Foster. Rapid Tomographic Image Reconstruction via Large-Scale Parallelization
Alina Sbirlea, Kunal Agrawal and Vivek Sarkar. Elastic Tasks: Unifying Task Parallelism and SPMD Parallelism with an Adaptive Runtime



Keynote Talks (PDF) online

Workshop Organizers Meeting:
Monday, August 24th,
18:15 - 19:00, room EI 3A


Conference program as PDF


Conference program online


BEWARE: Fake conference organizers try to sell you hotel rooms.