


default search action
29th HiPC 2022: Bengaluru, India
- 29th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2022, Bengaluru, India, December 18-21, 2022. IEEE 2022, ISBN 978-1-6654-9423-6

- Arjun Menon Vadakkeveedu, Debabrata Mandal, Pradeep Ramachandran, Nitin Chandrachoodan:

Split-Knit Convolution: Enabling Dense Evaluation of Transpose and Dilated Convolutions on GPUs. 1-10 - Bingyi Zhang, Hanqing Zeng, Viktor K. Prasanna:

Low-latency Mini-batch GNN Inference on CPU-FPGA Heterogeneous Platform. 11-21 - Qinghua Zhou

, Quentin Anthony, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Accelerating Broadcast Communication with GPU Compression for Deep Learning Workloads. 22-31 - Nawras Alnaasan

, Arpan Jain, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters. 32-41 - Manohar Lal Das, Vishwesh Jatala, Gagan Raj Gupta:

Joint Partitioning and Sampling Algorithm for Scaling Graph Neural Network. 42-47 - Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens:

Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. 48-58 - Kartik Lakhotia, Fabrizio Petrini, Rajgopal Kannan, Viktor K. Prasanna:

Accelerating Prefix Scan with in-network computing on Intel PIUMA. 59-68 - Ravi Shreyas Anupindi, Swaroop Kotni, Arkaprava Basu:

memwalkd : Accelerating Key-value stores using Page Table Walkers. 69-74 - Manolis Katsaragakis, Christos Baloukas, Lazaros Papadopoulos, Verena Kantere, Francky Catthoor, Dimitrios Soudris

:
Energy Consumption Evaluation of Optane DC Persistent Memory for Indexing Data Structures. 75-84 - Rohit Singh

, K. P. Arun, Debadatta Mishra:
LDT: Lightweight Dirty Tracking of Memory Pages for x86 Systems. 85-94 - Bharath Ramesh, Qinghua Zhou

, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda:
Designing Efficient Pipelined Communication Schemes using Compression in MPI Libraries. 95-99 - Kaushik Kandadi Suresh, Akshay Paniraja Guptha, Benjamin Michalowicz

, Bharath Ramesh, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Efficient Personalized and Non-Personalized Alltoall Communication for Modern Multi-HCA GPU-Based Clusters. 100-104 - Zhihui Du, Joseph Patchett, Oliver Alvarado Rodriguez, Fuhuan Li, David A. Bader:

High-Performance Truss Analytics in Arkouda. 105-114 - Arindam Khanda

, Sanjukta Bhowmick, Xin Liang, Sajal K. Das:
Parallel Vertex Color Update on Large Dynamic Networks. 115-124 - Reet Barik, Marco Minutoli, Mahantesh Halappanavar, Ananth Kalyanaraman:

IMpart: A Partitioning-based Parallel Approach to Accelerate Influence Maximization. 125-134 - Benoît Gallet, Michael Gowanlock:

Leveraging GPU Tensor Cores for Double Precision Euclidean Distance Calculations. 135-144 - Fazlay Rabbi, Christopher S. Daley, Ümit V. Çatalyürek, Hasan Metin Aktulga

:
A Portable Sparse Solver Framework for Large Matrices on Heterogeneous Architectures. 145-155 - Nischay Ram Mamidi, Dhruv Saxena, Kumar Prasun, Anil Nemili, Bharatkumar Sharma, S. M. Deshpande:

Performance analysis of GPU accelerated meshfree q-LSKUM solvers in Fortran, C, Python, and Julia. 156-165 - Abir Mukherjee, Preeti Malakar:

A Deep Learning-Based In Situ Analysis Framework for Tropical Cyclogenesis Prediction. 166-175 - Weicong Chen, Curtis Tatsuoka, Xiaoyi Lu:

HiBGT: High-Performance Bayesian Group Testing for COVID-19. 176-185 - Chang Su, Linglin Wei, Xianzhong Xie:

Churn Prediction in Telecommunications Industry Based on Conditional Wasserstein GAN. 186-191 - Yoichi Shimomura, Akihiro Musa, Yoshihiko Sato, Atsuhiko Konja, Guoqing Cui, Rei Aoyagi, Keichi Takahashi, Hiroyuki Takizawa

:
A Real-time Flood Inundation Prediction on SX-Aurora TSUBASA. 192-197 - Harshvardhan Das, Suraj Kumar, Subodh Kumar:

Precise Parallel FEM-based Interactive Cutting Simulation of Deformable Bodies. 198-203 - David Redon, Bilel Derbel, Pierre Fortin:

Scaling the SOO Global Blackbox Optimizer on a 128-core Architecture. 204-214 - Tri Nguyen

, Michela Becchi:
A GPU-accelerated Data Transformation Framework Rooted in Pushdown Transducers. 215-225 - Tania Banerjee

, Jong Choi, Jaemoon Lee, Qian Gong
, Ruonan Wang, Scott Klasky, Anand Rangarajan
, Sanjay Ranka
:
An Algorithmic and Software Pipeline for Very Large Scale Scientific Data Compression with Error Guarantees. 226-235 - Ryan Kirkpatrick, Christopher Brown

, Vladimir Janjic
:
COMPROF and COMPLACE: Shared-Memory Communication Profiling and Automated Thread Placement via Dynamic Binary Instrumentation. 236-245 - Keith Bateman, Neeraj Rajesh, Jaime Cernuda Garcia, Luke Logan, Jie Ye, Stephen Herbein, Anthony Kougkas, Xian-He Sun:

LuxIO: Intelligent Resource Provisioning and Auto-Configuration for Storage Services. 246-255 - Narasinga Rao Miniskar, Mohammad Alaul Haque Monil

, Pedro Valero-Lara
, Frank Liu, Jeffrey S. Vetter:
IRIS-BLAS: Towards a Performance Portable and Heterogeneous BLAS Library. 256-261 - Avinash Maurya

, Bogdan Nicolae, M. Mustafa Rafique, Amr M. Elsayed
, Thierry Tonellot, Franck Cappello:
Towards Efficient Cache Allocation for High-Frequency Checkpointing. 262-271 - Conglong Li, Ammar Ahmad Awan, Hanlin Tang, Samyam Rajbhandari, Yuxiong He:

1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed. 272-281 - Jason Yik, Sanmukh R. Kuppannagari, Hanqing Zeng, Viktor K. Prasanna:

Input Feature Pruning for Accelerating GNN Inference on Heterogeneous Platforms. 282-291 - Yuta Nakamura, Tanu Malik, Iyad Kanj, Ashish Gehani:

Provenance-based Workflow Diagnostics Using Program Specification. 292-301 - Himani Sikarwar

, Debasis Das:
EECAAP: Efficient Edge-Computing based Anonymous Authentication Protocol for IoV. 302-307

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














