While CUDA has been the dominated parallel computing platform and programming model for general-purpose GPU computing, CUDA synchronization undergoes significant challenges for GPU programmers due to its intricate parallel computing mechanism and coding practices. In this paper,we propose AuCS, the first general framework to automate synchronization for CUDA kernel functions. AuCS transforms the original LLVM-level CUDA program control flow graph in a semantic-preserving manner for exploring the possible barrier function locations. Accordingly, AuCS develops mechanisms to correctly place barrier functions for automating synchronization in multiple erroneous (challenging-to-be-detected) synchronization scenarios, including data race, barrier divergence, redundant barrier functions. To evaluate the effectiveness and efficiency of AuCS, we conduct an intensive set of experiments and the results suggest that AuCS can automate 20 out of 24 erroneous synchronization scenarios.
Thu 14 NovDisplayed time zone: Tijuana, Baja California change
10:40 - 12:20 | ConcurrencyResearch Papers / Demonstrations at Cortez 1 Chair(s): Elena Sherman Boise State University | ||
10:40 20mTalk | MAP-Coverage: a Novel Coverage Criterion for Testing Thread-Safe Classes Research Papers Zan Wang College of Intelligence and Computing, Tianjin University, Yingquan Zhao Tianjin University, Shuang Liu College of Intelligence and Computing, Tianjin University, Jun Sun Singapore Management University, Singapore, Xiang Chen School of Information Science and Technology, Nantong University, Huarui Lin College of Intelligence and Computing, Tianjin University | ||
11:00 20mTalk | Automating Non-Blocking Synchronization In Concurrent Data Abstractions Research Papers Jiange Zhang University of Colorado Colorado Springs, Qing Yi University of Colorado Colorado Springs, Damian Dechev University of Central Florida Pre-print | ||
11:20 20mTalk | Automating CUDA Synchronization via Program Transformation Research Papers Mingyuan Wu Southern University of Science and Technology, Lingming Zhang The University of Texas at Dallas, Cong Liu Eindhoven University of Technology, Shin Hwei Tan , Yuqun Zhang Southern University of Science and Technology | ||
11:40 20mTalk | Efficient Transaction-Based Deterministic Replay for Multi-threaded Programs Research Papers Ernest Bota Pobee City University of Hong Kong, Xiupei Mei City University of Hong Kong, Wing-Kwong Chan City University of Hong Kong, Hong Kong | ||
12:00 10mDemonstration | VeriSmart 2.0: Swarm-Based Bug-Finding for Multi-Threaded Programs with Lazy-CSeq Demonstrations Bernd Fischer Stellenbosch University, Salvatore La Torre Università degli Studi di Salerno, Gennaro Parlato University of Molise | ||
12:10 10mDemonstration | ConVul: An Effective Tool for Detecting Concurrency Vulnerabilities Demonstrations Ruijie Meng University of Chinese Academy of Sciences, Biyun Zhu University of Chinese Academy of Sciences, Hao Yun University of Chinese Academy of Sciences, Haicheng Li University of Chinese Academy of Sciences, Yan Cai Institute of Software, Chinese Academy of Sciences, Zijiang Yang Western Michigan University |