Rengan Xu, Xiaonan Tian, Sunita Chandrasekaran and Barbara Chapman. Multi-GPU Support on Single Node Using Directive-Based Programming Model, Special Issue on Programming Models, Languages and Compilers for Manycore and Heterogeneous Architectures, Journal of Scientific Programming, Vol. 2015, Article ID 621730, 15 pages, 2015. doi: 10.1155/2015/621730 BibTex
Xiaonan Tian, Rengan Xu, Yonghong Yan, Sunita Chandrasekaran, Deepak Eachempati and Barbara Chapman. Compiler Transformation of Nested Loops for GPGPUs, Special Issue on Programming Models and Applications for Multicores and Manycores, Journal of Concurrency and Computation: Practice and Experience, 2015, doi: 10.1002/cpe.3648 BibTex
Conference and Workshop Publications
Rengan Xu, Junjie Yang, Yifan Xu, Hong Li, Xing Liu,
Devashish Shankar, Haoci Zhang, Meng Liu, Boyang Li, Yuxi Hu, Mingwei Tang, Zehua
Zhang, Tunhou Zhang, Dai Li, Sijia Chen, Gian-Paolo Musumeci, Jiaqi Zhai, Bill
Zhu, Hong Yan, Srihari Reddy. Enhancing
Performance and Scalability of Large-Scale Recommendation Systems with Jagged
Flash Attention, in the 18th ACM Conference on Recommender Systems
(RecSys'24), October 2024, Bari, Italy
Mingwei Tang, Meng Liu, Hong Li, Junjie Yang, Chenglin
Wei, Boyang Li, Dai Li, Rengan Xu, Yifan Xu, Zehua Zhang, Xiangyu Zhang, Linfeng
Liu, Yuelei Xie, Chengye Liu, Labib Fawaz, Li Li, Hongnan Wang, Bill Zhu, Sri
Reddy. Async Learned User Embeddings
for Ads Delivery Optimization, in
Proceedings of the Workshop on Multimodal Representation and Retrieval
(MRR'24), in conjunction with the 47th International ACM SIGIR Conference
on Research and Development in Information Retrieval 2024 (SIGIR'24), July,
2024, Washington, D.C., USA BibTex
Derya Cavdar, Valeriu Codreanu, Can Karakus, John A
Lockman, Damian Podareanu, Vikram Saletore, Alexander Sergeev, Don D Smith,
Victor Suthichai, Quy Ta, Srinivas Varadharajan, Lucas A Wilson, Rengan
Xu, Pei Yang. Densifying
Assumed-Sparse Tensors, in
Proceedings of 34th International Conference on High Performance Computing
(ISC 2019), pages 23-39, June, 2019, Frankfurt, Germany BibTex
Rengan Xu, Frank Han, Quy Ta. Deep Learning at Scale on NVIDIA V100
Accelerators, in Proceedings of the Ninth IEEE International Workshop
on Performance Modeling, Benchmarking and Simulation of High Performance
Computer Systems (PMBS 2018), pages 23-32, November, 2018, Dallas, Texas, USA BibTex
Michael Wolfe, Seyong Lee, Jungwon Kim, Xiaonan Tian,
Rengan Xu, Sunita Chandrasekaran and Barbara Chapman. Implementing the OpenACC Data
Model, in Proceedings of the Seventh International Workshop on
Accelerators and Hybrid Exascale Systems (AsHES 2017), pages 662-672, May, 2017,
Dallas, Texas, USA BibTex
Rengan Xu, Dounia Khaldi, Abid Malik and Barbara Chapman.
ACC-SVM: Accelerating SVM on GPUs
using OpenACC, in Proceedings of the First Workshop of Mission-Critical
Big Data Analytics (MCBDA 2016), May, 2016, Prairie View, TX, USA BibTex
Rengan Xu, Maxime Hugues, Henri Calandra, Sunita Chandrasekaran and Barbara Chapman. Accelerating Kirchhoff Migration on GPU using Directives, in Proceedings of the First Workshop on Accelerator Programming using Directives (WACCPD 2014), pages 37-46, November, 2014, New Orleans, Louisiana, USA BibTex
Guido Juckeland, William Brantley, Sunita Chandrasekaran, Barbara Chapman, Shuai Che, Mathew Colgrove, Huiyu Feng, Alexander Grund, Robert Henschel, Wen-Mei Hwu, Huian Li, Matthias S. Müller, Maxim Perminov, Pavel Shelepugin, Kevin Skadron, John Stratton, Alexey Titov, Ke Wang, Matthijs van Waveren, Brian Whitney, Sandra Wienke, Rengan Xu, Kalyan Kumaran. SPEC ACCEL - A Standard Application Suite for Measuring Hardware Accelerator Performance, in the 5th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS 2014), pages 46-67, November, 2014, New Orleans, Louisiana, USA BibTex
Cheng Wang, Rengan Xu, Sunita Chandrasekaran, Barbara Chapman and Oscar Hernandez. A Validation Testsuite for OpenACC 1.0, in 2014 IEEE 28th International Parallel and Distributed Processing Symposium Workshop & PhD Forum (IPDPSW), pages 1407-1416, 2014, Phoenix, Arizona, USA (Equal contribution by the first two authors.) Talk Slides BibTex
Rengan Xu, Xiaonan Tian, Yonghong Yan, Sunita Chandrasekaran, and Barbara Chapman. Reduction Operations in Parallel Loops for GPGPUs, in the 2014 International Workshop on Programming Models and Applications for Multicores and Manycores (PMAM 2014), pages 10-20, February, 2014, Orlando, Florida, USA Talk Slides BibTex
Xiaonan Tian, Rengan Xu, Yonghong Yan, Zhifeng Yun, Sunita Chandrasekaran, and Barbara Chapman. Compiling A High-Level Directive-based Programming Model for GPGPUs, in the 26th International Workshop on Languages and Compilers for High Performance Computing (LCPC 2013), pages 105-120, September, 2013, Santa Clara, CA, USA Talk Slides BibTex
Xiaonan Tian, Rengan Xu, Yonghong Yan, Zhifeng Yun, Sunita Chandrasekaran and Barbara Chapman. Poster: OpenUH - An Open Source OpenACC Compiler, in GPU Technology Conference (GTC), March, 2014, San Jose, CA, USA BibTex