Publications



    [Book chapter]
  • Embedded Systems Networking: Applications, case studies, and technologies, Elsevier. In preparation.
  • High-Level Design Tools for Complex DSP ApplicationsDSP for Embedded and Real-Time Systems: Expert Guide, Elsevier, 2012. ISBN-13: 9780123865359. 
    Yang Sun, Guohui Wang, Bei Yin, Joseph R. Cavallaro, and Tai Ly 

    [Talks]
  • Massively Parallel Signal Processing for Wireless Communication Systems 
    GPU Technology Conference (GTC) 2013. March 18-21, 2013, San Jose, California
    .
    [Slides] [Recording]

  • [Journal papers]
  • Parallel Interleaver Design for a High Throughput HSPA+/LTE Multi-Standard Turbo Decoder 
    Guohui Wang, Hao Shen, Yang Sun, Joseph R. Cavallaro, Aida Vosoughi, and Yuanbin Guo 
    IEEE Transactions on Circuits and Systems I - Regular Papers. (Invited.)     
    [PDF]
  • Computer Vision Accelerators for Mobile Systems based on OpenCL GPGPU Co-Processing 
    Guohui Wang, Yingen Xiong, Jay Yun, and Joseph R. Cavallaro 
    Journal of Signal Processing Systems. (Invited.)     
    [PDF]
  • Large-Scale MIMO Detection for 3GPP LTE: Algorithm and FPGA Implementation 
    Michael Wu, Bei Yin, Guohui Wang, Chris Dick, Joseph R. Cavallaro, and Christoph Studer 
    IEEE Journal of Selected Topics in Signal Processing.     
    [PDF]
  • GPU Acceleration of a Configurable N-Way MIMO Detector for Wireless Systems 
    Michael Wu, Bei Yin, Guohui Wang, Christoph Studer, and Joseph R. Cavallaro 
    Journal of Signal Processing Systems. (Invited)     
    [PDF]
  • Implementation of a High Throughput 3GPP Turbo Decoder on GPU 
    Michael Wu, Yang Sun, Guohui Wang, and Joseph R. Cavallaro 
    Journal of Signal Processing Systems (JSPS), 2011.
    [PDF] [BibTex]
  • A Novel Design Of the High Speed Buffer and Video/audio Synchronization in High Resolution Digital Cinema System 
    Guohui Wang, Zhenhua Zhu, Ke Zhang, Zhensong Wang 
    High Technology Letters (In Chinese), Vol.9, 2008. 

    [Conference papers]
  • A High Performance GPU-based Software-defined Basestation 
    Kaipeng Li, Michael Wu, Guohui Wang, and Joseph R. Cavallaro 
    48th IEEE Asilomar Conference on Signals, Systems, and Computers (ASILOMAR), Nov. 2014.     
    [PDF]
  • On the Performance of LDPC and Turbo Decoder Architectures with Unreliable Memories 
    Joao Andrade, Aida Vosoughi, Guohui Wang, Georgios Karakonstantis, Andreas Burg, Gabriel Falcao, Vitor Silva, Joseph R. Cavallaro
    48th IEEE Asilomar Conference on Signals, Systems, and Computers (ASILOMAR), Nov. 2014. 
       
    [PDF]
  • Efficient Architecture Mapping of FFT/IFFT for Cognitive Radio Networks 
    Guohui Wang, Bei Yin, Inkeun Cho, Joseph R. Cavallaro, Shuvra Bhattacharyy, and Jorma Takala 
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014.   
    [PDF]
  • A 3.8 Gb/s Large-scale MIMO Detector for 3GPP LTE-Advanced 
    Bei Yin, Michael Wu, Guohui Wang, Chris Dick, Joseph R. Cavallaro, and Christoph Studer 
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014. 
      
    [PDF]
  • High Throughput Low Latency LDPC Decoding on GPU for SDR Systems 
    Guohui Wang, Michael Wu, Bei Yin, and Joseph R. Cavallaro 
    To appear at 1st IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2013.
      
    [PDF] [BibTex] 
  • Workload Analysis and Efficient OpenCL-based Implementation of SIFT Algorithm on a Smartphone 
    Guohui Wang, Blaine Rister, and Joseph R. Cavallaro 
    To appear at 1st IEEE Global Conference on Signal and Information Processing (GlobalSIP), 2013

    [PDF] [BibTex] 
  • HSPA+/LTE-A Turbo Decoder on GPU and Multicore CPU
    Michael Wu, Guohui Wang, Bei Yin, Christoph Studer, and Joseph R. Cavallaro 
    to appear at 47th Asilomar Conference on Signals, Systems, and Computers (ASILOMAR 2013).
      
    [PDF]
  • Highly Scalable On-the-Fly Interleaved Address Generation for UMTS/HSPA+ Parallel Turbo Decoder 
    Aida Vosoughi, Guohui Wang, Hao Shen, Joseph R. Cavallaro, and Yuanbin Guo 
    24th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP 2013), June 2013.
     
    [PDF][
    BibTex] 
  • Accelerating Computer Vision Algorithms Using OpenCL Framework on the Mobile GPU - A Case Study 
    Guohui Wang, Yingen Xiong, Jay Yun, and Joseph R. Cavallaro 
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), May 2013. 
    [PDF] [BibTex]
  • A Fast and Efficient SIFT Detector using the Mobile GPU 
    Blaine Rister, Guohui Wang, Michael Wu and Joseph R. Cavallaro 
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), May 2013.
    [PDF] [BibTex]
  • Parallel Interleaver Architecture with New Scheduling Scheme for High Throughput Configurable Turbo Decoder (Finalist, best student paper contest)
    Guohui Wang, Aida Vosoughi, Hao Shen, Joseph R. Cavallaro, and Yuanbin Guo 
    IEEE International Symposium on Circuits and Systems (ISCAS 2013), May 2013.
     
    [PDF][
    BibTex] 
  • Parallel Nonbinary LDPC Decoding on GPU 
    Guohui Wang, Hao Shen, Bei Yin, Michael Wu, Yang Sun, and Joseph R. Cavallaro 
    46th Asilomar Conference on Signals, Systems, and Computers (ASILOMAR 2012), November 2012. 
    [PDF] [BibTex]
  • Low Complexity Opportunistic Decoder for Network Coding 
    Bei Yin, Michael Wu, Guohui Wang, and Joseph R. Cavallaro 
    46th Asilomar Conference on Signals, Systems, and Computers (ASILOMAR 2012), November 2012. 
    [PDF] [BibTex]
  • GPGPU Accelerated Scalable Parallel Decoding of LDPC Codes 
    Guohui Wang, Michael Wu, Yang Sun, and Joseph R. Cavallaro 
    45th Asilomar Conference on Signals, Systems, and Computers (ASILOMAR 2011), November 2011. 
    [PDF] [BibTex]
  • High-throughput Contention-Free concurrent interleaver architecture for multi-standard turbo decoder 
    Guohui Wang, Yang Sun, Joseph R. Cavallaro and Yuanbin Guo 
    22nd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP 2011), September 2011. 
    [PDF] [BibTex]
  • A Massively Parallel Implementation of QC-LDPC Decoder on GPU 
    Guohui Wang, Michael Wu, Yang Sun, and Joseph R. Cavallaro 
    9th IEEE Symposium on Application Specific Processor (SASP 2011), June 2011. 
    [PDF] [BibTex]
  • Multi-Layer Parallel Decoding Algorithm and VLSI Architecture for Quasi-Cyclic LDPC Codes 
    Yang Sun, Guohui Wang, and Joseph R. Cavallaro 
    IEEE International Symposium on Circuits and Systems (ISCAS 2011), May 2011. 
    [PDF] [BibTex]
  • FPGA Prototyping of A High Data Rate LTE Uplink Baseband Receiver 
    Guohui Wang, Bei Yin, Kiarash Amiri, Yang Sun, Michael Wu, and Joseph R. Cavallaro 
    43rd Asilomar Conference on Signals, Systems and Computers (ASILOMAR 2009), November 2009. 
    [PDF] [BibTex] 

  • [Posters]
  • Parallel Interleaver Design for High Throughput Configurable Turbo Decoder
    (2nd place winner best graduate student poster)
     Annual Rice University ECE Affiliates Day Conference, April 2013.  
  • Parallel Interleaver Design for High Throughput Configurable Turbo Decoder 
    IEEE Texas Workshop on Integrated System Exploration (TexasWISE), March 2013. 
  • Low Energy Fast SIFT Detector on Heterogeneous Mobile Processors 
    IEEE Texas Workshop on Integrated System Exploration (TexasWISE), March 2013. 

  • [Patents]
  • System and Method for a Turbo Decoder with Parallel Interleaver 
    U.S. Patent Application. Filed by Huawei on Oct., 2012. 
  • System and Method for Turbo Code Interleaved Address Generation 
    U.S. Patent Application. Filed by Huawei on Oct., 2012. 
  • System and Method for Contention-Free Memory Access in an Interleaver 
    U.S. Patent US8621160 B2. Filed by Huawei, December 2011. Granted, December 2013. 
    (Also published as CN103262425A, WO2012079543A1) 
  • The Method, System and Device to Implement Video/audio Synchronization 
    China Patent ZL200710120585.0. Filed in 2007. Granted, September, 2012.
  • A Fast and High Performance Zooming Method for Multimedia Video 
    China Patent ZL200710178188.9. Filed in 2007. Granted, October, 2011.
  • A copyright protection method and system for audio and video contents in digital cinema 
    China Patent ZL200810114749.3. Filed in 2008. Granted, March, 2010. 
  • A method of Watermark Generation and Detection for digital cinema Copyright Protection 
    China Patent ZL200810103472.4. Filed in 2008. Granted, September, 2010.

  • [Thesis]
  • Master's thesis: "VLSI Architecture for High Definition Digital Cinema Playback System" (Abstract), Chinese Academy of Sciences, Beijing, China, June, 2008. Relative Project: Research and Implementation of DCI-Compliant 2K Digital Cinema Server (Jan.2006-Jan.2008, in ICT,CAS, Beijing, China).