About this Course
최근 조회 1,136

100% 온라인

지금 바로 시작해 나만의 일정에 따라 학습을 진행하세요.

탄력적인 마감일

일정에 따라 마감일을 재설정합니다.

중급 단계

완료하는 데 약 12시간 필요

권장: 4-10 hours/week...

영어

자막: 영어

귀하가 습득할 기술

Computer ArchitectureComputer ProgrammingConfiguring FPGA systems

100% 온라인

지금 바로 시작해 나만의 일정에 따라 학습을 진행하세요.

탄력적인 마감일

일정에 따라 마감일을 재설정합니다.

중급 단계

완료하는 데 약 12시간 필요

권장: 4-10 hours/week...

영어

자막: 영어

강의 계획 - 이 강좌에서 배울 내용

1
완료하는 데 2시간 필요

Reconfigurable cloud infrastructure

Distributed systems, data center and cloud architectures are facing the exponential growth in computing requirements and the impossibility for CPU-based solutions to keep pace. Within this context these complex distributed systems have to move toward accelerated computing. Accelerators complement CPU-based architectures and deliver both performance and power efficiency. Moreover, modern data center, as we know, can be used by several different users to serve different workloads and the idea of having an underlying architecture built on reconfigurable technologies seems to provide an ideal fit for these changing, demanding, workloads. This module provides a description of the main cloud computing components and technologies, as well as detailing the current technologies to accelerate cloud computing workloads....
8 videos (Total 46 min), 3 quizzes
8개의 동영상
An overview of cloud infrastructure6m
Cloud Computing: few definitions7m
Reconfigurable acceleration in the Cloud4m
Reconfigurable acceleration in the Cloud: intel FPGA-based solutions6m
Reconfigurable acceleration in the Cloud: Xilinx FPGA-based solutions5m
Reconfigurable acceleration in the Cloud: from the past, to the future3m
An introduction to the AWS EC2 F1 instances7m
3개 연습문제
QUIZ 130m
QUIZ 230m
QUIZ 35m
2
완료하는 데 2시간 필요

On how to accelerate the cloud with SDAccel

Within this module we are going to have a first taste on how to gain the best out of the combination of the F1 instances with SDAccel providing some few practical instructions on how to develop accelerated applications on Amazon F1 by using the Xilinx SDAccel development environment. Then, we are going to present what it is necessary to create FPGA kernels, assemble the FPGA program and to compile the Amazon FPGA Image, or AFI. Finally, we will describe the steps and tasks involved in developing a host application accelerated on the F1 FPGA....
9 videos (Total 51 min), 3 quizzes
9개의 동영상
F1: instances and FPGA description3m
How FPGA Acceleration Works on AWS3m
AWS F1 Platform Model9m
Creating Kernels from RTL IP, C/C++, OpenCL6m
Compiling the Platform3m
Creating an Amazon FPGA Image2m
Developing and Executing a Host Application on F17m
Start Accelerating4m
3개 연습문제
QUIZ 410m
QUIZ 530m
QUIZ 630m
3
완료하는 데 3시간 필요

Summing things up: the Smith-Waterman algorithm

Within this module we are going to introduce you to the Smith-Waterman algorithm that we have chosen to demonstrate how to create a hardware implementation of a system based on FPGA technologies using the Xilinx SDAccel design framework. We are going to dig into the details of the algorithm from its data structures to the computation flow. Then we are going to introduce the Roofline model and we are going to use it to analyze the theoretical peak performance and the operational intensity of the Smith-Waterman algorithm....
8 videos (Total 48 min), 1 reading, 1 quiz
8개의 동영상
Algorithm and code analysis5m
Roofline model 1/26m
Roofline model 2/24m
Code profiling6m
Static Code Analysis 1/26m
Static Code Analysis 2/24m
Performance Prediction via Roofline Model7m
1개의 읽기 자료
SDAccel Environment Profiling and Optimisation Guide30m
1개 연습문제
QUIZ 730m
4
완료하는 데 5시간 필요

The Smith-Waterman example in details

Within this module we are going to dig deeper in the Smith-Waterman algorithm. We are going to implement a first version of the algorithm on a local server with the Xilinx SDAccel design framework. Then we are going to introduce some optimizations to improve performance, in particular we will add more parallelism in the implementation and we will introduce systolic arrays. Moreover, we will explore how we can perform data compression and then we will leverage multiple memory ports to improve memory access speed. Finally, we are going to port our implementation of the Smith-Waterman algorithm on the AWS F1 instances....
12 videos (Total 95 min), 2 readings, 2 quizzes
12개의 동영상
A first implementation 2/39m
A first implementation 3/34m
Parallelism in the Smith-Waterman Algorithm8m
Systolic Array Architecture 1/29m
Systolic Array Architecture 2/212m
Input Compression6m
Shift Register8m
Dual Physical Ports5m
Smith-Waterman accelerated on the Amazon EC2 F1 instances 1/36m
Smith-Waterman accelerated on the Amazon EC2 F1 instances 2/38m
Smith-Waterman accelerated on the Amazon EC2 F1 instances 3/39m
2개의 읽기 자료
Sources Codes30m
Source Codes30m
2개 연습문제
QUIZ 830m
QUIZ 920m
완료하는 데 1시간 필요

Course conclusions

We are working at the edge of the research in the area of reconfigurable computing. FPGA technologies are not used only as standalone solutions/platforms but are now included into cloud infrastructures. They are now used both to accelerate infrastructure/backend computations and exposed as-a-Service that can be used by anyone. Within this context we are facing the definition of new research opportunities and technologies improvements and the time cannot be better under this perspective. This module is concluding this course but posing interesting questions towards possible future research directions that may also point the students to other Coursera courses on FPGAs....
1 video (Total 3 min), 1 reading
1개의 읽기 자료
Architectural optimizations for high performance and energy efficient Smith-Waterman implementation on FPGAs using OpenCL45m

강사

Avatar

Marco Domenico Santambrogio

Associate Professor
DEIB - Dept. of Electronics, Information and Bioengineering

밀라노 국립건축대학 정보

Politecnico di Milano is a scientific-technological University, which trains engineers, architects and industrial designers. From 2014 Politecnico di Milano started the release of several MOOCs, developed by the service for digital learning METID (Methods and Innovative Technologies for Learning), giving everybody the chance to enhance personal skills....

자주 묻는 질문

  • 강좌에 등록하면 바로 모든 비디오, 테스트 및 프로그래밍 과제(해당하는 경우)에 접근할 수 있습니다. 상호 첨삭 과제는 이 세션이 시작된 경우에만 제출하고 검토할 수 있습니다. 강좌를 구매하지 않고 살펴보기만 하면 특정 과제에 접근하지 못할 수 있습니다.

  • 수료증을 구매하면 성적 평가 과제를 포함한 모든 강좌 자료에 접근할 수 있습니다. 강좌를 완료하면 전자 수료증이 성취도 페이지에 추가되며, 해당 페이지에서 수료증을 인쇄하거나 LinkedIn 프로필에 수료증을 추가할 수 있습니다. 강좌 콘텐츠만 읽고 살펴보려면 해당 강좌를 무료로 청강할 수 있습니다.

궁금한 점이 더 있으신가요? 학습자 도움말 센터를 방문해 보세요.