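I need some help with parallel programming in MATLAB. To be clear, I have never implemented parallelization techniques in any of my code before. I have a video compression engine that I developed as part of a university project. It is a basic version of an H.264 video compression engine. I have to apply the parallel processing techniques available in MATLAB to this engine. Basically, I have a function that divides an image frame into multiple blocks (determined by the block size). I am trying to parallelize this piece of code, partially or fully. When there are no dependencies between the blocks, I used "parfor" and it works well. I have posted that implementation below. Now I am trying to parallelize the case where there are dependencies between blocks.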
function [reconstructed_frames, residual_blocks, encoded_data_cell, bit_count_coeff_per_frame, bit_count_mv_per_frame_cell, real_avg_bit_count_per_row_per_frame, total_bit_count_per_frame, QP_used_in_row, scene_change_frames, SAD_value_per_frame] = block_prediction_parallalized(Y, block_size, srch_rng, QP, I_period, pathToResiduals, no_ref_frames, VBS_enable, Fast_ME_enable, Frac_ME_enable, lambda, RC_flag, avg_bit_count_row_vary_QP, target_bits_per_frame)
% Function to predict frames based on inter prediction and intra prediction,
% with the given I-period
Y = int64(Y);
[no_rows, no_cols, no_frames] = size(Y);
no_blocks_in_row = (no_cols*block_size)/(block_size*block_size);
no_blocks_in_col = (no_rows*block_size)/(block_size*block_size);
total_blocks_per_frame = (no_rows*no_cols)/(block_size*block_size);
encoded_data_cell = cell(1,total_blocks_per_frame,no_frames);
encoded_data_per_frame = cell(1, total_blocks_per_frame);
ref_frame_inter = zeros(no_rows, no_cols, 1, 'int64') + 128;
bit_count_coeff_per_frame = 0;
bit_count_mv_per_frame_cell = 0;
real_avg_bit_count_per_row_per_frame = 0;
QP_used_in_row = zeros(1,no_blocks_in_col,no_frames);
QP_used_in_row(:,:,:) = QP;
scene_change_frames = [];
SAD_value_per_frame = 0;
ref_frame_index_count = 1;
for k = 1:no_frames
    if k>1
        ref_frame_inter(:,:,1) = Y(:,:,k-1);
    end
    block_segment = 0;
    bitCountMV = 0;
    for row = 1 : block_size : no_rows - block_size + 1
        for col = 1 : block_size : no_cols - block_size + 1
            block_segment = block_segment + 1;
            row_start = row;
            row_end = row_start + block_size - 1;
            col_start = col;
            col_end = col_start + block_size - 1;
            row_end = min(row_end, no_rows);
            col_end = min(col_end, no_cols);
            % Making an array of blocks of size block_size
            block_list_currframe(:,:,block_segment) = Y(row_start:row_end, col_start:col_end, k);
            location_pointers(block_segment,:) = [row_start row_end col_start col_end];
        end
    end
    % Parallelizing the block encoding process
    max_index = size(block_list_currframe,3);
    % Loop for processing blocks concurrently
    parfor block_index = 1:max_index
        % Function for inter-prediction
        [encoded_data, reconstructed_block, residual_block, bit_count_per_block] = paral_debug_funct(block_index, location_pointers, block_list_currframe, ref_frame_inter, block_size, srch_rng, QP, no_rows, no_cols, ref_frame_index_count, VBS_enable, Fast_ME_enable, Frac_ME_enable, lambda);
        % Buffering the output of each worker
        reconstructed_blocks(:,:,block_index) = reconstructed_block;
        residual_blocks_in_frame(:,:,block_index) = residual_block;
        encoded_data_per_frame(:,:, block_index) = encoded_data;
        total_bit_count_per_block(block_index) = bit_count_per_block;
    end
    % Processing the buffered outputs obtained after processing all the
    % blocks.
    for block_index = 1:size(block_list_currframe,3)
        % [row_start, row_end, col_start, col_end] = location_pointers(block_index,:);
        row_start = location_pointers(block_index, 1);
        row_end = location_pointers(block_index, 2);
        col_start = location_pointers(block_index, 3);
        col_end = location_pointers(block_index, 4);
        reconstructed_frames(row_start:row_end, col_start:col_end, k) = reconstructed_blocks(:,:,block_index);
        residual_blocks(:,:,block_index,k) = residual_blocks_in_frame(:,:,block_index);
        encoded_data_cell(:,:,block_index,k) = encoded_data_per_frame(:,:,block_index);
    end
    total_bit_count_per_frame(k) = sum(total_bit_count_per_block, 'all');
end
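In the code above, the blocks do not have to communicate with each other. Now I need them to communicate at some point, because the processing of some blocks has to wait until a previous block has finished. I think the picture below will help make this clearer.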

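I already know that there are two types of parallel processing available: multithreading and multiprocessing. I think multithreading suits my use case. I have read about spmd and parfeval, but the examples I came across are usually not very detailed. Since I am new to parallel processing, these options feel very confusing and it is hard to choose which one to focus on. I guess what I want is for the workers to be able to communicate with each other during execution? I am not sure. If you need a rough idea of the data size: video_frame size = 288x352 (CIF format), block size = 16, no of frames = 21
Thanks!
P.S. Sorry for the long post; I tried to explain it as clearly as possible.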
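To put the data sizes above in perspective, here is a rough sketch of how much parallelism a dependent schedule could expose. It assumes (the post does not confirm this) that each block only has to wait for its left and top neighbours, so all blocks on the same anti-diagonal of the block grid can run at the same time. The variable names are made up for illustration.

```matlab
% Assumed dependency (not confirmed by the post): block (r, c) needs the
% blocks to its left (r, c-1) and above (r-1, c) to be finished first.
frame_rows = 288;   % CIF height, from the question
frame_cols = 352;   % CIF width, from the question
block_size = 16;    % from the question

blocks_per_col = frame_rows / block_size;   % number of block rows
blocks_per_row = frame_cols / block_size;   % number of block columns

% Wave index of every block: blocks on the same anti-diagonal share a wave
[c_idx, r_idx] = meshgrid(1:blocks_per_row, 1:blocks_per_col);
wave_of_block = r_idx + c_idx - 1;

num_waves = max(wave_of_block(:));
blocks_per_wave = accumarray(wave_of_block(:), 1);   % count blocks per wave

fprintf('%d x %d blocks, %d waves, at most %d blocks per wave\n', ...
    blocks_per_col, blocks_per_row, num_waves, max(blocks_per_wave));
fprintf('average blocks per wave: %.2f\n', mean(blocks_per_wave));
```

Under that assumption the CIF frame gives an 18 x 22 block grid and 39 waves, with at most 18 blocks in any one wave, so more than 18 workers would not help within a single frame.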
Reply from a uj5u.com user:
You can use a parfor inside a non-parallel for, like this:
previous_blocks = {};
for color = ["green", "red", "blue"]
    % Extract a cell array of the blocks with this color from the image.
    % extract_blocks_by_color and img are placeholders, not built-ins.
    input_blocks = extract_blocks_by_color(img, color);
    processed_blocks = cell(1, numel(input_blocks));
    parfor i = 1:numel(input_blocks)
        processed_blocks{i} = process_based_on_previous_blocks(i, input_blocks{i}, previous_blocks);
    end
    previous_blocks = processed_blocks;
    % Place processed_blocks back in their original positions in the image.
end
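As a more concrete illustration of the same idea, here is a minimal toy sketch, not the encoder itself. It assumes each block depends only on its left and top neighbours, replaces each block by a single number, and uses made-up arithmetic in place of the real prediction. The outer for walks the anti-diagonal "waves" in order, and the parfor handles the blocks inside one wave. The result is checked against a plain raster-order loop.

```matlab
% Toy stand-in for the encoder: each block is a scalar, and the "processed"
% value of block (r, c) depends on the processed left and top neighbours.
% Assumed dependency pattern and toy arithmetic, not the real H.264 logic.
n_rows = 4;  n_cols = 5;                  % small hypothetical block grid
input_vals = reshape(1:n_rows*n_cols, n_rows, n_cols);

% Reference result: plain raster-order loop
expected = zeros(n_rows, n_cols);
for r = 1:n_rows
    for c = 1:n_cols
        left = 0; top = 0;
        if c > 1, left = expected(r, c-1); end
        if r > 1, top  = expected(r-1, c); end
        expected(r, c) = input_vals(r, c) + left + top;
    end
end

% Wavefront version: sequential over waves, parfor inside each wave
result = zeros(n_rows, n_cols);
for wave = 1:(n_rows + n_cols - 1)
    % Blocks with r + c - 1 == wave form one anti-diagonal
    r_list = max(1, wave - n_cols + 1) : min(n_rows, wave);
    c_list = wave - r_list + 1;
    n_in_wave = numel(r_list);

    % Gather neighbour results before the parfor, so the loop body only
    % touches sliced variables
    left_vals = zeros(1, n_in_wave);
    top_vals  = zeros(1, n_in_wave);
    in_vals   = zeros(1, n_in_wave);
    for i = 1:n_in_wave
        r = r_list(i); c = c_list(i);
        in_vals(i) = input_vals(r, c);
        if c > 1, left_vals(i) = result(r, c-1); end
        if r > 1, top_vals(i)  = result(r-1, c); end
    end

    wave_out = zeros(1, n_in_wave);
    parfor i = 1:n_in_wave
        wave_out(i) = in_vals(i) + left_vals(i) + top_vals(i);
    end

    % Scatter the finished wave back into the frame
    for i = 1:n_in_wave
        result(r_list(i), c_list(i)) = wave_out(i);
    end
end

assert(isequal(result, expected));
```

Gathering the neighbour values into plain vectors before the parfor keeps every variable in the loop body sliced, which is what parfor needs in order to split the work. In the real engine the per-block call would replace the one-line sum, and the gathered inputs would be the reconstructed neighbour blocks.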
