matlab - Import multiple csv files and extract one column of data into a single matrix

Question

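I'm very new to MATLAB, so I'm fairly sure this is a very simple question. I have several output datasets, each with a prefix (e.g. PT_1 to PT_20). I want to use a for loop to import the second column of each csv file into a new matrix and align it with the time column, which is identical across all files.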

The input files look like

PT_1.....

time    param 1 param 2 param 3
2/01/2001 23:00 11.449428   3   314.322471
3/01/2001 23:00 11.448935   3   311.683002

PT_2.....

time    param 1 param 2 param 3
2/01/2001 23:00 11.445892   0   296.523937
3/01/2001 23:00 11.445393   0   294.0944

I would like my output to look like

time    PT_1    PT_2
2/01/2001 23:00 11.449428   11.445892
3/01/2001 23:00 11.448935   11.445393

The code I have so far is

files = 0:1:21;
for i=1:21;
filename = sprintf('WQ_%d.csv', files(i));
origdata = importdata (filename);
end

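I can see that it identifies the file names correctly, but it doesn't really do what I want, because it overwrites the data on every loop iteration. Clearly my code is wrong. Can anyone help me work out how to write suitable code for this? Many thanks!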

score 0 · Accepted Answer

Try this:

%# Set the directory containing the csv files and the number of csv files
DirectoryPath = 'FullDirectoryPathHereWithTrailingSlash';
NumFile = 2;

%# Open the first file and get the first column (the date column)
File1Path = [DirectoryPath, 'PT_1.csv'];
fid1 = fopen(File1Path, 'r');
Date = textscan(fid1, '%s %*[^\n]', 'Delimiter', ',', 'HeaderLines', 1);
fclose(fid1);

%# Convert dates to MATLAB date numbers and get the number of rows
Date = datenum(Date{1, 1}, 'dd/mm/yyyy');
T = size(Date, 1);

%# Preallocate a matrix to hold all the data, and add the date column
D = [Date, NaN(T, NumFile)];

%# Loop over the csv files, get the second column and add it to the data matrix
for n = 1:NumFile

    %# Get the current file name
    CurFilePath = [DirectoryPath, 'PT_', num2str(n), '.csv'];

    %# Open the current file for reading and scan in the second column using numerical format
    fid1 = fopen(CurFilePath, 'r');
    CurData = textscan(fid1, '%*s %f %*[^\n]', 'Delimiter', ',', 'HeaderLines', 1);
    fclose(fid1);

    %# Add the current data to the data matrix
    D(:, n+1) = CurData{1, 1};

end

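Hopefully the commented code I've provided is self-explanatory. The slightly tricky part is the format strings I use in the textscan function. Here is a quick explanation: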

1) '%s %*[^\n]' says: get the first column, which is in string format (i.e. %s), and skip all remaining columns (i.e. %*[^\n]).

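2) '%*s %f %*[^\n]' says: skip the first column, which is in string format (i.e. %*s), get the second column, which is a floating point number (i.e. %f), and then skip all remaining columns (i.e. %*[^\n]).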
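To see the two format strings in isolation, here is a minimal sketch that runs textscan on a piece of in-memory text instead of a file. The variable names SampleText, DateCol and ValueCol are made up for this illustration, and the sample rows have no header line, so 'HeaderLines' is left out.

```matlab
%# Hypothetical sample text: two comma-separated data rows, no header line
SampleText = sprintf('2/01/2001 23:00, 11.449428, 3, 314.322471\n3/01/2001 23:00, 11.448935, 3, 311.683002');

%# Format 1: keep the first field as a string, discard the rest of each line
DateCol = textscan(SampleText, '%s %*[^\n]', 'Delimiter', ',');

%# Format 2: skip the first field, read the second as a number, discard the rest
ValueCol = textscan(SampleText, '%*s %f %*[^\n]', 'Delimiter', ',');

%# Each result is a 1x1 cell holding one column of data
disp(DateCol{1});
disp(ValueCol{1});
```

Each call returns a cell array with one cell per field that was kept, which is why the loop in the answer indexes the result with {1, 1}.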

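Update: I've just updated the code to include a variable at the top that lets you specify the directory the csv files are in (in case it is not the current directory). Just replace the text FullDirectoryPathHereWithTrailingSlash with the appropriate path, e.g. /home/username/Documents/ on Linux or C:\Windows\Blah\ on Windows.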
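If you would rather not depend on the trailing slash, one possible alternative (not what the code above does) is to build the path with fullfile. This is only a sketch: the directory and the file index n below are placeholder values for illustration.

```matlab
%# Hypothetical directory and file index, for illustration only
DirectoryPath = '/home/username/Documents/';
n = 3;

%# fullfile joins the parts and adds a file separator only where one is needed
CurFilePath = fullfile(DirectoryPath, sprintf('PT_%d.csv', n));
disp(CurFilePath);
```

With this approach the directory can be given with or without the trailing separator.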

I've just tested this code on two test csv files, named PT_1.csv and PT_2.csv, which look exactly like this:

time, param 1, param 2, param 3
2/01/2001 23:00, 11, 3, 314.322471
3/01/2001 23:00, 12, 3, 311.683002

and

time, param 1, param 2, param 3
2/01/2001 23:00, 13, 0, 296.523937
3/01/2001 23:00, 14, 0, 294.0944

The result?

>> D

D =

      730853          11          13
      730854          12          14
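The first column of D holds MATLAB serial date numbers rather than readable dates. As a small sketch, assuming you want date strings back for display, datestr can convert that column. The matrix is retyped here from the output above so the snippet runs on its own, and DateStrings is a name made up for this example.

```matlab
%# The result matrix from the test run above, retyped so this snippet is self-contained
D = [730853 11 13; 730854 12 14];

%# Convert the serial date numbers in column 1 back to dd/mm/yyyy strings
DateStrings = datestr(D(:, 1), 'dd/mm/yyyy');
disp(DateStrings);
```

Because the dates were converted with the 'dd/mm/yyyy' format, the date numbers are whole days and carry no time of day.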
