1

我想要实现的是从 Excel 工作表中读取数据(保存在 Txt 中,以制表符分隔),逐行读取数据,每个单独的列都是我想要存储在数组中的不同数据。

我尝试了不同的方法。我什至从互联网上下载了 CSVReader 类,但它没有用。至少这一次它阅读真实的字符而不是胡言乱语。

我现在拥有的版本是使用 bufferedReader 和 String Tokenizer。但它没有正确阅读。

这是代码:

     import java.io.BufferedReader;
     import java.io.BufferedWriter;
     import java.io.File;
     import java.io.FileNotFoundException;
     import java.io.FileReader;
     import java.io.FileWriter;
     import java.io.IOException;
     import java.util.StringTokenizer;

     import com.csvreader.CsvReader;

     import au.com.bytecode.opencsv.CSVReader;


     public class excelToText{
     public static void main(String[] args) throws IOException {

    try
    {

        //csv file containing data
          BufferedReader CSVFile = new BufferedReader(new          FileReader("C:/Users/nhajjar/workspace/MB/src/yoyo.txt"));


          // Read first line.
          String dataRow; 
          int lineNumber = 0; 


          while ((dataRow = CSVFile.readLine() )!= null)
          {
                String [] dataArray;
                lineNumber++; 



                String delimiter = "\t";
                /* given string will be split by the argument delimiter provided. */
                dataArray = dataRow.split(delimiter);
                /* print substrings */

// for(int i =0; i < dataArray.length ; i++) // System.out.println(dataArray[i]);

                String PrinterName = dataArray[0];
                String Model = dataArray[1];
                String IP = dataArray[2];
                String Location = dataArray[3];
                String Department = dataArray[4];
                String PrimServer = dataArray[5];
                String SecServer = dataArray[6];
                String ShareName = dataArray[7];
                String GroupNamePrefix = dataArray[8];
                String GroupNameSuffix = dataArray[9];
                String GroupNameFinal = dataArray[10];
                String WSPPrefix = dataArray[11];
                String WSPFull = dataArray[12];
                String PrimWSP = dataArray[13];
                String SecWSP = dataArray[14];

              System.out.println("PrinterName is : " + PrinterName);
                System.out.println("Model is : " + Model);
                System.out.println("IP is : " + IP);
                System.out.println("Location is : " + Location);
                System.out.println("Department is : " + Department);
                System.out.println("PrimServer is : " + PrimServer);
                System.out.println("SecServer is : " + SecServer);
                System.out.println("ShareName is : " + ShareName);
                System.out.println("GroupNamePrefix is : " + GroupNamePrefix);
                System.out.println("GroupNameSuffix is : " + GroupNameSuffix);
                System.out.println("GroupNameFinal is : " + GroupNameFinal);
                System.out.println("WSPPrefix is : " + WSPPrefix);
                System.out.println("WSPFull is : " + WSPFull);
                System.out.println("PrimWSP is : " + PrimWSP);
                System.out.println("PrimWSP is : " + PrimWSP);
                System.out.println("SecWSP is : " + SecWSP);



                /*//writing file for ones that were not sent out. 
                  File file = new File("write.txt");
                  BufferedWriter output = new BufferedWriter(new FileWriter(file));
                  */

            }




            CSVFile.close();

        }   catch (FileNotFoundException e) {
            e.printStackTrace();
        }   catch (IOException e) {
            e.printStackTrace();
        }   catch(Exception e)          {
          System.out.println("Exception while reading/writing csv file: " + e);                   
        }// end Exceptions 

  }// end try block 
     }

输入是:

    Printer name    Model   IP  Location    Department  Primary Server  Secondary Server    Share Name
    Boundary Sprinter Techs Lexmark E360dn      Boundary    Sprinter    s173m928site    s173mho1site    928-Sprinter-techs-L-E360dn
    Boundary Sprinter Xerox 7232    Xerox WorkCentre 7232       Boundary    Sprinter    s173m928site    s173mho1site    928-Sprinter-WC7232
    Boundry Parts   HP LaserJet P2055dn     Boundary    Parts   s173m928site    s173mho1site    928-Parts-LJ-P2055dn
    Boundry Sales   HP Color LaserJet CP4005        Boundary    Sales   s173m928site    s173mho1site    928-Sales-Main-CP4005
    Boundry Techs East  HP LaserJet P3015       Boundary    Techs East  s173m928site    s173mho1site    928-Techs-east-LJ-P3015
    Boundry Techs West  Lexmark E352dn      Boundary    Techs West  s173m928site    s173mho1site    928-Techs-west-L-E352dn
    Concord     Lexmark E360dn      Concord         s173mho1site    
    Dundas Parts    Xerox WorkCentre 7232       Dundas  Parts   s173m910site    s173mho1site    910-Parts-WC-7232
    Dundas Preowned Xerox WorkCentre 7425       Dundas  Preowned    s173m910site    s173mho1site    910-PreOwned-WC-7425
    Dundas Sales 2nd Floor  HP Color LaserJet CP4025        Dundas  Sales   s173m910site    s173mho1site    910-Sales-2nd-CP4025
    Dundas Sales Main Floor HP Color LaserJet CP4025        Dundas  Sales   s173m910site    s173mho1site    910-Sales-Main-CP4025


output im getting is : 


PrinterName is : Printer name 
Model is : Model
IP is : IP
Location is : Location
Department is : Department
PrimServer is : Primary Server 
SecServer is : Secondary Server
ShareName is : Share Name
GroupNamePrefix is : Group Name Prefix
GroupNameSuffix is : Group Name Suffix
GroupNameFinal is : Group Name Final
WSPPrefix is : WSP Prefix
WSPFull is : WSP Full
PrimWSP is : Primary WSP
PrimWSP is : Primary WSP
SecWSP is : Secondary WSP


PrinterName is : Boundary Sprinter Techs
Model is : Lexmark E360dn
IP is : 53.254.177.138
Location is : Boundary
Department is : Sprinter
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Sprinter-techs-L-E360dn
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Sprinter-techs-L-E360dn
GroupNameFinal is : D173_PRINTER-928-Sprinter-techs-L-E360dn
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Sprinter-techs-L-E360dn.wsp;D173_PRINTER-928-Sprinter-techs-L-E360dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-techs-L-E360dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-techs-L-E360dn
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Sprinter-techs-L-E360dn


PrinterName is : Boundary Sprinter Xerox 7232
Model is : Xerox WorkCentre 7232
IP is : 53.254.177.136
Location is : Boundary
Department is : Sprinter
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Sprinter-WC7232
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Sprinter-WC7232
GroupNameFinal is : D173_PRINTER-928-Sprinter-WC7232
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Sprinter-WC7232.wsp;D173_PRINTER-928-Sprinter-WC7232
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-WC7232
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sprinter-WC7232
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Sprinter-WC7232


PrinterName is : Boundry Parts
Model is : HP LaserJet P2055dn
IP is : 53.254.193.222
Location is : Boundary
Department is : Parts
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Parts-LJ-P2055dn
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Parts-LJ-P2055dn
GroupNameFinal is : D173_PRINTER-928-Parts-LJ-P2055dn
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Parts-LJ-P2055dn.wsp;D173_PRINTER-928-Parts-LJ-P2055dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Parts-LJ-P2055dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Parts-LJ-P2055dn
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Parts-LJ-P2055dn


PrinterName is : Boundry Sales
Model is : HP Color LaserJet CP4005
IP is : 53.254.193.117
Location is : Boundary
Department is : Sales
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Sales-Main-CP4005
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Sales-Main-CP4005
GroupNameFinal is : D173_PRINTER-928-Sales-Main-CP4005
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Sales-Main-CP4005.wsp;D173_PRINTER-928-Sales-Main-CP4005
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sales-Main-CP4005
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Sales-Main-CP4005
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Sales-Main-CP4005


PrinterName is : Boundry Techs East
Model is : HP LaserJet P3015
IP is : 53.254.193.220
Location is : Boundary
Department is : Techs East
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Techs-east-LJ-P3015
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Techs-east-LJ-P3015
GroupNameFinal is : D173_PRINTER-928-Techs-east-LJ-P3015
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Techs-east-LJ-P3015.wsp;D173_PRINTER-928-Techs-east-LJ-P3015
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-east-LJ-P3015
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-east-LJ-P3015
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Techs-east-LJ-P3015


PrinterName is : Boundry Techs West
Model is : Lexmark E352dn
IP is : 53.254.193.221
Location is : Boundary
Department is : Techs West
PrimServer is : s173m928site
SecServer is : s173mho1site
ShareName is : 928-Techs-west-L-E352dn
GroupNamePrefix is : D173_PRINTER-
GroupNameSuffix is : 928-Techs-west-L-E352dn
GroupNameFinal is : D173_PRINTER-928-Techs-west-L-E352dn
WSPPrefix is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\
WSPFull is : #;\D173\_GLOBALRESOURCES\GROUPS\Printers\D173_PRINTER-928-Techs-west-L-E352dn.wsp;D173_PRINTER-928-Techs-west-L-E352dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-west-L-E352dn
PrimWSP is : >;%;\\s173m928site.cambc.corpintra.net\928-Techs-west-L-E352dn
SecWSP is : ;>;%;\\s173mho1site.cambc.corpintra.net\928-Techs-west-L-E352dn
Exception while reading/writing csv file: java.lang.ArrayIndexOutOfBoundsException: 7

注意:我截断了输出。这些“空块”还有很多

4

4 回答 4

2

看起来你过于复杂了。CSV 文件具有逗号分隔值。因此,您的文件数据应类似于:

第 1 行:单元 1、单元 2、单元 3、单元 4 等...

第 2 行:第 2 行的单元格 1、第 2 行的单元格 2 等...

您使用制表符分隔的文件,因此您需要在制表符上拆分

//Read first line 
String dataRow;
int lineNumber = 0;
while((dataRow = CSVFile.readline()) != null)
{
    String [] dataArray;
    lineNumber++;
    String delimiter = "\t";
    dataArray = dataRow.split(delimiter); //Now dataArray contains all the tab delimited cells that were on line one of the .txt file
    //Start assigning the information to your variables since it is stored in dataArray
    String PrinterName = dataArray[0];//This would assign the first cell (row 1, column 1 in excel) that was read from the text file. From looking at your input this should be "Printer name"
    String Model = dataArray[1];
    String IP = dataArray[2];
    //etc...Assign the rest
    //Print your output
    //Do anything else you need to do
 }//end the while loop

以下是一些来源:

http://www.java-examples.com/java-string-split-example

注意:这个例子更接近你正在做的只是使用“\t”而不是“,”: http://www.daniweb.com/software-development/java/threads/17262/reading-in-a-。 csv-file-and-loading-the-data-into-an-array

希望这可以帮助

于 2012-08-10T15:33:35.667 回答
0

标记仅由空格和加号分隔。那是因为第二个参数 ofStringTokenizer不允许正则表达式。尝试使用"\t"似乎真正分开的字段。

顺便说一句,您并不需要每行有 100 个字符串数组。

于 2012-08-10T15:32:08.483 回答
0

我建议使用该String.split()函数,因为这将大大简化您的代码。

于 2012-08-10T15:49:34.047 回答
0

我不建议自己使用 .split 或 StringTokenizer 类拆分数据。他们不考虑字段引用等......

使用类似(如前所述)OpenCSV 或 StrTokenizer(commons-lang 的一部分)之类的库为您进行解析。

使用制表符分隔解析器的 StrTokenizer 示例:

StrTokenizer tokenizer = StrTokenizer.getTSVInstance();    
while ( (line = ...) != null) {
    tokenizer.reset(line);
    String tokArray[] = tokenizer.getTokenArray();
}
于 2012-08-10T15:54:28.077 回答