您的要求“从第 2 行到最后删除所有文本”将有点复杂,因为Word
表格单元格可以包含除文本之外的许多其他内容。
考虑下表:

因此,如果要求是从第 2 行到最后删除所有内容,那么您可以简单地将所有单元格替换为新的干净单元格。或者至少对于其中只有一个空段落的段落。
import java.io.FileOutputStream;
import java.io.FileInputStream;
import java.util.List;
import org.apache.poi.xwpf.usermodel.*;
import org.openxmlformats.schemas.wordprocessingml.x2006.main.CTTc;
import org.openxmlformats.schemas.wordprocessingml.x2006.main.CTTc;
import org.openxmlformats.schemas.wordprocessingml.x2006.main.CTP;
/*
needs the full ooxml-schemas-1.3.jar as mentioned in https://poi.apache.org/faq.html#faq-N10025
since the CTRowImpl is not fully shipped with poi-ooxml-schemas-3.13-*.jar
*/
public class WordCleanTableRows {
public static void main(String[] args) throws Exception {
FileInputStream fis = new FileInputStream("document.docx");
XWPFDocument doc = new XWPFDocument(fis);
List<XWPFTable> tables = doc.getTables();
XWPFTable table = tables.get(0);
XWPFTableRow[] rows = table.getRows().toArray(new XWPFTableRow[0]);
for (int r = 0; r < rows.length; r++) {
if (r > 0) {
XWPFTableRow row = rows[r];
CTTc[] cells = row.getCtRow().getTcList().toArray(new CTTc[0]);
for (int c = 0; c < cells.length; c++) {
CTTc cTTc = cells[c];
//clear only the paragraphs in the cell, keep cell styles
cTTc.setPArray(new CTP[] {CTP.Factory.newInstance()});
cells[c] = cTTc;
}
row.getCtRow().setTcArray(cells);
//System.out.println(row.getCtRow());
}
}
doc.write(new FileOutputStream("new document.docx"));
}
}
这需要完整的 ooxml-schemas-1.3.jar,如https://poi.apache.org/faq.html#faq-N10025中所述,
因为 CTRowImpl 没有完全随 poi-ooxml-schemas-3.13-*.jar 一起提供.
如果没有完整的 ooxml-schemas-1.3.jar,您可以简单地删除除第一行之外的所有行并添加新行。
import java.io.FileOutputStream;
import java.io.FileInputStream;
import java.util.List;
import org.apache.poi.xwpf.usermodel.*;
public class WordCleanTableRows2 {
public static void main(String[] args) throws Exception {
FileInputStream fis = new FileInputStream("document.docx");
XWPFDocument doc = new XWPFDocument(fis);
List<XWPFTable> tables = doc.getTables();
XWPFTable table = tables.get(0);
XWPFTableRow[] rows = table.getRows().toArray(new XWPFTableRow[0]);
for (int r = 0; r < rows.length; r++) {
if (r > 0) {
XWPFTableRow row = rows[r];
table.removeRow(1); //remove second row. others shift upwards
table.createRow(); //add new row at the end
}
}
doc.write(new FileOutputStream("new document.docx"));
}
}
编辑:
以下应该在没有 ooxml-schemas-1.3.jar 的情况下工作,并且与我的第一个示例相同。
import java.io.FileOutputStream;
import java.io.FileInputStream;
import java.util.List;
import org.apache.poi.xwpf.usermodel.*;
import org.openxmlformats.schemas.wordprocessingml.x2006.main.CTP;
import org.openxmlformats.schemas.wordprocessingml.x2006.main.STOnOff;
import java.math.BigInteger;
public class WordCleanTableRows3 {
public static void main(String[] args) throws Exception {
FileInputStream fis = new FileInputStream("document.docx");
XWPFDocument doc = new XWPFDocument(fis);
List<XWPFTable> tables = doc.getTables();
XWPFTable table = tables.get(0);
XWPFTableRow[] rows = table.getRows().toArray(new XWPFTableRow[0]);
for (int r = 0; r < rows.length; r++) {
if (r > 0) {
XWPFTableRow row = rows[r];
List<XWPFTableCell> cells = row.getTableCells();
for (XWPFTableCell cell : cells) {
//get CTTc and replace the CTPArray with one empty CTP
cell.getCTTc().setPArray(new CTP[] {CTP.Factory.newInstance()});
//set some default styles for the paragraphs in the cells:
//http://grepcode.com/file/repo1.maven.org/maven2/org.apache.poi/ooxml-schemas/1.1/org/openxmlformats/schemas/wordprocessingml/x2006/main/CTParaRPr.java
CTP cTP = cell.getCTTc().getPArray(0);
cTP.addNewPPr();
cTP.getPPr().addNewRPr();
cTP.getPPr().getRPr().addNewB().setVal(STOnOff.ON);
cTP.getPPr().getRPr().addNewColor().setVal("FF0000");
cTP.getPPr().getRPr().addNewSz().setVal(BigInteger.valueOf(40));
}
}
}
doc.write(new FileOutputStream("new document.docx"));
}
}
org.openxmlformats.schemas.wordprocessingml.x2006.main.CTP 随 poi-ooxml-schemas-3.13-*.jar 一起提供。