We're sorry Aspose doesn't work properply without JavaScript enabled.

Free Support Forum - aspose.com

UTF-8 support

Can I use java to extract data from an XLSX cell in UTF-8? In other
words, if chinese characters​ are stored in the XLSX file, can I use the
java component to extract those chinese characters​?

Hi,

Yes, you can extract those characters using Aspose.Cells for Java.

Please remember, all characters are stored in Unicode or its different encoding in Xls/Xlsx files.

Please refer to this article how to extract/read cell value.

http://docs.aspose.com/display/cellsjava/Retrieving+Data+from+Cells

I’m not certain what you mean by “all characters are stored in Unicode or its different encoding in XLS/XLSX files.” Does that mean the file needs to be converted into UTF-8 prior to parsing it? Or, do I have to do something like:

BufferedReader br = new BufferedReader(
new InputStreamReader(
new FileInputStream(xlsxFile), “UTF8”));

prior to parsing? Or is there another step I have to do to get the correct characters out of the XLSX file?

Hi,

You do not need to do anything, just load your workbook and start reading the characters.

e.g



Workbook workbook = new Workbook(“C:\yourFile.xlsx”);

Worksheet sheet = workbook.getWorkhseets().get(0);


Cell cellA1 = sheet.getCells().get(“A1”);


cellA1.getValue(); // will give you characters inside cell A1