Is openpyxl the fastest package to use to read big xlsx files?
问题 I'm a python beginner but I'm writing a script using openpyxl to read in big xlsx files (60000x187) into Numpy arrays to do some machine learning. My code: from openpyxl import load_workbook import re from numpy import * wb = load_workbook(filename = 'dataSheet.xlsx', use_iterators = True) #dataSheet.xlsx ws1 = wb.get_sheet_by_name(name = 'LogFileData') startCol = 1 #index from 1 startRow = 2 #start at least from 2 because header is in 1st row endCol = ws1.get_highest_column() #index of last