How to find the active number of elements on a single column/row using Openpyxl in Python?

走远了吗. 提交于 2019-12-11 14:54:35

问题


I am using openpyxl.

  1. I have the below basic spreadsheet.

  1. I am trying to just get the active number of elements in a particular column using len() and filter but still not getting what I wanted.

Sample Code:

load_xls_file = open("./sample.xlsx", "r") 
wb = load_workbook(load_xls_file)
sheet = wb.get_sheet_by_name("Sheet")


rock = len(sheet['A'])

print '_code : Value of rock from spreadsheet is',rock 
print '_code : Values are', filter(None,sheet['A'])
print '_code : Values are', sheet['A'] 
print '_code : Values of b', len(sheet['B']) 

Output:

    _code : Value of rock from spreadsheet is 30
    _code : Values are (<Cell u'Sheet'.A1>, <Cell u'Sheet'.A2>, <Cell u'Sheet'.A3>, <Cell u'Sheet'.A4>, <Cell u'Sheet'.A5>, <Cell u'Sheet'.A6>, <Cell u'Sheet'.A7>, <Cell u'Sheet'.A8>, <Cell u'Sheet'.A9>, <Cell u'Sheet'.A10>, <Cell u'Sheet'.A11>, <Cell u'Sheet'.A12>, <Cell u'Sheet'.A13>, <Cell u'Sheet'.A14>, <Cell u'Sheet'.A15>, <Cell u'Sheet'.A16>, <Cell u'Sheet'.A17>, <Cell u'Sheet'.A18>, <Cell u'Sheet'.A19>, <Cell u'Sheet'.A20>, <Cell u'Sheet'.A21>, <Cell u'Sheet'.A22>, <Cell u'Sheet'.A23>, <Cell u'Sheet'.A24>, <Cell u'Sheet'.A25>, <Cell u'Sheet'.A26>, <Cell u'Sheet'.A27>, <Cell u'Sheet'.A28>, <Cell u'Sheet'.A29>, <Cell u'Sheet'.A30>)
    _code : Values are (<Cell u'Sheet'.A1>, <Cell u'Sheet'.A2>, <Cell u'Sheet'.A3>, <Cell u'Sheet'.A4>, <Cell u'Sheet'.A5>, <Cell u'Sheet'.A6>, <Cell u'Sheet'.A7>, <Cell u'Sheet'.A8>, <Cell u'Sheet'.A9>, <Cell u'Sheet'.A10>, <Cell u'Sheet'.A11>, <Cell u'Sheet'.A12>, <Cell u'Sheet'.A13>, <Cell u'Sheet'.A14>, <Cell u'Sheet'.A15>, <Cell u'Sheet'.A16>, <Cell u'Sheet'.A17>, <Cell u'Sheet'.A18>, <Cell u'Sheet'.A19>, <Cell u'Sheet'.A20>, <Cell u'Sheet'.A21>, <Cell u'Sheet'.A22>, <Cell u'Sheet'.A23>, <Cell u'Sheet'.A24>, <Cell u'Sheet'.A25>, <Cell u'Sheet'.A26>, <Cell u'Sheet'.A27>, <Cell u'Sheet'.A28>, <Cell u'Sheet'.A29>, <Cell u'Sheet'.A30>)
_code : Values of b 30

Neither len(), nor filter isn't providing the expected value i.e. 7 rather it prints the max value of 30 all the time. Also even when I do len(sheet['B']) is still provide the same value of 30.

Am i making any simple mistake ? Kindly provide your comments.


回答1:


Spreadsheets are tables of rows and columns so if a spreadsheet has 30 rows then all columns have 30 cells. As to whether a cell is active or not, you have to decide what are the criteria for this but this is trivial to do:

not_empty = [c for c in ws.iter_rows(min_col=1, max_col=1, values_only=True) if c[0] is not None]
print(len(not_empty))



回答2:


Question: get the active number of elements in a particular column


Worksheet:

Title   Title   Title   
1       3       4   
None    None    None    
1       3       4   
min_col = 1  # 'A'
val_counter = 0

# Iterate all Rows, starting at 'min_row='
# Iterate only ONE Column, therefore 'min_col=' and 'max_col=' have the same value
# Returns a Tuple of Column Values ((value A2,), (value A3), ...)

for cells in ws.iter_rows(min_row=2, 
                          min_col=min_col, max_col=min_col,
                          values_only=True):
    value = cells[0]

    # Condition, which to count
    if value is not None:
        val_counter += 1

print('Values are {}'.format(val_counter))
# >>> Values are 2

OOP Solution:

Extending openpyxl class Worksheet with a .filter(... methode.

import openpyxl

class Worksheet:
    def __init__(self, pyxl):
        for attr in ['filter', 'filter_list']:
            setattr(pyxl.worksheet.worksheet.Worksheet, 
                    attr, 
                    getattr(Worksheet, attr)
                   )

    def filter(self, f, range=None, values_only=True):
        cells = self.iter_rows(min_row=range[0],
                               min_col=range[1],
                               max_row=range[2],
                               max_col=range[3],
                               values_only=values_only
                              )

        for row in cells: 
            yield from (c for c in row if f(c))

    def filter_list(self, f, range=None, values_only=True):
        return [v for v in self.filter(f, range, values_only)]

# Extend openpyxl Worksheet
Worksheet(openpyxl)

Usage:

wb = openpyxl.Workbook()
ws = wb.active

# Add some test data
ws.append(['Title', 'Title', 'Title'])
for r in range(3):
    if r == 1:
        ws.append([None, None, None])
    else:
        ws.append([1, 3, 4])


# Filter Values, where Cell.value is not None
# range(min_row, min_col, max_row, max_col)
# Return a List of Values
cells = ws.filter_list(lambda v: v is not None, 
                       range=(2, 1, ws.max_row, 1)

print('Values are {}'.format(len(cells)))
# >>> Values are 2


来源:https://stackoverflow.com/questions/58550548/how-to-find-the-active-number-of-elements-on-a-single-column-row-using-openpyxl

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!