Filter by Row in Excel (Part 2 - Efficiency!)

Posted on 2014-02-16
Last Modified: 2014-02-18
I previously asked a query on Filtering by a value in a Row, rather than a column:

"I would like a way, if possibly, to filter through this by ROW, rather than Column.

For example, in each group we have a Row labelled Responsibility Application. I want to Filter by Responsibility Application ."

This was answered perfectly by user: byundt

Attached is a working example:  ExpertsExchangeExample2Q28363142.xlsm

This solution works perfectly for a dataset of about 5000 rows or less. It takes 40-50 seconds to calculate, but works and is usable.

However, I have been given the same task on a file with 25,000 rows. This solution of filtering by Row causes Excel and the laptops in question to almost break down, so unfortunately can be used.

Can anyone, byundt included, come up with a way to run this or a solution that will work on very large data sets?

Thank you,

Question by:EllenExcel
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions

Author Comment

ID: 39863335
I'm attaching a blank example (with just data, not the previous solution):

LVL 22

Assisted Solution

by:Ejgil Hedegaard
Ejgil Hedegaard earned 125 total points
ID: 39863421
Smart solution.

Computations for entire columns take a long time.
If the search is narrowed down to just done when a new section starts,
(Col A=Responsibility), and only looking at a smaller range.

For column D, the numbering.
I have changed to numbers, to use the Max function, only performed when a new block start.
For D6, copied down.

Open in new window

For Column E, Result True/False, only looking 100 rows down
For E6, copied down

Open in new window

With 4.500 rows it just takes a second.
See file
LVL 92

Assisted Solution

by:Patrick Matthews
Patrick Matthews earned 125 total points
ID: 39863538
Another approach, relying wholly on VBA:


Use A2:B2 to change what field you filter, and what value you seek for that field.

Note the following:
Code looks at key values in Columns B:???.  To control how many columns are being used, change the NumColumns constant
If the value being sought is found anywhere in those columns, the item is kept (i.e., not hidden)
Industry Leaders: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

LVL 81

Accepted Solution

byundt earned 125 total points
ID: 39863921
I had thought of restricting the range in the MAX and COUNTIFS functions in your original question, but chose not to in the interest of simplicity. To take advantage of these formula improvements, you would need to specify both the maximum "gap" between records and the maximum number of rows possible in a "record".

hgholt has already suggested a suitable improvement for the COUNTIFS formula in column E. He assumed no more than 100 rows in a record.

Here is an improved MAX formula in cell D6 (should be copied down). It assumes there are no more than four blank rows between records.

With 26,000 rows of data, the macro found a single record located near the end of the list in a couple of seconds on my six  year old Macintosh running Windows in emulation. No change in macro code was necessary.

If this were my workbook, I would be thinking about putting each record on a single row. The more you analyze the data, the more firmly I feel such a change would be needed. Putting each record on a single row would mean that each of the different fields would be in its own column. A macro to remap the data wouldn't be hard to write.


Assisted Solution

c_kedar earned 125 total points
ID: 39864470
See attached.

Type the FieldName by which you want to filter in B1 i.e. "Responsibility Name"
column D will be populated.

Use filter in D3 to filter by value of the FieldName you have selected.

The formula for populating D, in D4 is
=IF(A4="Responsibility", VLOOKUP($B$1, A4:B999,2,FALSE), D3)

Open in new window

Here is the explanation for the same:
A4="Responsibility" => Detect first row of the block of of rows constituting one record
VLOOKUP($B$1, A4:B999,2,FALSE) => Value of selected FieldName in this record. 999 here is arbitrary, should be greater or equal to max no of rows in a record.
D3 => Cell above

Logic: If it is start of the block, then set cell to value of selected key in the record, for subsequent rows, simply copy the value set above.

Note: If selected field is not present in a record but present in subsequent record that is covered in next 999 rows, then result will be erroneous.
There is more complex formula in column E which solves this issue but has problem with last record. This can be solved by ensuring an empty record at the end, i.e. insert "Responsibility" in in A142.

Author Closing Comment

ID: 39866855
All of your solutions work, so I've spread the points across the lot.

Thanks all for the comments - Brad you are right it would be much simpler to transpose everything up to a column view looking at it.

I wanted this solution because who this is for preferred to keep the sheet in the same format as the 3rd party product produces, so I was restricted to working with that and fell short with the know-how.

Thank you all,

LVL 92

Expert Comment

by:Patrick Matthews
ID: 39868534
Glad to help :)

Featured Post

Industry Leaders: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article descibes how to create a connection between Excel and SAP and how to move data from Excel to SAP or the other way around.
This article describes how you can use Custom Document Properties to store settings and other information in your workbook so that they will be available the next time you open the workbook.
The viewer will learn how to use a discrete random variable to simulate the return on an investment over a period of years, create a Monte Carlo simulation using the discrete random variable, and create a graph to represent the possible returns over…
This Micro Tutorial demonstrates in Microsoft Excel how to consolidate your marketing data by creating an interactive charts using form controls. This creates cool drop-downs for viewers of your chart to choose from.

724 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question