Sadly, the use of the volatile INDIRECT
and OFFSET
are almost ubiquitously recommended in such cases on the various Excel sites around, even though there exists a perfectly good, minimally volatile (in fact, volatile "at workbook open" only, as I understand) set-up using INDEX
.
To determine the last non-empty row in a column, column A say, it will be useful to know the datatype of the entries within that column, and also whether there are any null strings ("") present.
Although there are set-ups available which will work whatever the datatype of the entries in question, they are necessarily of a type such that they must process each element within the range passed to them. As such - and particularly if the range over which it is necessary to calculate is quite large - they can be extremely resource-heavy (the second suggestion by kyle being a good example, having to process more than a million cells, irrespective of whether the last non-empty cell is in row 1 or row 1048576).
It is also worthwhile storing the value of the last non-empty row as a Defined Name, LRow say.
The possible definitions for LRow are then as follows:
1) If there are no null strings and all entries are non-numerical, you can define LRow as:
=MATCH("Ω",A:A)
2) If there are no null strings and all entries are numerical, you can define LRow as:
=MATCH(9.9E+307,A:A)
3) If there are no null strings and entries are of a mixed datatype, i.e. some numerical, some non-numerical, you can define LRow as:
=MAX(MATCH("Ω",A:A),MATCH(9.9E+307,A:A))
If there are null strings within the range then the solutions become necessarily more complex, and it is no longer possible to avoid a construction in which each cell within the array is processed individually. As such, the best we can do is to minimize the range which we pass to be processed.
Hence, we have:
4) If there are null strings, you can define LRow as:
=MATCH(1,0/(LEN(A1:INDEX(A:A,MAX(MATCH("Ω",A:A),MATCH(9.9E+307,A:A))))>0))
(You could of course make this even more efficient if the datatype is consistently numerical or non-numerical, as above.)
which requires array-entry (CSE) and which, in general, will vastly reduce the range being processed, especially compared to something such as:
=MATCH(1,0/(LEN(A:A)>0))
which is disastrous in terms of efficiency.
Note that the additional functions within the former are of very little detriment: it is far more important to reduce the number of cells being processed than to worry about an additional function call or two.
You can now use this within an INDEX
construction to dynamically define your range. For example:
=A1:INDEX(A:A,LRow)
which, as mentioned, is barely volatile at all.
Regards
Without using VBA (for which there are tons of examples on the Internet), the most common solution is the use of
INDIRECT
, as you have pointed out. Are we to assume you want to avoid VBA? – CharlieRB – 2015-10-05T13:25:29.930This is still volatile, but it will update if you add/remove columns.
=OFFSET(B1,0,0,COUNTA(B:B))
. You could also use something like=OFFSET(B1,0,0,MAX(IF(NOT(ISBLANK(B:B)),ROW(B:B))))
if you do not have contiguous data, but I've not tested it to see if it's actually faster. – Kyle – 2015-10-05T15:25:36.440