You can use your favorite social network to register or link an existing account:
Or use your email address to register without a social network:
Sign in with these social networks:
Or enter your username and password
Forgot your password?
Yes, please link my existing account with for quick, secure access.
No, I would like to create a new account with my profile information.
Today’s author: Chris Rae, a program manager on the Excel team. Chris is going to discuss iteration in Excel.
Iteration is a powerful feature in Excel, but not one that is familiar to a large number of users. I’m going to use iteration in Excel to create something which will be familiar to many people – Conway’s Game of Life.
Much the same as iteration in any programming language, there’s a small leap to be made to understand the basic principle of interation in a spreadsheet. An iterative spreadsheet is one which no longer simply produces the answer given a set of inputs - it is an evolving process. Each time you calculate the spreadsheet, it will produce a new answer based upon the current data it contains. Iteration has been in spreadsheets for a long time – it was in Lotus 1-2-3 and was a feature in Excel's predecessor, Multiplan. To use iteration and retain all of your hair you need to have a much finer control over when Excel calculates, so let’s go ahead and switch into manual calculation mode by clicking the Office Button, then "Excel Options" and in the "Formulas" section under "Workbook Calculation", check the “Manual” flag. And while we’re here, let’s switch on iteration by checking “Enable ierative calculation”, and setting “Maximum Iterations” to 1.
Now, into cell A1 enter the formula =A1+1.
This, of course, is a circular reference, but there’s no warning message. Excel turns the message off because circular references are the bread and butter of an iterative spreadsheet - each time we calculate it we want to base the results on the previous values, so circular references are a necessity. The result of that formula should show up as 1, because A1 was empty before the formula was entered, and empty cells evaluate to zero. Press the F9 key to recalculate the spreadsheet again – A1 will become 2, and so on. If you went back into the options and turned the number of iterations up, the workook would be iterating several times per recalc and the value of A1 would jump in larger steps.
The next problem is how to restart the counter – this is most easily done with a “reset” flag somewhere on the sheet. So into A2 enter TRUE, hit Ctrl-F3 to bring up the Name Manager and give that cell the name “reset”. Change A1 to read =IF(reset,0,A1+1). Recalculate the spreadsheet once, and A1 will become zero. Type FALSE into A2 and recalculate – now the count has started again.
The Game of Life
Conway’s Game of Life is a simulation involving very simple “life forms” on an infinite grid. There is an excellent Wikipedia article about it with some history, a detailed explanation and some examples. Essentially, each cell in the grid is determined to be either be alive or dead based on a particular set of criteria related to its immediate neighbours – these being the three cells above it, the ones to the left and right and the three below.
The criteria for life or death are:
We’re going to create an iterative spreadsheet with each cell representing one, erm, cell in Conway’s Game of Life. Inside each will be a formula which determines whether it is alive or dead at the end of the iteration, and each time we calculate the spreadsheet we’ll perform one more iteration in the game.
To keep our “alive or dead” formula simple, let’s assume that the named range “nbors” represents the number of neighbours a cell has. And let’s assume that TRUE and FALSE are used to represent the living state of each cell. Based on the above table, we now have the logic:
Let’s deal first with what to do in the case where the current cell is alive. Excel has the CHOOSE function, which will pick the nth item from an array – so =CHOOSE(1,”pig”,”dog”) returns “pig”, and =CHOOSE(2,”pig”,”dog”) returns “dog”. If the cell is alive, we can use CHOOSE to pick its resulting state using the number of neighbours as an index. Each cell has a minimum of zero and a maximum eight possible neighbours, so our new cell value could look something like:
There’s a bit of wastage in there because the CHOOSE function doesn’t have any ability to understand “4 or more”. To get around this, we can use the MIN function to cap the number if it was 4 or more. So we can shorten the above formula to the following (remember we’re using nbors+1):
So that’s the case dealt with where the cell is alive. If it’s dead, we need to bring it alive if it has exactly three neighbours. All the formulas here will relate to the spreadsheet cell B2 – the reason being that the very top-left cell becomes a bit more complex when trying to count its neighbours, which is something I’ll deal with later. We can use a circular reference to check the current state – if the cell is alive we’ll use the CHOOSE() formula above to set our state; if it’s dead it’ll become alive if nbors is exactly 3. We could enter:
In actual fact the formula “nbors=3” is a statement which itself evaluates to TRUE or FALSE. Whenever you want the results of an IF() to be TRUE or FALSE, you can just write the condition in instead of the whole IF(x,TRUE,FALSE) thing, so:
It’s mostly for this reason that TRUE and FALSE are better things to use for spreadsheets like this than, say, 1 and 0, or “x” and “”.
So we now have a formula for one iteration of the game, but we’ve no real way to tell the simulator which values to actually use to start off with. So let's create a new worksheet called “template”, and type some values to start the process. The sample object here is what’s called a “glider” – it’s a cyclical formation which will move forever.
Let’s add a reset cell to our spreadsheet using Ctrl-F3 as before. We can now extend our cell formula in B2 on the main grid sheet to read the starter values from the template if the reset flag is set:
We now have the complete formula for one cell in our game board. Right now it’s #NAME? though, because we don’t have “nbors” defined. As I mentioned earlier, a cell’s neighbours are all the cells which border it. For B2 this would be A1, B1, C1 above, then A2 and C2 to the sides, and then A3, B3 and C3 below. If you use TRUE and FALSE as if they were numbers they will evaluate to 1 and 0, so adding TRUE/FALSE cells together using “+” will return the number of TRUEs in the given cells. So a tempting way to define “nbors” would be to select cell B2, then create new name which evaluated to:
Note the lack of $ signs– this name contains relative references, and will point to a different range when a different cell is selected. If you choose cell C3 and then bring up the name manager, you’ll see:
Anyway – at any particular point in the grid, this will return how many neighbours the cell has. There’s a problem, though. The Game of Life is played on a board which flips state in one single move, and unfortunately what we’ve made is a board which will change as we iterate through the cells establishing their new values. The way the game is supposed to work, no changes are made as you run through the cells – you just work out what the new values will be and then set them all at once at the end of the pass. Fiddlesticks.
We need to find some way of saying to Excel that the number of neighbours the cell has should come from the board as it stood at the beginning of the iteration, not from the board as it stands now with us in the middle of modifying it. To do this we can take advantage of one of the powerful foibles of iterative calculation mode – in iterative mode, Excel will calculate worksheets one by one in alphabetical order.
So - to get around this problem, we can create a new sheet containing the number of neighbours for each cell and we know that this sheet won’t be calculated until the whole of the main grid has finished. Let’s go ahead and rename our main sheet to “1.run” and create a new “2.nbors” worksheet. I’ve used the numeric prefixes to remind myself that these sheets need to be calculated in that order. In actual fact in this instance it doesn’t matter in which order they’re calculated, but for other iterative sheets it might well, and I find it makes debugging easier.
Into B2 on the new “2.nbors” sheet we can now enter a formula to count the number of neighbours that ‘1.run’!B2 has. B2 is going to be the top left of our board and as such some of the surrounding cells are going to be empty – let’s ignore this for the moment and fix it later. The formula I was intending using above now becomes:
This isn’t the prettiest formula. Because iterative calc allows circular references, we can simplify it by just including B2 itself, and taking account of that in our original single-cell formula. So we could change that formula to:
Well, no. Unfortunately SUM doesn’t treat boolean TRUE/FALSE values quite the same as “+” does – SUM only sums proper numeric values, and so we get zero from summing TRUEs. A handy trick here is to use Excel’s SIGN() function. This returns 1 if a number is positive and 0 if it’s zero – it is quite happy taking booleans, so =SIGN(TRUE) is 1 and =SIGN(FALSE) is 0. We can array-enter this to make it act upon a range of booleans, and then SUM the results. If you’re new to array formulas I don’t have room to go into too much detail here – esentially they are formulas which can act upon an array of cells sequentially, and also return more than one result. If you search for “array formulas” in Excel’s help there are some good examples.
Still working on cell ‘2.nbors’!B2 we can array-enter (Ctrl-Shift-Enter) the following formula to sum all of the TRUE values in A1:C3 on the 1.run sheet:
A handy tip (especially handy for array formulas) is to use F9 in the formula bar to evaluate parts of your formula – if you highlight just the SIGN() portion of that formula:
And hit F9:
It shows the results of the SIGN() function. If you then highlight the SUM segment:
And F9 again:
You can see it evaluated the SUM portion too. Hit escape now to cancel editing the formula.
Now that “nbors” isn’t exactly the number of neighbours any more (remember we’re including the cell itself now) we’ll have to go back to our original formula and fix it up. There’s no change needed if the cell is dead, but now if the cell is alive the number of neighbours is going to be greater by 1. We can change our original cell formula (on ‘1.run’!B2) to stop adding 1 to nbors in the “if alive” case:
We should now be able to populate some more cells and run the sheet. Extend the formulas on 1.run and 2.nbors right and downwards to make them into a bigger board, change the reset flag to TRUE and hit F9 to reset the sheet to the starting values. Let's make the live cells a bit clearer to spot - bring up the conditional formatting dialog (Home..Styles..Conditional Formatting..New Rule..Format only cells that contain) and format B2 to have a dark background if its value is TRUE. You can now copy/paste this format onto the rest of the board.
Now change the reset flag to FALSE and hit F9 a few times – successive recalculates will show the iteration sequence in the game.
There’s only one problem left – the Game of Life is supposed to be played on an infinite board, but what we’ve created is one with edges – any shape which slips off the edge of the board will vanish forever, whereas what we really ought to do is have it come on the other side. We can effect this by modifying the 2.nbors sheet – the way I found to do this isn’t overly pretty so I’d appreciate other suggestions. On the 2.nbors sheet, B2 is at the top left so the cells above and to the left of it aren’t actually on the board at all. We really want the cells above it to come from the bottom row and the cells to the left of it to come from the rightmost column. We can do this by simply switching back to using “+” for the corner cells – in my somewhat arbitrary 40x35 board, B2 becomes:
I know, I know. It’s not pretty. Perhaps something nicer could be made with INDEX() and ROW()/COLUMN(). We need to do a similar wraparound at the other corners and a less complicated one just using two SUM(SIGN())s for the top, left, right and bottom rows. Now when 1.run!B2 needs to know how many neighbours it has, we’ll accurately be taking into account the ones from the opposite edge of our “infinite” board.
Now that there are more rows and columns in Excel 2007, we can run a bigger simulation – the one below is 500x500. The row and column headers look a little odd as I’ve had to make the rows heights and column widths rather small.
And there we have it. There are a few websites with interesting Game of Life models on them – The Internet Encyclopedia of Science has some good simple ones, and there are more on the Wikipedia page. Here is the workbook I created during this post – it’s in XLS format and will work fine in Excel versions 97 and above. If you extend this at all, I'd be interested to see the results.
If you use 0-s (or blanks) and 1-s, =(nbors=3)+AND(nbors=2,B2) is a bit shorter :-)
The easiest way I can see to achieve the wrap around would be to add one additional cell at each boundary (top, bottom, left and right) of both sheets which simply takes the result calculated at the opposite edge of the board.
This would then not require any modification to the more complex formulae involved in the calculation.
When I saw the Print preview of the above article, words in "BOLD" were looking jumbled up with other words. Anyway A nice Article.
Below is a link highlighting something which I feel should be categorized as a "BUG" and may be of interest to you. I am reporting here because I don't where I should have been reporting. Sorry for that.
Problem with copying filtered data in Excel 2007 + workaround
It says "you cannot copy only the filtered data in Excel 2007. If you use the autofilter to make a selection and then you only want to copy the filtered rows, Excel 2007 copied everything, even the hidden rows. This is different from the way Excel 2000-2003 works". In that way Excel 2000-2003 is better than Excel 2007.
CA kanwaljit Singh Dhunna
Excellent article explaining a generally poorly understood area of Excel!
Keep up the good work.
kanwaljit: Can you use the this form: (blogs.msdn.com/.../contact.aspx) to send us an email with a sample workbook and the repro steps for the selection/copy/paste issue you are running into? That way, we can follow up and better understand what's going on.
Alex - that's a good point. And all that after me singing the praises of using TRUE/FALSE.
Rick - that's a much more elegant solution than my rather ugly one. I wish I'd thought of it!
Nice job. I thought it was a good implementation of Excel iteration, a tough concept to get a grip on, for Excel users, myself included! Keep up the fine work.
Chris, when do we see the updated tutorial then? :)
I posted my comments at the place you specified, but don't know where to look for the follow up comments.
CA Kanwaljit Singh Dhunna
Kanwaljit - looks like we never got your email, could you use the email form again and resend it?
Instead of using that second sheet, you can do it all in an array formula - so the entire grid is a single array formula, which is also circular. I did this and a lot of other examples when I was first playing with circular references in '96. The results (including my Life.zip solution) can be found in the "Worksheet Functions" section of my Excel page - www.oaltd.co.uk/Excel.