Hello,
I have 2 data sheets in an Excel workbook that have ~8000 rows of data each. Each row has 2 columns. Col A is an entity ID and Col B contains a set of long string values (i.e., institutional affiliation and location), each value separated by a semicolon (

. More specifically, Col A is a paper ID#, Col B includes the institutional affiliations for each author.
What I need to do is to take the distinct values in Col B and separate them into individual rows for data processing. Also, when I separate the values, I need to retain the relationship between the values in Col B and Col A. That is, I need to put each institutional affiliation in its own row, but ensure that the relationship to the paper remains intact. Put another way, I want to cut everything that follows a ; in Col B, insert a row, paste what I cut into the next row in Col B, and then go back up one row, over one cell, copy the unique ID, go down one cell, and paste.
So here is my problem: when I execute the find function in Excel 2008 and search for the semicolon, it returns the entire cell. Normally, if one executes a find command, it highlights the character. The next logical step would be a shift+end to highlight the remainder of the text to the end of the line. This does not work because the find command returns the entire cell. Does anyone have any advice on how to fix this? Right now I am thinking I need to export the data into another program, execute the find and replace, and then re-import the data. I would rather find a more elegant solution to the problem.
-MrInc
Here