#1
Junior Member
 
Posts: 1
Default partial match help

Hello,
I am in desperate need of help! Here are my details:
I have one column with different gene identities, such as "gi|351702631|gb|EHB05550.1|".
I have another column with the identities matched with gene descriptions, for example "351702631gb|EHB05550.1|EHB05550.1cGMP-gated cation channel alpha-1 [Heterocephalus glaber]".
I need to find the match for column 1 in column 2; however, as you can see, they are not exact matches. I need to put the match from column 2 into a 3rd column. The lengths of column 1 and 2 do not match.
Please help!
Thanks :)
#2
Senior Member
 
Posts: 663
Default

Any chance you could post a sample workbook with a manually entered example of what you want the result to look like?
#3
Posted to microsoft.public.excel.worksheet.functions
external usenet poster
 
Posts: 1,045
Default partial match help

On Thu, 28 Jun 2012 03:35:08 +0000, cupcakeluv_333 wrote:


> Hello,
> I am in desperate need of help! Here are my details:
> I have one column with different gene identities, such as
> "gi|351702631|gb|EHB05550.1|".
> I have another column with the identities matched with gene
> descriptions, for example "351702631gb|EHB05550.1|EHB05550.1cGMP-gated
> cation channel alpha-1 [Heterocephalus glaber]".
> I need to find the match for column 1 in column 2; however, as you can
> see, they are not exact matches. I need to put the match from column 2
> into a 3rd column. The lengths of column 1 and 2 do not match.
> Please help!
> Thanks :)


This is not straightforward, because it would require multiple substitutions in one string or the other to produce a match. In the example you present, one would have to remove the leading "gi|" and the second "|" from the gene identity in order to get a partial match. Without knowing how the gene identity strings and the gene description strings are constructed, it would be very difficult to develop an accurate algorithm that determines which matches are proper and which are improper.
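
To make that substitution concrete, here is a minimal sketch in Python rather than as a worksheet formula. It assumes every gene identity looks like the single example posted (gi|number|gb|accession|), which has not been confirmed, and the function name normalize_identity is made up for illustration.

```python
def normalize_identity(identity: str) -> str:
    """Drop a leading 'gi|' and then the next pipe (the 2nd pipe of the original)."""
    # ASSUMPTION: the 'gi|' prefix can always be ignored when it is present.
    if identity.startswith("gi|"):
        identity = identity[len("gi|"):]
    # Remove only the first remaining pipe; later pipes are left untouched.
    return identity.replace("|", "", 1)


identity = "gi|351702631|gb|EHB05550.1|"
description = ("351702631gb|EHB05550.1|EHB05550.1cGMP-gated cation channel "
               "alpha-1 [Heterocephalus glaber]")

key = normalize_identity(identity)
print(key)                 # 351702631gb|EHB05550.1|
print(key in description)  # True
```

This only shows that the one posted pair can be made to match; whether the same rule holds for the rest of the data is exactly what is unknown.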

Some questions that come to mind concern the location of the pipes, especially since they differ between the two strings:
the leading "gi|" in the gene identity string -- is there something at the beginning that can always be ignored?
what is the significance of the 2nd EHB05550.1 in the gene description string?
how much of the gene identity has to match the gene description in order to constitute a proper match?
etc.
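
Until those questions are answered, any matching rule is a guess. As one possible rule, the Python sketch below treats the "gi" and "gb" tags as ignorable and requires both the number and the accession to appear somewhere in the description. That rule, the helper names, and the second (placeholder) identity are all assumptions for illustration, not something confirmed by the original poster.

```python
from typing import List, Optional


def identity_tokens(identity: str) -> List[str]:
    """Split on pipes and keep the fields that are not the 'gi'/'gb' tags."""
    # ASSUMED layout: gi|<number>|gb|<accession>|
    parts = [p for p in identity.split("|") if p]
    return [p for p in parts if p not in ("gi", "gb")]


def find_description(identity: str, descriptions: List[str]) -> Optional[str]:
    """Return the first description containing every token, or None."""
    tokens = identity_tokens(identity)
    if not tokens:
        return None
    for text in descriptions:
        if all(tok in text for tok in tokens):
            return text
    return None


column1 = [
    "gi|351702631|gb|EHB05550.1|",
    "gi|000000000|gb|XXX00000.0|",  # hypothetical identity with no match
]
column2 = [
    "351702631gb|EHB05550.1|EHB05550.1cGMP-gated cation channel alpha-1 "
    "[Heterocephalus glaber]",
]

# Third column: the matching description, or an empty string if none is found.
column3 = [find_description(i, column2) or "" for i in column1]
print(column3)
```

Plain substring tests can give false positives (a short number can occur inside a longer one), so a stricter rule may be needed once the real string layout is known.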
