-
Notifications
You must be signed in to change notification settings - Fork 191
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extract edge weights from graph file #945
Comments
Hi gauravdiwan89, The format of the As far as I am aware the gene IDs used in If you need further help implementing this I would be happy to provide a python function which does this. |
Hi Jonathan, Thanks a lot for your comment. However its still not very clear to me how the file is structured. Here are a few example lines from my file
There are even number of entries on each line, so its clear there are multiple pairs. But when you say that the format is If you can share a python function, that would be super! Thank you so much! Best regards, |
Hi Gaurav, The first number in each row is a protein ID, with each further enter being a Protein which OrthoFinder has linked via BLAST score. In your example the first number 0 refers to a protein which is the first gene (row) in the
We can extract 4 gene pairs and weights compared to geneID 0. Through this you can extract the sparse matrix of all gene pair connections. However each gene is coded numerically. The data in |
Oh I see! So they are the indices of the protein in the SequenceID file. That helps me a lot. I will try and script this for my set of proteins and will get back to you if this doesn't work somehow. Thanks! |
I hope that helped. Sorry if it was unclear the Protein IDs undergo several changes during the OrthoFinder run. If have quickly coded something that will check and and return the weights for a given pair of proteins if you need something to start with. Input:
Output:
Function
|
Hello,
Is there a way to extract the MCL graph edge weights between pairs of proteins that have been declared as orthologues?
As far as I can tell, the OrthoFinder_graph.txt file should have these. However is there a way to connect what is in this file with the pair of proteins whose connection has the corresponding weights?
Or maybe I'm misunderstanding something completely.
Any help would be appreciated.
The text was updated successfully, but these errors were encountered: