Hi friends
1. I have an ordinal data [data with lots of No and YES in different fields]. I have to convert it in to 2s and 3s [2 for NO and a 3 for YES, since the data already has a 0 and 1]. How do i do it
2. I have two different files: a) an input file, that contains 178 attributes--one line for each attribute with its explanation and the possible values it can take and (b) a data file that does not contain the coloumn name but has only the values of these attributes. My interest would be to connect these two datasets. how do i do it. I tried to do it by transposing the rows and coloumns of the attribute file and merging them, but couldnt get any output. a sample data is shown below:
G239476,Non-Essential,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No, ?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,Yes,No,No,No,Yes,No,No,No,Yes,No,No,Yes,No,No,nucleus,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,
3. I would like to know any construct as how to do parsing using PP. A sample data is shown below:
Gene: label.
ESSENTIAL: Essential, Non-Essential, Ambiguous-Essential.
CLASS Actin related proteins : Yes, No.
CLASS Actins : Yes, No.
CLASS Adaptins: Yes, No.
CLASS ATPases: Yes, No.
CLASS Cyclins: Yes, No.
CLASS Dehydrogenases: Yes, No.
CLASS GTP-binding proteins: Yes, No.
CLASS GTP/GDP dissociation inhibitors (GDIs): Yes, No.
CLASS GTP/GDP-exchange factors (GEFs): Yes, No.
CLASS GTPase activating proteins (GAPs): Yes, No.
CLASS Histones: Yes, No.
CLASS Major facilitator superfamily proteins (MFS) : Yes, No.
in the sample code shown above, I would like to remove the options [Yes,No] AND the first term which are in capitals. I have done parsing using perl, but how do we do it using pp