creating binary data

Hi friends

1. I have an ordinal data [data with lots of No and YES in different fields]. I have to convert it in to 2s and 3s [2 for NO and a 3 for YES, since the data already has a 0 and 1]. How do i do it

2. I have two different files: a) an input file, that contains 178 attributes--one line for each attribute with its explanation and the possible values it can take and (b) a data file that does not contain the coloumn name but has only the values of these attributes. My interest would be to connect these two datasets. how do i do it. I tried to do it by transposing the rows and coloumns of the attribute file and merging them, but couldnt get any output. a sample data is shown below:

G239476,Non-Essential,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No,0,No, ?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,?,Yes,No,No,No,Yes,No,No,No,Yes,No,No,Yes,No,No,nucleus,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,No,

3. I would like to know any construct as how to do parsing using PP. A sample data is shown below:

Gene:          label.

ESSENTIAL:          Essential, Non-Essential, Ambiguous-Essential.

CLASS Actin related proteins     :          Yes, No.

CLASS Actins       :          Yes, No.

CLASS Adaptins:          Yes, No.

CLASS ATPases:          Yes, No.

CLASS Cyclins:          Yes, No.

CLASS Dehydrogenases:          Yes, No.

CLASS GTP-binding proteins:          Yes, No.

CLASS GTP/GDP dissociation inhibitors (GDIs):          Yes, No.

CLASS GTP/GDP-exchange factors (GEFs):          Yes, No.

CLASS GTPase activating proteins (GAPs):          Yes, No.

CLASS Histones:          Yes, No.

CLASS Major facilitator superfamily proteins (MFS)   :          Yes, No.

in the sample code shown above, I would like to remove the options [Yes,No] AND the first term which are in capitals. I have done parsing using perl, but how do we do it using pp