the number of occurences of each character of one string,in another

Question

1 voto

i have a string of more than 100 characters (fasta format of a protein sequence. like

'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'

which is being shortened here for simplicity) and i want to find out whether or not it is hydrophobic. so i have to check the number of occurrences of each of the characters in the set 'A C F I L M P V W Y'(hydrophob amino acids) in my fasta string. considering the very long length of fasta strings, is there any easy way to do that by matlab string functions?

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Follow Question

Answer 1

Azzi Abdelmalek el 28 de Dic. de 2014

Editada: Azzi Abdelmalek el 28 de Dic. de 2014

Abrir en MATLAB Online

1 voto

str='MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'
p={'A' 'C' 'F' 'I' 'L' 'M' 'P' 'V' 'W' 'Y'}'
out=[p cellfun(@(x) nnz(ismember(str,x)),p,'un',0)]

2 comentarios
Mostrar Ninguno Ocultar Ninguno

hiva el 29 de Dic. de 2014

thanks a lot.i guess this works well for a lot of similar cases that are supposed to work the same way in my code(since it is feature extraction and there are lots of features). also tells me how much i don't know from matlab.thanks.

Stephen23 el 30 de Dic. de 2014

Editada: Stephen23 el 30 de Dic. de 2014

Abrir en MATLAB Online

This could be simplified and speeded-up by using arrayfun instead of cellfun, and removing the ismember:

>> t = 'ACFILMPVWY';
>> arrayfun(@(x)sum(str==x), t)
ans =
     6     2     4     6    13     2     7     7     1     7

Iniciar sesión para comentar.

Answer 2

Peter Perkins el 29 de Dic. de 2014

Abrir en MATLAB Online

2 votos

Another possibility:

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> n = hist(double(s),1:90);
>> n(t)
ans =
     6     2     4     6    13     2     7     7     1     7

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Jan el 30 de Dic. de 2014

This is a histogram problem, so histc is an efficient and direct solution.

Iniciar sesión para comentar.

Answer 3

Luuk van Oosten el 24 de En. de 2015

Editada: Luuk van Oosten el 24 de En. de 2015

Abrir en MATLAB Online

2 votos

I reckon you are using the BioInformatics Toolbox. In that case you can probably use:

aacount('SEQ')

Where SEQ is of course your sequence of interest: MEQNGLDHDSRSSIDTTINDTQKTFLEF....

and using

nr_A = All.A
nr_C = All.C
nr_F = All.F

etc. (you get the idea)

you get the numbers of your hydrophobic residues. Sum these and you have your hydrophobic score. You might want to 'normalize' this number by dividing this number by the total amount of amino acids in the sequence.

Of course you can write a loop for this and calculate the hydrophobic score for all your sequences in your FASTA file.

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Answer 4

Shoaibur Rahman el 28 de Dic. de 2014

Abrir en MATLAB Online

1 voto

s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
numA = sum(s=='A')
numC = sum(s=='C')
numF = sum(s=='F')
numI = sum(s=='I')
numL = sum(s=='L')
numM = sum(s=='M')
numP = sum(s=='P')
numV = sum(s=='V')
numW = sum(s=='W')
numY = sum(s=='Y')

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

hiva el 29 de Dic. de 2014

very simple and delicate. really thanks

Iniciar sesión para comentar.

Answer 5

Stephen23 el 30 de Dic. de 2014

Editada: Stephen23 el 30 de Dic. de 2014

Abrir en MATLAB Online

1 voto

A neat solution using bsxfun :

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> sum(bsxfun(@eq,s.',t))
ans =
     6     2     4     6    13     2     7     7     1     7

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

hiva el 30 de Dic. de 2014

Editada: hiva el 30 de Dic. de 2014

wow!!! just wonderful. it works pretty well.thanks a lot.

Iniciar sesión para comentar.

the number of occurences of each character of one string,in another

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Respuesta aceptada

2 comentarios
Mostrar Ninguno Ocultar Ninguno

Más respuestas (4)

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Categorías

Etiquetas

Community Treasure Hunt

the number of occurences of each character of one string,in another

0 comentarios Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Respuesta aceptada

2 comentarios Mostrar Ninguno Ocultar Ninguno

Más respuestas (4)

1 comentario Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

0 comentarios Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

1 comentario Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

1 comentario Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Categorías

Etiquetas

Ver también

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

2 comentarios
Mostrar Ninguno Ocultar Ninguno

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos