Amino acid dipeptide frequency for Frankliniella occidentalis (Western flower thrips) (Euthrips occidentalis)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
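As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. It assumes one-letter amino acid codes and plain Python strings as input. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the database's own pipeline may differ (for example in how unknown residues are handled).

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over all given proteins.

    Overlapping pairs are counted within each protein only, so no pair
    spans the boundary between two sequences (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: hypothetical sequences, not taken from the proteome.
toy_proteins = ["MKKL", "AAK"]
toy_freqs = dipeptide_frequencies(toy_proteins)
```

With this toy input there are five dipeptides in total (MK, KK, KL, AA, AK), so each one accounts for one fifth of the counts.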

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
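The uniform baseline and the over/under-representation rule can be sketched as follows. This is only an illustration: the helper names are hypothetical, the four example values are copied from the table on this page, and it is an assumption that the colouring uses the 400-dipeptide baseline rather than the 441-dipeptide one.

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform baseline."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table on this page.
examples = {"SerSer": 10.836, "LeuLeu": 9.194, "TrpTrp": 0.178, "AlaAsn": 2.497}
labels = {name: classify(value) for name, value in examples.items()}
```

A borderline value such as AlaAsn (2.497‰) falls just below the 2.5‰ baseline for 20 amino acids but above the roughly 2.27‰ baseline for 21, so the choice of baseline matters near the threshold.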

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
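For readers converting the tabulated values, a minimal sketch of the unit arithmetic is given below. The function names are hypothetical, and the example value is the AlaAla entry from the table on this page.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def permille_to_percent(value):
    """Convert a per-mille value to a percentage."""
    return value / 10.0


ala_ala_permille = 9.146  # AlaAla entry from the table
ala_ala_fraction = permille_to_fraction(ala_ala_permille)
ala_ala_percent = permille_to_percent(ala_ala_permille)
```

Here 9.146‰ corresponds to a fraction of about 0.009146, or about 0.9146%.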

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.146 ± 0.07
AlaCys: 1.35 ± 0.018
AlaAsp: 3.832 ± 0.023
AlaGlu: 4.704 ± 0.032
AlaPhe: 2.221 ± 0.016
AlaGly: 5.24 ± 0.031
AlaHis: 1.879 ± 0.015
AlaIle: 2.776 ± 0.02
AlaLys: 3.481 ± 0.023
AlaLeu: 7.271 ± 0.046
AlaMet: 1.697 ± 0.012
AlaAsn: 2.497 ± 0.016
AlaPro: 5.35 ± 0.051
AlaGln: 3.249 ± 0.028
AlaArg: 4.234 ± 0.027
AlaSer: 6.669 ± 0.037
AlaThr: 4.237 ± 0.028
AlaVal: 5.504 ± 0.029
AlaTrp: 0.726 ± 0.01
AlaTyr: 1.618 ± 0.022
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.324 ± 0.016
CysCys: 0.482 ± 0.009
CysAsp: 1.18 ± 0.014
CysGlu: 1.124 ± 0.015
CysPhe: 0.651 ± 0.007
CysGly: 1.502 ± 0.024
CysHis: 0.562 ± 0.009
CysIle: 0.807 ± 0.012
CysLys: 0.992 ± 0.012
CysLeu: 1.754 ± 0.018
CysMet: 0.368 ± 0.006
CysAsn: 0.849 ± 0.013
CysPro: 1.203 ± 0.026
CysGln: 0.812 ± 0.013
CysArg: 1.195 ± 0.028
CysSer: 1.74 ± 0.019
CysThr: 1.098 ± 0.017
CysVal: 1.291 ± 0.021
CysTrp: 0.23 ± 0.005
CysTyr: 0.49 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.819 ± 0.023
AspCys: 1.042 ± 0.015
AspAsp: 4.151 ± 0.034
AspGlu: 4.145 ± 0.028
AspPhe: 1.921 ± 0.016
AspGly: 3.796 ± 0.024
AspHis: 1.26 ± 0.011
AspIle: 2.587 ± 0.017
AspLys: 2.869 ± 0.02
AspLeu: 5.078 ± 0.026
AspMet: 1.304 ± 0.012
AspAsn: 2.055 ± 0.016
AspPro: 2.853 ± 0.029
AspGln: 1.894 ± 0.012
AspArg: 2.985 ± 0.026
AspSer: 4.622 ± 0.026
AspThr: 2.619 ± 0.019
AspVal: 3.784 ± 0.02
AspTrp: 0.648 ± 0.009
AspTyr: 1.422 ± 0.013
AspXaa: 0.001 ± 0.0
Glu
GluAla: 4.904 ± 0.033
GluCys: 1.313 ± 0.027
GluAsp: 4.395 ± 0.026
GluGlu: 6.156 ± 0.057
GluPhe: 1.756 ± 0.014
GluGly: 3.461 ± 0.023
GluHis: 1.493 ± 0.016
GluIle: 2.66 ± 0.021
GluLys: 4.454 ± 0.045
GluLeu: 5.608 ± 0.033
GluMet: 1.494 ± 0.012
GluAsn: 2.758 ± 0.022
GluPro: 2.825 ± 0.031
GluGln: 2.757 ± 0.025
GluArg: 4.319 ± 0.04
GluSer: 4.435 ± 0.029
GluThr: 3.225 ± 0.026
GluVal: 4.116 ± 0.025
GluTrp: 0.639 ± 0.008
GluTyr: 1.483 ± 0.012
GluXaa: 0.001 ± 0.0
Phe
PheAla: 2.045 ± 0.016
PheCys: 0.735 ± 0.008
PheAsp: 1.723 ± 0.014
PheGlu: 1.873 ± 0.018
PhePhe: 1.22 ± 0.012
PheGly: 2.144 ± 0.02
PheHis: 0.89 ± 0.009
PheIle: 1.465 ± 0.015
PheLys: 1.707 ± 0.014
PheLeu: 3.095 ± 0.023
PheMet: 0.763 ± 0.01
PheAsn: 1.386 ± 0.013
PhePro: 1.666 ± 0.013
PheGln: 1.345 ± 0.011
PheArg: 1.849 ± 0.016
PheSer: 2.836 ± 0.022
PheThr: 1.809 ± 0.015
PheVal: 2.145 ± 0.017
PheTrp: 0.383 ± 0.006
PheTyr: 0.977 ± 0.01
PheXaa: 0.001 ± 0.0
Gly
GlyAla: 5.171 ± 0.044
GlyCys: 1.125 ± 0.014
GlyAsp: 3.533 ± 0.026
GlyGlu: 3.873 ± 0.031
GlyPhe: 2.104 ± 0.016
GlyGly: 6.198 ± 0.065
GlyHis: 1.857 ± 0.023
GlyIle: 2.344 ± 0.018
GlyLys: 3.242 ± 0.027
GlyLeu: 5.462 ± 0.032
GlyMet: 1.426 ± 0.017
GlyAsn: 2.334 ± 0.017
GlyPro: 4.205 ± 0.059
GlyGln: 2.669 ± 0.028
GlyArg: 3.949 ± 0.027
GlySer: 6.068 ± 0.038
GlyThr: 3.286 ± 0.023
GlyVal: 4.094 ± 0.028
GlyTrp: 0.736 ± 0.011
GlyTyr: 1.757 ± 0.019
GlyXaa: 0.002 ± 0.0
His
HisAla: 1.746 ± 0.016
HisCys: 0.583 ± 0.009
HisAsp: 1.192 ± 0.011
HisGlu: 1.303 ± 0.011
HisPhe: 0.94 ± 0.009
HisGly: 1.836 ± 0.024
HisHis: 1.31 ± 0.022
HisIle: 1.217 ± 0.011
HisLys: 1.238 ± 0.014
HisLeu: 2.797 ± 0.026
HisMet: 0.685 ± 0.01
HisAsn: 1.008 ± 0.011
HisPro: 1.75 ± 0.023
HisGln: 1.431 ± 0.017
HisArg: 1.668 ± 0.014
HisSer: 2.265 ± 0.017
HisThr: 1.641 ± 0.022
HisVal: 1.643 ± 0.014
HisTrp: 0.298 ± 0.005
HisTyr: 0.707 ± 0.009
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.781 ± 0.02
IleCys: 0.874 ± 0.01
IleAsp: 2.167 ± 0.018
IleGlu: 2.429 ± 0.024
IlePhe: 1.429 ± 0.014
IleGly: 2.191 ± 0.017
IleHis: 1.108 ± 0.012
IleIle: 1.925 ± 0.017
IleLys: 2.309 ± 0.02
IleLeu: 3.789 ± 0.025
IleMet: 0.908 ± 0.01
IleAsn: 1.817 ± 0.017
IlePro: 2.372 ± 0.019
IleGln: 1.777 ± 0.015
IleArg: 2.211 ± 0.015
IleSer: 3.345 ± 0.024
IleThr: 2.297 ± 0.021
IleVal: 2.695 ± 0.019
IleTrp: 0.38 ± 0.006
IleTyr: 1.003 ± 0.01
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.581 ± 0.024
LysCys: 1.079 ± 0.017
LysAsp: 3.121 ± 0.024
LysGlu: 4.225 ± 0.039
LysPhe: 1.731 ± 0.017
LysGly: 2.835 ± 0.025
LysHis: 1.314 ± 0.013
LysIle: 2.345 ± 0.021
LysLys: 4.569 ± 0.051
LysLeu: 4.708 ± 0.029
LysMet: 1.285 ± 0.012
LysAsn: 2.155 ± 0.02
LysPro: 3.029 ± 0.039
LysGln: 2.341 ± 0.016
LysArg: 3.419 ± 0.026
LysSer: 4.058 ± 0.03
LysThr: 2.918 ± 0.023
LysVal: 3.314 ± 0.023
LysTrp: 0.547 ± 0.008
LysTyr: 1.41 ± 0.011
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 7.011 ± 0.036
LeuCys: 1.807 ± 0.016
LeuAsp: 4.875 ± 0.025
LeuGlu: 6.035 ± 0.04
LeuPhe: 2.844 ± 0.022
LeuGly: 5.277 ± 0.033
LeuHis: 2.641 ± 0.021
LeuIle: 3.245 ± 0.024
LeuLys: 5.177 ± 0.032
LeuLeu: 9.194 ± 0.055
LeuMet: 1.868 ± 0.015
LeuAsn: 3.486 ± 0.019
LeuPro: 5.556 ± 0.032
LeuGln: 4.989 ± 0.035
LeuArg: 6.199 ± 0.045
LeuSer: 7.652 ± 0.029
LeuThr: 4.693 ± 0.027
LeuVal: 5.614 ± 0.029
LeuTrp: 0.984 ± 0.01
LeuTyr: 2.145 ± 0.015
LeuXaa: 0.002 ± 0.0
Met
MetAla: 1.857 ± 0.014
MetCys: 0.414 ± 0.006
MetAsp: 1.336 ± 0.011
MetGlu: 1.533 ± 0.013
MetPhe: 0.752 ± 0.01
MetGly: 1.442 ± 0.021
MetHis: 0.524 ± 0.007
MetIle: 0.779 ± 0.008
MetLys: 1.25 ± 0.011
MetLeu: 1.905 ± 0.015
MetMet: 0.627 ± 0.008
MetAsn: 0.833 ± 0.01
MetPro: 1.203 ± 0.012
MetGln: 1.013 ± 0.012
MetArg: 1.277 ± 0.012
MetSer: 1.87 ± 0.013
MetThr: 1.149 ± 0.011
MetVal: 1.35 ± 0.012
MetTrp: 0.262 ± 0.005
MetTyr: 0.594 ± 0.007
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.577 ± 0.019
AsnCys: 0.782 ± 0.014
AsnAsp: 1.899 ± 0.013
AsnGlu: 2.271 ± 0.016
AsnPhe: 1.442 ± 0.013
AsnGly: 2.735 ± 0.025
AsnHis: 1.058 ± 0.013
AsnIle: 2.014 ± 0.017
AsnLys: 2.201 ± 0.018
AsnLeu: 3.635 ± 0.023
AsnMet: 1.0 ± 0.01
AsnAsn: 2.111 ± 0.02
AsnPro: 2.097 ± 0.02
AsnGln: 1.658 ± 0.015
AsnArg: 1.928 ± 0.015
AsnSer: 3.413 ± 0.026
AsnThr: 2.067 ± 0.014
AsnVal: 2.58 ± 0.016
AsnTrp: 0.403 ± 0.006
AsnTyr: 1.066 ± 0.012
AsnXaa: 0.001 ± 0.0
Pro
ProAla: 5.699 ± 0.047
ProCys: 0.971 ± 0.032
ProAsp: 3.036 ± 0.024
ProGlu: 3.551 ± 0.029
ProPhe: 1.857 ± 0.018
ProGly: 4.825 ± 0.058
ProHis: 1.665 ± 0.02
ProIle: 2.064 ± 0.019
ProLys: 2.69 ± 0.026
ProLeu: 4.99 ± 0.025
ProMet: 1.171 ± 0.015
ProAsn: 2.18 ± 0.02
ProPro: 6.629 ± 0.08
ProGln: 2.953 ± 0.03
ProArg: 3.469 ± 0.035
ProSer: 6.204 ± 0.047
ProThr: 3.783 ± 0.042
ProVal: 4.075 ± 0.028
ProTrp: 0.584 ± 0.008
ProTyr: 1.468 ± 0.015
ProXaa: 0.003 ± 0.001
Gln
GlnAla: 3.446 ± 0.029
GlnCys: 0.837 ± 0.015
GlnAsp: 2.215 ± 0.017
GlnGlu: 2.963 ± 0.021
GlnPhe: 1.248 ± 0.013
GlnGly: 2.664 ± 0.029
GlnHis: 1.48 ± 0.017
GlnIle: 1.671 ± 0.015
GlnLys: 2.176 ± 0.021
GlnLeu: 4.376 ± 0.032
GlnMet: 1.051 ± 0.014
GlnAsn: 1.697 ± 0.014
GlnPro: 2.982 ± 0.034
GlnGln: 4.361 ± 0.079
GlnArg: 2.902 ± 0.021
GlnSer: 3.351 ± 0.024
GlnThr: 2.216 ± 0.015
GlnVal: 2.724 ± 0.02
GlnTrp: 0.462 ± 0.007
GlnTyr: 1.117 ± 0.012
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 4.32 ± 0.027
ArgCys: 1.21 ± 0.02
ArgAsp: 3.388 ± 0.032
ArgGlu: 3.903 ± 0.031
ArgPhe: 1.877 ± 0.013
ArgGly: 3.802 ± 0.033
ArgHis: 1.751 ± 0.015
ArgIle: 2.252 ± 0.018
ArgLys: 3.565 ± 0.022
ArgLeu: 5.55 ± 0.035
ArgMet: 1.224 ± 0.011
ArgAsn: 2.227 ± 0.017
ArgPro: 3.668 ± 0.033
ArgGln: 2.63 ± 0.02
ArgArg: 5.132 ± 0.038
ArgSer: 4.942 ± 0.026
ArgThr: 3.152 ± 0.022
ArgVal: 3.501 ± 0.021
ArgTrp: 0.701 ± 0.01
ArgTyr: 1.42 ± 0.012
ArgXaa: 0.002 ± 0.0
Ser
SerAla: 6.504 ± 0.036
SerCys: 1.545 ± 0.021
SerAsp: 4.629 ± 0.026
SerGlu: 4.869 ± 0.031
SerPhe: 2.665 ± 0.019
SerGly: 5.971 ± 0.036
SerHis: 2.209 ± 0.019
SerIle: 3.2 ± 0.021
SerLys: 4.281 ± 0.032
SerLeu: 7.594 ± 0.035
SerMet: 1.742 ± 0.014
SerAsn: 3.425 ± 0.025
SerPro: 6.29 ± 0.051
SerGln: 3.597 ± 0.026
SerArg: 4.821 ± 0.03
SerSer: 10.836 ± 0.076
SerThr: 5.41 ± 0.046
SerVal: 5.284 ± 0.028
SerTrp: 0.903 ± 0.011
SerTyr: 1.91 ± 0.017
SerXaa: 0.002 ± 0.0
Thr
ThrAla: 4.413 ± 0.033
ThrCys: 1.118 ± 0.019
ThrAsp: 2.606 ± 0.017
ThrGlu: 3.165 ± 0.03
ThrPhe: 1.814 ± 0.015
ThrGly: 3.584 ± 0.027
ThrHis: 1.472 ± 0.019
ThrIle: 2.291 ± 0.018
ThrLys: 2.515 ± 0.017
ThrLeu: 4.968 ± 0.024
ThrMet: 1.095 ± 0.01
ThrAsn: 2.015 ± 0.014
ThrPro: 4.358 ± 0.038
ThrGln: 2.052 ± 0.018
ThrArg: 2.693 ± 0.02
ThrSer: 5.331 ± 0.039
ThrThr: 4.05 ± 0.07
ThrVal: 3.923 ± 0.025
ThrTrp: 0.569 ± 0.008
ThrTyr: 1.271 ± 0.012
ThrXaa: 0.001 ± 0.0
Val
ValAla: 5.1 ± 0.029
ValCys: 1.563 ± 0.021
ValAsp: 3.6 ± 0.021
ValGlu: 4.044 ± 0.025
ValPhe: 2.166 ± 0.016
ValGly: 3.622 ± 0.024
ValHis: 1.749 ± 0.014
ValIle: 2.632 ± 0.018
ValLys: 3.378 ± 0.023
ValLeu: 6.169 ± 0.032
ValMet: 1.396 ± 0.011
ValAsn: 2.542 ± 0.021
ValPro: 4.0 ± 0.025
ValGln: 2.844 ± 0.021
ValArg: 3.681 ± 0.023
ValSer: 5.239 ± 0.025
ValThr: 3.753 ± 0.024
ValVal: 4.766 ± 0.029
ValTrp: 0.732 ± 0.009
ValTyr: 1.578 ± 0.014
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.668 ± 0.009
TrpCys: 0.229 ± 0.005
TrpAsp: 0.627 ± 0.01
TrpGlu: 0.642 ± 0.007
TrpPhe: 0.387 ± 0.007
TrpGly: 0.642 ± 0.01
TrpHis: 0.272 ± 0.005
TrpIle: 0.445 ± 0.007
TrpLys: 0.622 ± 0.009
TrpLeu: 1.096 ± 0.011
TrpMet: 0.286 ± 0.005
TrpAsn: 0.482 ± 0.007
TrpPro: 0.498 ± 0.007
TrpGln: 0.436 ± 0.006
TrpArg: 0.767 ± 0.01
TrpSer: 0.841 ± 0.01
TrpThr: 0.619 ± 0.009
TrpVal: 0.645 ± 0.008
TrpTrp: 0.178 ± 0.004
TrpTyr: 0.279 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.568 ± 0.015
TyrCys: 0.598 ± 0.008
TyrAsp: 1.351 ± 0.012
TyrGlu: 1.443 ± 0.014
TyrPhe: 1.032 ± 0.011
TyrGly: 1.667 ± 0.018
TyrHis: 0.762 ± 0.009
TyrIle: 1.088 ± 0.01
TyrLys: 1.266 ± 0.013
TyrLeu: 2.254 ± 0.015
TyrMet: 0.578 ± 0.008
TyrAsn: 1.078 ± 0.012
TyrPro: 1.33 ± 0.022
TyrGln: 1.129 ± 0.014
TyrArg: 1.518 ± 0.012
TyrSer: 1.97 ± 0.017
TyrThr: 1.28 ± 0.012
TyrVal: 1.506 ± 0.012
TyrTrp: 0.296 ± 0.007
TyrTyr: 0.849 ± 0.009
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.002 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.001 ± 0.0
XaaGly: 0.002 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.001 ± 0.0
XaaLeu: 0.002 ± 0.0
XaaMet: 0.001 ± 0.0
XaaAsn: 0.001 ± 0.0
XaaPro: 0.003 ± 0.0
XaaGln: 0.001 ± 0.0
XaaArg: 0.002 ± 0.0
XaaSer: 0.002 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.001 ± 0.001
Statistics based on 20529 proteins (12625742 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
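One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is reported. This is a hedged illustration only. The function names and toy sequences are hypothetical, and it is an assumption that the reported error is the standard deviation across 100 replicates; the source does not state which statistic is used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Proteins, not individual residues, are the resampling unit.
    The error is taken as the sample standard deviation of the
    replicate frequencies (an assumption of this sketch).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input: hypothetical sequences, not taken from the proteome.
toy_set = ["AAAA", "KKKK", "AKAK", "MKKL"]
toy_error = bootstrap_error(toy_set, "AA")
uniform_error = bootstrap_error(["AAK"] * 5, "AA")
```

Resampling whole proteins keeps the dependence between neighbouring residues within a sequence intact, which resampling individual dipeptides would not. When every protein is identical, as in the second toy call, all replicates agree and the estimated error is zero.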

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski