Amino acid dipeptide frequency for Trichinella murrelli

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one should be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
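To make the counting concrete, here is a minimal Python sketch of how dipeptide frequencies could be computed and compared with the uniform 2.5‰ baseline. It is not the database's own code. It assumes that overlapping residue pairs are counted within each protein, that pairs never span two proteins, and that any non-standard letter is mapped to X (Xaa). The function names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation from the text: 1/400 of all dipeptides, i.e. 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTED):
    """Label a frequency as over- or under-represented versus `expected`."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

For example, `classify(5.624)` labels the AlaAla value from the table as "over", while `classify(0.677)` labels AlaTrp as "under".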

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
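As a small worked illustration of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# LeuLeu is listed at 10.59 per mille in the table.
leu_leu_fraction = per_mille_to_fraction(10.59)  # about 0.01059
leu_leu_percent = per_mille_to_percent(10.59)    # about 1.059
```

Read this way, the uniform expectation of 0.25% corresponds to 2.5 in the units of the table.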

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.624AlaAla: 5.624 ± 0.047
1.502AlaCys: 1.502 ± 0.014
3.579AlaAsp: 3.579 ± 0.021
4.488AlaGlu: 4.488 ± 0.029
2.801AlaPhe: 2.801 ± 0.019
3.373AlaGly: 3.373 ± 0.028
1.299AlaHis: 1.299 ± 0.014
3.445AlaIle: 3.445 ± 0.022
3.631AlaLys: 3.631 ± 0.023
6.178AlaLeu: 6.178 ± 0.037
1.598AlaMet: 1.598 ± 0.015
2.971AlaAsn: 2.971 ± 0.023
2.371AlaPro: 2.371 ± 0.023
2.275AlaGln: 2.275 ± 0.019
2.979AlaArg: 2.979 ± 0.026
5.264AlaSer: 5.264 ± 0.031
3.581AlaThr: 3.581 ± 0.026
5.069AlaVal: 5.069 ± 0.028
0.677AlaTrp: 0.677 ± 0.009
1.864AlaTyr: 1.864 ± 0.015
0.001AlaXaa: 0.001 ± 0.0
Cys
1.436CysAla: 1.436 ± 0.017
0.887CysCys: 0.887 ± 0.014
1.355CysAsp: 1.355 ± 0.023
1.464CysGlu: 1.464 ± 0.023
1.303CysPhe: 1.303 ± 0.013
1.419CysGly: 1.419 ± 0.019
0.696CysHis: 0.696 ± 0.011
1.556CysIle: 1.556 ± 0.019
1.482CysLys: 1.482 ± 0.017
2.564CysLeu: 2.564 ± 0.024
0.586CysMet: 0.586 ± 0.009
1.202CysAsn: 1.202 ± 0.012
1.219CysPro: 1.219 ± 0.025
1.119CysGln: 1.119 ± 0.014
1.625CysArg: 1.625 ± 0.017
2.435CysSer: 2.435 ± 0.024
1.369CysThr: 1.369 ± 0.014
1.516CysVal: 1.516 ± 0.019
0.354CysTrp: 0.354 ± 0.007
0.777CysTyr: 0.777 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
3.321AspAla: 3.321 ± 0.02
1.39AspCys: 1.39 ± 0.019
3.979AspAsp: 3.979 ± 0.035
4.236AspGlu: 4.236 ± 0.028
2.364AspPhe: 2.364 ± 0.017
3.168AspGly: 3.168 ± 0.036
1.223AspHis: 1.223 ± 0.013
2.819AspIle: 2.819 ± 0.02
2.557AspLys: 2.557 ± 0.018
4.629AspLeu: 4.629 ± 0.027
1.168AspMet: 1.168 ± 0.013
2.366AspAsn: 2.366 ± 0.018
2.055AspPro: 2.055 ± 0.02
2.065AspGln: 2.065 ± 0.017
2.728AspArg: 2.728 ± 0.023
4.168AspSer: 4.168 ± 0.027
2.19AspThr: 2.19 ± 0.018
3.693AspVal: 3.693 ± 0.026
0.681AspTrp: 0.681 ± 0.011
1.619AspTyr: 1.619 ± 0.014
0.001AspXaa: 0.001 ± 0.0
Glu
3.942GluAla: 3.942 ± 0.031
1.347GluCys: 1.347 ± 0.022
3.197GluAsp: 3.197 ± 0.023
5.551GluGlu: 5.551 ± 0.045
2.452GluPhe: 2.452 ± 0.018
2.269GluGly: 2.269 ± 0.025
1.41GluHis: 1.41 ± 0.013
3.975GluIle: 3.975 ± 0.029
4.977GluLys: 4.977 ± 0.042
5.779GluLeu: 5.779 ± 0.033
1.947GluMet: 1.947 ± 0.016
4.066GluAsn: 4.066 ± 0.029
2.136GluPro: 2.136 ± 0.019
3.07GluGln: 3.07 ± 0.023
3.574GluArg: 3.574 ± 0.03
4.473GluSer: 4.473 ± 0.027
3.287GluThr: 3.287 ± 0.022
3.458GluVal: 3.458 ± 0.028
0.697GluTrp: 0.697 ± 0.011
1.717GluTyr: 1.717 ± 0.016
0.001GluXaa: 0.001 ± 0.0
Phe
2.806PheAla: 2.806 ± 0.021
1.402PheCys: 1.402 ± 0.014
2.606PheAsp: 2.606 ± 0.021
2.645PheGlu: 2.645 ± 0.019
2.218PhePhe: 2.218 ± 0.019
2.641PheGly: 2.641 ± 0.022
1.334PheHis: 1.334 ± 0.012
2.557PheIle: 2.557 ± 0.025
2.251PheLys: 2.251 ± 0.019
4.332PheLeu: 4.332 ± 0.031
0.895PheMet: 0.895 ± 0.011
2.1PheAsn: 2.1 ± 0.016
1.978PhePro: 1.978 ± 0.019
1.929PheGln: 1.929 ± 0.017
2.352PheArg: 2.352 ± 0.02
3.885PheSer: 3.885 ± 0.024
2.432PheThr: 2.432 ± 0.017
3.064PheVal: 3.064 ± 0.021
0.605PheTrp: 0.605 ± 0.009
1.715PheTyr: 1.715 ± 0.015
0.001PheXaa: 0.001 ± 0.0
Gly
2.738GlyAla: 2.738 ± 0.026
1.29GlyCys: 1.29 ± 0.014
2.605GlyAsp: 2.605 ± 0.027
2.891GlyGlu: 2.891 ± 0.028
2.231GlyPhe: 2.231 ± 0.019
3.281GlyGly: 3.281 ± 0.036
1.284GlyHis: 1.284 ± 0.013
2.954GlyIle: 2.954 ± 0.02
3.416GlyLys: 3.416 ± 0.027
4.049GlyLeu: 4.049 ± 0.035
1.215GlyMet: 1.215 ± 0.014
2.5GlyAsn: 2.5 ± 0.021
1.787GlyPro: 1.787 ± 0.025
2.043GlyGln: 2.043 ± 0.02
3.284GlyArg: 3.284 ± 0.032
4.223GlySer: 4.223 ± 0.031
2.605GlyThr: 2.605 ± 0.022
3.054GlyVal: 3.054 ± 0.021
0.652GlyTrp: 0.652 ± 0.011
1.69GlyTyr: 1.69 ± 0.018
0.0GlyXaa: 0.0 ± 0.0
His
1.442HisAla: 1.442 ± 0.015
0.884HisCys: 0.884 ± 0.013
1.048HisAsp: 1.048 ± 0.014
1.227HisGlu: 1.227 ± 0.014
1.405HisPhe: 1.405 ± 0.014
1.328HisGly: 1.328 ± 0.013
0.969HisHis: 0.969 ± 0.017
1.266HisIle: 1.266 ± 0.012
0.992HisLys: 0.992 ± 0.012
2.729HisLeu: 2.729 ± 0.02
0.559HisMet: 0.559 ± 0.01
0.972HisAsn: 0.972 ± 0.012
1.227HisPro: 1.227 ± 0.012
1.081HisGln: 1.081 ± 0.013
1.555HisArg: 1.555 ± 0.016
2.059HisSer: 2.059 ± 0.017
1.094HisThr: 1.094 ± 0.012
1.593HisVal: 1.593 ± 0.014
0.356HisTrp: 0.356 ± 0.007
0.875HisTyr: 0.875 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
3.711IleAla: 3.711 ± 0.021
1.698IleCys: 1.698 ± 0.019
3.024IleAsp: 3.024 ± 0.023
3.319IleGlu: 3.319 ± 0.023
2.821IlePhe: 2.821 ± 0.025
2.856IleGly: 2.856 ± 0.022
1.335IleHis: 1.335 ± 0.014
3.154IleIle: 3.154 ± 0.024
2.836IleLys: 2.836 ± 0.023
5.313IleLeu: 5.313 ± 0.037
1.202IleMet: 1.202 ± 0.013
2.478IleAsn: 2.478 ± 0.019
2.592IlePro: 2.592 ± 0.021
2.184IleGln: 2.184 ± 0.019
3.189IleArg: 3.189 ± 0.021
4.788IleSer: 4.788 ± 0.029
2.895IleThr: 2.895 ± 0.028
3.66IleVal: 3.66 ± 0.025
0.666IleTrp: 0.666 ± 0.009
1.911IleTyr: 1.911 ± 0.02
0.0IleXaa: 0.0 ± 0.0
Lys
3.605LysAla: 3.605 ± 0.024
1.499LysCys: 1.499 ± 0.021
2.644LysAsp: 2.644 ± 0.023
3.962LysGlu: 3.962 ± 0.033
2.555LysPhe: 2.555 ± 0.019
2.265LysGly: 2.265 ± 0.027
1.518LysHis: 1.518 ± 0.014
3.639LysIle: 3.639 ± 0.024
4.675LysLys: 4.675 ± 0.052
6.118LysLeu: 6.118 ± 0.037
1.726LysMet: 1.726 ± 0.015
3.187LysAsn: 3.187 ± 0.023
2.44LysPro: 2.44 ± 0.024
2.896LysGln: 2.896 ± 0.022
3.886LysArg: 3.886 ± 0.03
4.666LysSer: 4.666 ± 0.031
3.007LysThr: 3.007 ± 0.022
3.366LysVal: 3.366 ± 0.023
0.751LysTrp: 0.751 ± 0.01
1.863LysTyr: 1.863 ± 0.015
0.0LysXaa: 0.0 ± 0.0
Leu
6.053LeuAla: 6.053 ± 0.03
2.48LeuCys: 2.48 ± 0.022
4.513LeuAsp: 4.513 ± 0.025
5.799LeuGlu: 5.799 ± 0.043
4.584LeuPhe: 4.584 ± 0.034
3.872LeuGly: 3.872 ± 0.023
2.713LeuHis: 2.713 ± 0.02
5.394LeuIle: 5.394 ± 0.035
6.535LeuLys: 6.535 ± 0.037
10.59LeuLeu: 10.59 ± 0.066
2.277LeuMet: 2.277 ± 0.019
4.939LeuAsn: 4.939 ± 0.031
4.749LeuPro: 4.749 ± 0.032
4.472LeuGln: 4.472 ± 0.027
5.498LeuArg: 5.498 ± 0.033
7.709LeuSer: 7.709 ± 0.043
5.029LeuThr: 5.029 ± 0.027
5.513LeuVal: 5.513 ± 0.028
1.062LeuTrp: 1.062 ± 0.013
2.917LeuTyr: 2.917 ± 0.027
0.0LeuXaa: 0.0 ± 0.0
Met
1.629MetAla: 1.629 ± 0.014
0.492MetCys: 0.492 ± 0.008
1.29MetAsp: 1.29 ± 0.011
1.662MetGlu: 1.662 ± 0.017
1.032MetPhe: 1.032 ± 0.012
0.912MetGly: 0.912 ± 0.012
0.645MetHis: 0.645 ± 0.009
1.325MetIle: 1.325 ± 0.012
1.795MetLys: 1.795 ± 0.016
2.466MetLeu: 2.466 ± 0.021
0.689MetMet: 0.689 ± 0.011
1.363MetAsn: 1.363 ± 0.014
1.142MetPro: 1.142 ± 0.013
1.18MetGln: 1.18 ± 0.014
1.28MetArg: 1.28 ± 0.014
1.723MetSer: 1.723 ± 0.015
1.268MetThr: 1.268 ± 0.015
1.375MetVal: 1.375 ± 0.015
0.235MetTrp: 0.235 ± 0.005
0.676MetTyr: 0.676 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
3.265AsnAla: 3.265 ± 0.023
1.414AsnCys: 1.414 ± 0.013
2.799AsnAsp: 2.799 ± 0.023
3.27AsnGlu: 3.27 ± 0.024
2.523AsnPhe: 2.523 ± 0.021
3.04AsnGly: 3.04 ± 0.028
1.044AsnHis: 1.044 ± 0.013
2.738AsnIle: 2.738 ± 0.019
2.537AsnLys: 2.537 ± 0.022
4.537AsnLeu: 4.537 ± 0.029
1.156AsnMet: 1.156 ± 0.014
2.977AsnAsn: 2.977 ± 0.037
1.927AsnPro: 1.927 ± 0.016
1.798AsnGln: 1.798 ± 0.018
2.647AsnArg: 2.647 ± 0.022
4.265AsnSer: 4.265 ± 0.029
2.251AsnThr: 2.251 ± 0.018
3.443AsnVal: 3.443 ± 0.023
0.611AsnTrp: 0.611 ± 0.009
1.687AsnTyr: 1.687 ± 0.017
0.001AsnXaa: 0.001 ± 0.0
Pro
2.85ProAla: 2.85 ± 0.022
0.945ProCys: 0.945 ± 0.015
2.252ProAsp: 2.252 ± 0.02
2.608ProGlu: 2.608 ± 0.025
1.966ProPhe: 1.966 ± 0.016
2.327ProGly: 2.327 ± 0.06
0.921ProHis: 0.921 ± 0.012
2.181ProIle: 2.181 ± 0.02
2.36ProLys: 2.36 ± 0.022
4.047ProLeu: 4.047 ± 0.027
0.95ProMet: 0.95 ± 0.011
2.074ProAsn: 2.074 ± 0.017
2.988ProPro: 2.988 ± 0.039
1.473ProGln: 1.473 ± 0.017
1.962ProArg: 1.962 ± 0.018
3.774ProSer: 3.774 ± 0.028
2.652ProThr: 2.652 ± 0.022
3.144ProVal: 3.144 ± 0.022
0.486ProTrp: 0.486 ± 0.009
1.353ProTyr: 1.353 ± 0.015
0.0ProXaa: 0.0 ± 0.0
Gln
2.583GlnAla: 2.583 ± 0.018
1.152GlnCys: 1.152 ± 0.018
1.464GlnAsp: 1.464 ± 0.015
2.162GlnGlu: 2.162 ± 0.02
1.939GlnPhe: 1.939 ± 0.017
1.486GlnGly: 1.486 ± 0.015
1.213GlnHis: 1.213 ± 0.014
2.401GlnIle: 2.401 ± 0.019
2.51GlnLys: 2.51 ± 0.02
4.867GlnLeu: 4.867 ± 0.034
1.211GlnMet: 1.211 ± 0.014
2.108GlnAsn: 2.108 ± 0.017
1.878GlnPro: 1.878 ± 0.022
3.545GlnGln: 3.545 ± 0.063
2.688GlnArg: 2.688 ± 0.018
3.3GlnSer: 3.3 ± 0.019
2.121GlnThr: 2.121 ± 0.015
2.106GlnVal: 2.106 ± 0.018
0.616GlnTrp: 0.616 ± 0.009
1.282GlnTyr: 1.282 ± 0.013
0.0GlnXaa: 0.0 ± 0.0
Arg
3.032ArgAla: 3.032 ± 0.022
1.619ArgCys: 1.619 ± 0.021
2.45ArgAsp: 2.45 ± 0.019
3.006ArgGlu: 3.006 ± 0.026
2.612ArgPhe: 2.612 ± 0.02
2.52ArgGly: 2.52 ± 0.027
1.562ArgHis: 1.562 ± 0.016
3.269ArgIle: 3.269 ± 0.023
3.764ArgLys: 3.764 ± 0.028
5.919ArgLeu: 5.919 ± 0.033
1.453ArgMet: 1.453 ± 0.015
2.728ArgAsn: 2.728 ± 0.019
2.366ArgPro: 2.366 ± 0.021
2.647ArgGln: 2.647 ± 0.022
4.836ArgArg: 4.836 ± 0.043
4.533ArgSer: 4.533 ± 0.035
2.812ArgThr: 2.812 ± 0.023
3.002ArgVal: 3.002 ± 0.02
0.818ArgTrp: 0.818 ± 0.011
1.714ArgTyr: 1.714 ± 0.016
0.001ArgXaa: 0.001 ± 0.0
Ser
5.558SerAla: 5.558 ± 0.027
2.094SerCys: 2.094 ± 0.02
4.508SerAsp: 4.508 ± 0.03
4.871SerGlu: 4.871 ± 0.031
3.611SerPhe: 3.611 ± 0.02
4.565SerGly: 4.565 ± 0.03
1.65SerHis: 1.65 ± 0.017
4.21SerIle: 4.21 ± 0.029
4.76SerLys: 4.76 ± 0.031
7.456SerLeu: 7.456 ± 0.038
1.824SerMet: 1.824 ± 0.016
4.151SerAsn: 4.151 ± 0.03
3.524SerPro: 3.524 ± 0.029
2.842SerGln: 2.842 ± 0.021
4.401SerArg: 4.401 ± 0.029
9.56SerSer: 9.56 ± 0.072
5.259SerThr: 5.259 ± 0.03
5.632SerVal: 5.632 ± 0.027
0.933SerTrp: 0.933 ± 0.012
2.204SerTyr: 2.204 ± 0.016
0.0SerXaa: 0.0 ± 0.0
Thr
4.191ThrAla: 4.191 ± 0.028
1.327ThrCys: 1.327 ± 0.021
2.814ThrAsp: 2.814 ± 0.02
3.143ThrGlu: 3.143 ± 0.023
2.379ThrPhe: 2.379 ± 0.018
2.911ThrGly: 2.911 ± 0.025
0.942ThrHis: 0.942 ± 0.012
2.883ThrIle: 2.883 ± 0.02
2.754ThrLys: 2.754 ± 0.023
5.055ThrLeu: 5.055 ± 0.028
1.273ThrMet: 1.273 ± 0.013
2.481ThrAsn: 2.481 ± 0.021
2.397ThrPro: 2.397 ± 0.022
1.508ThrGln: 1.508 ± 0.015
2.273ThrArg: 2.273 ± 0.02
4.513ThrSer: 4.513 ± 0.029
3.885ThrThr: 3.885 ± 0.043
4.466ThrVal: 4.466 ± 0.03
0.559ThrTrp: 0.559 ± 0.009
1.395ThrTyr: 1.395 ± 0.014
0.001ThrXaa: 0.001 ± 0.0
Val
4.297ValAla: 4.297 ± 0.029
1.677ValCys: 1.677 ± 0.021
4.098ValAsp: 4.098 ± 0.027
4.679ValGlu: 4.679 ± 0.028
2.619ValPhe: 2.619 ± 0.02
3.388ValGly: 3.388 ± 0.023
1.763ValHis: 1.763 ± 0.015
3.496ValIle: 3.496 ± 0.024
3.812ValLys: 3.812 ± 0.026
5.777ValLeu: 5.777 ± 0.032
1.392ValMet: 1.392 ± 0.014
3.041ValAsn: 3.041 ± 0.021
2.727ValPro: 2.727 ± 0.019
2.776ValGln: 2.776 ± 0.021
3.39ValArg: 3.39 ± 0.024
4.773ValSer: 4.773 ± 0.024
3.27ValThr: 3.27 ± 0.026
4.632ValVal: 4.632 ± 0.028
0.717ValTrp: 0.717 ± 0.011
1.851ValTyr: 1.851 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
0.614TrpAla: 0.614 ± 0.01
0.284TrpCys: 0.284 ± 0.006
0.543TrpAsp: 0.543 ± 0.009
0.561TrpGlu: 0.561 ± 0.009
0.571TrpPhe: 0.571 ± 0.01
0.403TrpGly: 0.403 ± 0.008
0.333TrpHis: 0.333 ± 0.006
0.789TrpIle: 0.789 ± 0.012
0.966TrpLys: 0.966 ± 0.013
1.316TrpLeu: 1.316 ± 0.013
0.352TrpMet: 0.352 ± 0.007
0.769TrpAsn: 0.769 ± 0.01
0.507TrpPro: 0.507 ± 0.008
0.546TrpGln: 0.546 ± 0.008
0.751TrpArg: 0.751 ± 0.011
1.019TrpSer: 1.019 ± 0.012
0.69TrpThr: 0.69 ± 0.012
0.499TrpVal: 0.499 ± 0.008
0.187TrpTrp: 0.187 ± 0.005
0.413TrpTyr: 0.413 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.869TyrAla: 1.869 ± 0.017
0.98TyrCys: 0.98 ± 0.011
1.651TyrAsp: 1.651 ± 0.018
1.778TyrGlu: 1.778 ± 0.018
1.762TyrPhe: 1.762 ± 0.017
1.768TyrGly: 1.768 ± 0.017
0.806TyrHis: 0.806 ± 0.01
1.583TyrIle: 1.583 ± 0.016
1.657TyrLys: 1.657 ± 0.019
2.978TyrLeu: 2.978 ± 0.021
0.704TyrMet: 0.704 ± 0.01
1.456TyrAsn: 1.456 ± 0.016
1.305TyrPro: 1.305 ± 0.015
1.183TyrGln: 1.183 ± 0.013
1.766TyrArg: 1.766 ± 0.016
2.484TyrSer: 2.484 ± 0.021
1.453TyrThr: 1.453 ± 0.015
1.893TyrVal: 1.893 ± 0.015
0.448TyrTrp: 0.448 ± 0.011
1.212TyrTyr: 1.212 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.013XaaXaa: 0.013 ± 0.004
Statistics based on 17983 proteins (8302106 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
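A protein-level bootstrap can be sketched as follows. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates (the source does not say which statistic is used). The names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    Requires replicates >= 2.
    """
    rng = random.Random(seed)
    # Count pairs once per protein; replicates only re-weight these counts.
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps pairs from the same protein together, so proteins with long repeats contribute their full variability to the error estimate.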

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski