Amino acid dipeptide frequency for Rhizophagus irregularis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
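As a rough illustration of what such a calculation can look like, the sketch below counts overlapping residue pairs in a list of sequences and reports each pair's share in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, the overlapping window, and the mapping of every non-standard letter to X (Xaa) are assumptions made for this example, not a description of the actual Proteome-pI pipeline.

```python
from collections import Counter

# The 20 standard residues (one-letter codes); anything else is counted as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before pairing.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping window of width 2: positions (0,1), (1,2), ...
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real R. irregularis sequences.
freqs = dipeptide_frequencies(["MKKLL", "MKUA"])
```

Pairs are counted within each protein only, so no dipeptide spans the boundary between two sequences.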

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
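The 0.25% figure is simply 1/400. The short sketch below works through that arithmetic and labels a few table values against it. It assumes the uniform 1/400 baseline is the colouring threshold, as the text implies. The `classify` helper is hypothetical, and a composition-aware baseline (the product of the two single amino acid frequencies) would give different labels.

```python
# Expected per-dipeptide frequency if all pairs were equally likely.
expected_400 = 1 / 400   # 20 x 20 standard dipeptides -> 0.25% = 2.5 per mille
expected_441 = 1 / 441   # 21 x 21 when Xaa is included -> about 2.27 per mille

def classify(per_mille, expected=expected_400):
    """Label a per-mille frequency relative to the uniform expectation."""
    fraction = per_mille * 1e-3
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "as expected"

# Three values taken from the table on this page (in per mille).
examples = {"LeuLeu": 8.276, "AlaThr": 2.553, "TrpTrp": 0.161}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this baseline LeuLeu and AlaThr come out as overrepresented and TrpTrp as underrepresented.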

All values are given in per mille (‰); to convert them to fractions, multiply by 10^-3.

For more information see sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.634 ± 0.024 | 0.688 ± 0.011 | 2.215 ± 0.019 | 2.838 ± 0.024 | 2.122 ± 0.019 | 2.024 ± 0.022 | 0.917 ± 0.01 | 3.775 ± 0.022 | 3.461 ± 0.023 | 4.234 ± 0.032 | 0.915 ± 0.012 | 2.631 ± 0.023 | 1.719 ± 0.02 | 1.601 ± 0.019 | 2.026 ± 0.016 | 3.425 ± 0.024 | 2.553 ± 0.019 | 2.293 ± 0.017 | 0.455 ± 0.009 | 1.608 ± 0.015 | 0.0 ± 0.0 |
| Cys | 0.621 ± 0.009 | 0.284 ± 0.007 | 0.855 ± 0.012 | 0.966 ± 0.011 | 0.726 ± 0.011 | 0.976 ± 0.015 | 0.392 ± 0.007 | 1.139 ± 0.013 | 1.209 ± 0.015 | 1.515 ± 0.015 | 0.289 ± 0.006 | 1.014 ± 0.013 | 0.695 ± 0.011 | 0.591 ± 0.009 | 0.67 ± 0.01 | 1.147 ± 0.014 | 0.718 ± 0.009 | 0.699 ± 0.01 | 0.261 ± 0.006 | 0.689 ± 0.012 | 0.0 ± 0.0 |
| Asp | 2.334 ± 0.02 | 0.806 ± 0.012 | 4.967 ± 0.042 | 4.813 ± 0.03 | 2.913 ± 0.023 | 2.629 ± 0.021 | 1.162 ± 0.013 | 4.954 ± 0.027 | 4.286 ± 0.032 | 5.141 ± 0.028 | 1.135 ± 0.012 | 4.159 ± 0.026 | 2.218 ± 0.019 | 1.834 ± 0.017 | 2.074 ± 0.021 | 3.997 ± 0.025 | 2.711 ± 0.019 | 3.001 ± 0.022 | 0.704 ± 0.01 | 2.448 ± 0.02 | 0.001 ± 0.0 |
| Glu | 2.905 ± 0.021 | 0.958 ± 0.013 | 4.021 ± 0.027 | 6.349 ± 0.047 | 3.249 ± 0.021 | 2.514 ± 0.019 | 1.265 ± 0.014 | 6.369 ± 0.036 | 6.261 ± 0.043 | 6.517 ± 0.032 | 1.481 ± 0.015 | 5.329 ± 0.032 | 1.914 ± 0.019 | 2.452 ± 0.023 | 3.224 ± 0.026 | 4.522 ± 0.025 | 3.374 ± 0.023 | 3.427 ± 0.024 | 0.885 ± 0.011 | 2.747 ± 0.021 | 0.001 ± 0.0 |
| Phe | 2.011 ± 0.018 | 0.809 ± 0.009 | 2.861 ± 0.019 | 3.063 ± 0.024 | 2.234 ± 0.02 | 2.627 ± 0.024 | 1.127 ± 0.013 | 3.87 ± 0.025 | 3.331 ± 0.026 | 4.235 ± 0.027 | 0.952 ± 0.01 | 3.185 ± 0.023 | 1.761 ± 0.013 | 1.648 ± 0.015 | 1.898 ± 0.015 | 3.738 ± 0.023 | 2.574 ± 0.018 | 2.455 ± 0.019 | 0.591 ± 0.01 | 1.94 ± 0.018 | 0.001 ± 0.0 |
| Gly | 1.897 ± 0.019 | 0.743 ± 0.01 | 2.368 ± 0.018 | 2.617 ± 0.023 | 2.221 ± 0.021 | 2.842 ± 0.033 | 1.043 ± 0.012 | 3.812 ± 0.024 | 3.624 ± 0.031 | 3.811 ± 0.024 | 0.9 ± 0.012 | 2.891 ± 0.021 | 1.417 ± 0.016 | 1.494 ± 0.016 | 2.128 ± 0.02 | 3.354 ± 0.024 | 2.483 ± 0.024 | 2.463 ± 0.022 | 0.607 ± 0.009 | 1.856 ± 0.017 | 0.001 ± 0.0 |
| His | 0.958 ± 0.012 | 0.369 ± 0.008 | 1.171 ± 0.013 | 1.312 ± 0.014 | 1.072 ± 0.011 | 0.983 ± 0.012 | 0.71 ± 0.013 | 1.716 ± 0.016 | 1.44 ± 0.014 | 2.148 ± 0.017 | 0.427 ± 0.007 | 1.44 ± 0.015 | 1.014 ± 0.012 | 0.927 ± 0.013 | 0.993 ± 0.013 | 1.665 ± 0.017 | 1.017 ± 0.01 | 1.16 ± 0.012 | 0.273 ± 0.006 | 0.909 ± 0.011 | 0.0 ± 0.0 |
| Ile | 3.53 ± 0.021 | 1.346 ± 0.013 | 4.71 ± 0.027 | 5.346 ± 0.024 | 3.747 ± 0.027 | 3.491 ± 0.023 | 1.759 ± 0.016 | 6.969 ± 0.041 | 6.486 ± 0.039 | 7.533 ± 0.037 | 1.6 ± 0.016 | 5.65 ± 0.031 | 3.815 ± 0.022 | 2.94 ± 0.019 | 3.418 ± 0.019 | 6.539 ± 0.032 | 4.424 ± 0.023 | 3.981 ± 0.023 | 0.921 ± 0.013 | 3.269 ± 0.026 | 0.002 ± 0.0 |
| Lys | 3.541 ± 0.026 | 1.236 ± 0.016 | 4.528 ± 0.031 | 6.383 ± 0.038 | 3.642 ± 0.025 | 3.025 ± 0.026 | 1.583 ± 0.014 | 6.504 ± 0.04 | 7.616 ± 0.07 | 7.254 ± 0.036 | 1.557 ± 0.014 | 5.788 ± 0.044 | 2.576 ± 0.023 | 2.795 ± 0.024 | 4.111 ± 0.03 | 5.711 ± 0.029 | 3.718 ± 0.024 | 3.9 ± 0.019 | 0.923 ± 0.011 | 3.382 ± 0.023 | 0.001 ± 0.0 |
| Leu | 4.37 ± 0.028 | 1.467 ± 0.013 | 4.935 ± 0.025 | 6.333 ± 0.033 | 4.191 ± 0.026 | 3.928 ± 0.028 | 2.053 ± 0.017 | 6.801 ± 0.038 | 7.547 ± 0.04 | 8.276 ± 0.043 | 1.781 ± 0.015 | 5.962 ± 0.029 | 4.044 ± 0.026 | 3.722 ± 0.024 | 4.313 ± 0.03 | 7.239 ± 0.037 | 4.741 ± 0.02 | 4.427 ± 0.024 | 1.144 ± 0.013 | 3.444 ± 0.027 | 0.001 ± 0.0 |
| Met | 1.118 ± 0.013 | 0.251 ± 0.005 | 1.226 ± 0.012 | 1.534 ± 0.018 | 0.829 ± 0.01 | 0.861 ± 0.011 | 0.342 ± 0.007 | 1.568 ± 0.016 | 1.627 ± 0.013 | 1.593 ± 0.016 | 0.5 ± 0.009 | 1.382 ± 0.014 | 0.707 ± 0.009 | 0.687 ± 0.011 | 0.856 ± 0.011 | 1.689 ± 0.016 | 1.12 ± 0.011 | 1.04 ± 0.012 | 0.199 ± 0.005 | 0.615 ± 0.008 | 0.001 ± 0.0 |
| Asn | 2.66 ± 0.019 | 1.021 ± 0.011 | 4.524 ± 0.029 | 5.096 ± 0.044 | 3.326 ± 0.022 | 3.104 ± 0.023 | 1.476 ± 0.014 | 6.151 ± 0.032 | 5.224 ± 0.034 | 6.336 ± 0.037 | 1.358 ± 0.012 | 6.831 ± 0.05 | 2.662 ± 0.021 | 2.535 ± 0.021 | 2.637 ± 0.021 | 5.436 ± 0.03 | 3.532 ± 0.024 | 3.552 ± 0.038 | 0.81 ± 0.01 | 2.898 ± 0.024 | 0.001 ± 0.0 |
| Pro | 1.745 ± 0.023 | 0.479 ± 0.009 | 2.261 ± 0.017 | 2.683 ± 0.021 | 1.888 ± 0.017 | 1.688 ± 0.016 | 0.798 ± 0.012 | 3.227 ± 0.02 | 2.933 ± 0.023 | 3.4 ± 0.024 | 0.656 ± 0.01 | 2.84 ± 0.019 | 2.719 ± 0.036 | 1.588 ± 0.02 | 1.575 ± 0.014 | 3.725 ± 0.028 | 2.593 ± 0.023 | 2.107 ± 0.018 | 0.386 ± 0.007 | 1.49 ± 0.015 | 0.0 ± 0.0 |
| Gln | 1.598 ± 0.015 | 0.525 ± 0.009 | 1.898 ± 0.016 | 2.716 ± 0.026 | 1.634 ± 0.014 | 1.334 ± 0.015 | 0.935 ± 0.012 | 2.986 ± 0.018 | 3.049 ± 0.023 | 3.464 ± 0.022 | 0.784 ± 0.01 | 2.765 ± 0.022 | 1.529 ± 0.021 | 2.456 ± 0.041 | 1.814 ± 0.016 | 2.64 ± 0.022 | 1.924 ± 0.015 | 1.807 ± 0.016 | 0.38 ± 0.007 | 1.473 ± 0.015 | 0.0 ± 0.0 |
| Arg | 1.965 ± 0.017 | 0.689 ± 0.009 | 2.409 ± 0.026 | 2.986 ± 0.023 | 2.006 ± 0.021 | 1.878 ± 0.021 | 1.04 ± 0.012 | 3.464 ± 0.02 | 3.995 ± 0.027 | 3.949 ± 0.022 | 0.905 ± 0.011 | 3.11 ± 0.028 | 1.813 ± 0.017 | 1.741 ± 0.018 | 2.672 ± 0.026 | 3.27 ± 0.026 | 2.205 ± 0.017 | 2.212 ± 0.018 | 0.564 ± 0.01 | 1.689 ± 0.017 | 0.001 ± 0.0 |
| Ser | 3.26 ± 0.025 | 1.132 ± 0.015 | 4.472 ± 0.026 | 4.808 ± 0.025 | 3.747 ± 0.022 | 3.447 ± 0.021 | 1.654 ± 0.018 | 5.871 ± 0.029 | 5.802 ± 0.027 | 7.268 ± 0.036 | 1.372 ± 0.015 | 5.502 ± 0.03 | 3.507 ± 0.03 | 3.13 ± 0.023 | 3.431 ± 0.024 | 8.206 ± 0.056 | 4.724 ± 0.03 | 3.594 ± 0.024 | 0.802 ± 0.01 | 2.769 ± 0.019 | 0.001 ± 0.0 |
| Thr | 2.436 ± 0.02 | 0.831 ± 0.01 | 2.64 ± 0.022 | 3.151 ± 0.027 | 2.619 ± 0.019 | 2.429 ± 0.022 | 1.079 ± 0.013 | 4.301 ± 0.023 | 3.997 ± 0.024 | 4.878 ± 0.031 | 0.985 ± 0.011 | 3.58 ± 0.025 | 2.717 ± 0.023 | 1.844 ± 0.017 | 2.268 ± 0.019 | 4.885 ± 0.031 | 3.652 ± 0.033 | 2.619 ± 0.021 | 0.613 ± 0.01 | 1.948 ± 0.014 | 0.001 ± 0.0 |
| Val | 2.507 ± 0.021 | 0.753 ± 0.009 | 3.063 ± 0.019 | 3.442 ± 0.022 | 2.167 ± 0.019 | 2.333 ± 0.021 | 1.075 ± 0.012 | 4.018 ± 0.024 | 3.94 ± 0.038 | 4.457 ± 0.027 | 1.046 ± 0.011 | 3.29 ± 0.023 | 2.205 ± 0.025 | 1.793 ± 0.017 | 2.175 ± 0.018 | 3.683 ± 0.024 | 2.868 ± 0.017 | 2.819 ± 0.024 | 0.541 ± 0.009 | 1.848 ± 0.015 | 0.001 ± 0.0 |
| Trp | 0.501 ± 0.008 | 0.28 ± 0.007 | 0.712 ± 0.012 | 0.778 ± 0.011 | 0.549 ± 0.01 | 0.463 ± 0.008 | 0.251 ± 0.006 | 1.009 ± 0.012 | 1.168 ± 0.013 | 0.925 ± 0.012 | 0.301 ± 0.006 | 0.911 ± 0.011 | 0.32 ± 0.006 | 0.37 ± 0.007 | 0.563 ± 0.008 | 0.779 ± 0.01 | 0.637 ± 0.01 | 0.579 ± 0.01 | 0.161 ± 0.005 | 0.485 ± 0.009 | 0.0 ± 0.0 |
| Tyr | 1.543 ± 0.013 | 0.788 ± 0.011 | 2.446 ± 0.02 | 2.552 ± 0.019 | 2.027 ± 0.017 | 1.993 ± 0.019 | 1.042 ± 0.012 | 2.973 ± 0.019 | 2.778 ± 0.024 | 3.781 ± 0.024 | 0.801 ± 0.01 | 2.91 ± 0.025 | 1.428 ± 0.014 | 1.564 ± 0.015 | 1.718 ± 0.015 | 2.85 ± 0.021 | 1.905 ± 0.017 | 1.887 ± 0.014 | 0.522 ± 0.009 | 1.964 ± 0.02 | 0.001 ± 0.0 |
| Xaa | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 25730 proteins (8183250 amino acids)
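To give a feel for what a per-mille value means in absolute terms, the sketch below turns a table entry into an approximate occurrence count using the two totals stated above. It assumes that dipeptides are counted as overlapping pairs within each protein, so a protein of length L contributes L - 1 pairs. The page does not state this, and `estimated_count` is a hypothetical helper.

```python
N_PROTEINS = 25730        # from the statistics line on this page
N_AMINO_ACIDS = 8183250   # from the statistics line on this page

# Assumption: overlapping pairs within each protein, i.e. L - 1 pairs per protein,
# which sums to (total residues - number of proteins).
n_dipeptides = N_AMINO_ACIDS - N_PROTEINS

def estimated_count(per_mille, total=n_dipeptides):
    """Approximate number of occurrences for a frequency given in per mille."""
    return per_mille * 1e-3 * total

leu_leu_count = estimated_count(8.276)   # LeuLeu entry from the table
```

Under that assumption there are 8,157,520 dipeptide positions, and the LeuLeu entry of 8.276‰ corresponds to roughly 67,500 occurrences.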

Note: The error was estimated by bootstrapping (×100) at the protein level.
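A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates serves as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions for illustration. The page only states that 100 replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) in per mille of one dipeptide's
    frequency, resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # One replicate: draw as many proteins as in the original set.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy sequences, not real R. irregularis proteins.
toy = ["MKKLLA", "MLLKK", "MAAKL", "MKLLL"]
mean_ll, err_ll = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual dipeptides keeps all pairs from one protein together, so the estimate reflects variation between proteins.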

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski