Amino acid dipepetide frequency for Aphis glycines (Soybean aphid)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.178AlaAla: 4.178 ± 0.047
0.986AlaCys: 0.986 ± 0.022
2.491AlaAsp: 2.491 ± 0.02
2.697AlaGlu: 2.697 ± 0.021
2.014AlaPhe: 2.014 ± 0.018
2.487AlaGly: 2.487 ± 0.026
1.112AlaHis: 1.112 ± 0.013
3.308AlaIle: 3.308 ± 0.024
3.031AlaLys: 3.031 ± 0.024
4.445AlaLeu: 4.445 ± 0.029
1.18AlaMet: 1.18 ± 0.014
2.397AlaAsn: 2.397 ± 0.017
2.056AlaPro: 2.056 ± 0.025
1.641AlaGln: 1.641 ± 0.015
2.135AlaArg: 2.135 ± 0.022
3.68AlaSer: 3.68 ± 0.025
2.937AlaThr: 2.937 ± 0.022
3.55AlaVal: 3.55 ± 0.027
0.473AlaTrp: 0.473 ± 0.008
1.58AlaTyr: 1.58 ± 0.015
0.0AlaXaa: 0.0 ± 0.0
Cys
0.952CysAla: 0.952 ± 0.014
0.547CysCys: 0.547 ± 0.011
1.19CysAsp: 1.19 ± 0.019
1.125CysGlu: 1.125 ± 0.018
1.003CysPhe: 1.003 ± 0.011
1.242CysGly: 1.242 ± 0.031
0.547CysHis: 0.547 ± 0.011
1.549CysIle: 1.549 ± 0.024
1.48CysLys: 1.48 ± 0.021
2.087CysLeu: 2.087 ± 0.021
0.469CysMet: 0.469 ± 0.008
1.349CysAsn: 1.349 ± 0.019
0.971CysPro: 0.971 ± 0.03
0.793CysGln: 0.793 ± 0.017
1.077CysArg: 1.077 ± 0.027
1.897CysSer: 1.897 ± 0.032
1.251CysThr: 1.251 ± 0.019
1.305CysVal: 1.305 ± 0.023
0.25CysTrp: 0.25 ± 0.006
0.81CysTyr: 0.81 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
2.269AspAla: 2.269 ± 0.016
1.062AspCys: 1.062 ± 0.019
3.967AspAsp: 3.967 ± 0.032
3.904AspGlu: 3.904 ± 0.03
2.451AspPhe: 2.451 ± 0.017
2.719AspGly: 2.719 ± 0.023
1.231AspHis: 1.231 ± 0.014
3.981AspIle: 3.981 ± 0.036
3.508AspLys: 3.508 ± 0.028
4.682AspLeu: 4.682 ± 0.029
1.177AspMet: 1.177 ± 0.012
3.282AspAsn: 3.282 ± 0.025
1.954AspPro: 1.954 ± 0.028
1.744AspGln: 1.744 ± 0.017
2.229AspArg: 2.229 ± 0.023
4.034AspSer: 4.034 ± 0.03
2.837AspThr: 2.837 ± 0.024
3.309AspVal: 3.309 ± 0.024
0.553AspTrp: 0.553 ± 0.009
1.888AspTyr: 1.888 ± 0.016
0.0AspXaa: 0.0 ± 0.0
Glu
2.582GluAla: 2.582 ± 0.023
1.223GluCys: 1.223 ± 0.033
3.269GluAsp: 3.269 ± 0.026
4.384GluGlu: 4.384 ± 0.041
2.361GluPhe: 2.361 ± 0.019
1.976GluGly: 1.976 ± 0.021
1.347GluHis: 1.347 ± 0.014
4.34GluIle: 4.34 ± 0.028
4.833GluLys: 4.833 ± 0.037
5.333GluLeu: 5.333 ± 0.034
1.456GluMet: 1.456 ± 0.016
4.387GluAsn: 4.387 ± 0.031
2.019GluPro: 2.019 ± 0.021
2.239GluGln: 2.239 ± 0.02
2.609GluArg: 2.609 ± 0.025
4.057GluSer: 4.057 ± 0.03
3.339GluThr: 3.339 ± 0.028
3.167GluVal: 3.167 ± 0.024
0.596GluTrp: 0.596 ± 0.009
2.118GluTyr: 2.118 ± 0.018
0.0GluXaa: 0.0 ± 0.0
Phe
1.842PheAla: 1.842 ± 0.017
1.032PheCys: 1.032 ± 0.013
2.44PheAsp: 2.44 ± 0.017
2.573PheGlu: 2.573 ± 0.023
2.277PhePhe: 2.277 ± 0.023
2.313PheGly: 2.313 ± 0.021
1.04PheHis: 1.04 ± 0.014
3.2PheIle: 3.2 ± 0.024
3.382PheLys: 3.382 ± 0.023
4.207PheLeu: 4.207 ± 0.029
0.98PheMet: 0.98 ± 0.012
2.884PheAsn: 2.884 ± 0.02
1.686PhePro: 1.686 ± 0.017
1.561PheGln: 1.561 ± 0.016
1.957PheArg: 1.957 ± 0.016
3.714PheSer: 3.714 ± 0.026
2.489PheThr: 2.489 ± 0.018
2.702PheVal: 2.702 ± 0.017
0.485PheTrp: 0.485 ± 0.008
1.759PheTyr: 1.759 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
2.292GlyAla: 2.292 ± 0.021
1.025GlyCys: 1.025 ± 0.016
2.309GlyAsp: 2.309 ± 0.022
2.271GlyGlu: 2.271 ± 0.027
2.137GlyPhe: 2.137 ± 0.022
3.497GlyGly: 3.497 ± 0.055
1.357GlyHis: 1.357 ± 0.016
3.007GlyIle: 3.007 ± 0.025
2.956GlyLys: 2.956 ± 0.025
3.997GlyLeu: 3.997 ± 0.029
0.972GlyMet: 0.972 ± 0.012
2.494GlyAsn: 2.494 ± 0.023
1.86GlyPro: 1.86 ± 0.026
1.649GlyGln: 1.649 ± 0.017
2.426GlyArg: 2.426 ± 0.024
3.824GlySer: 3.824 ± 0.04
2.622GlyThr: 2.622 ± 0.021
2.834GlyVal: 2.834 ± 0.022
0.547GlyTrp: 0.547 ± 0.009
1.796GlyTyr: 1.796 ± 0.021
0.0GlyXaa: 0.0 ± 0.0
His
0.953HisAla: 0.953 ± 0.012
0.637HisCys: 0.637 ± 0.01
1.088HisAsp: 1.088 ± 0.012
1.202HisGlu: 1.202 ± 0.015
1.269HisPhe: 1.269 ± 0.013
1.152HisGly: 1.152 ± 0.013
1.083HisHis: 1.083 ± 0.022
1.74HisIle: 1.74 ± 0.017
1.643HisLys: 1.643 ± 0.018
2.424HisLeu: 2.424 ± 0.019
0.585HisMet: 0.585 ± 0.008
1.482HisAsn: 1.482 ± 0.013
1.07HisPro: 1.07 ± 0.012
1.183HisGln: 1.183 ± 0.016
1.32HisArg: 1.32 ± 0.014
1.958HisSer: 1.958 ± 0.018
1.367HisThr: 1.367 ± 0.016
1.432HisVal: 1.432 ± 0.017
0.294HisTrp: 0.294 ± 0.006
0.987HisTyr: 0.987 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
3.213IleAla: 3.213 ± 0.024
1.576IleCys: 1.576 ± 0.023
3.971IleAsp: 3.971 ± 0.027
4.3IleGlu: 4.3 ± 0.03
3.276IlePhe: 3.276 ± 0.024
3.024IleGly: 3.024 ± 0.022
1.713IleHis: 1.713 ± 0.015
5.585IleIle: 5.585 ± 0.032
5.715IleLys: 5.715 ± 0.043
6.645IleLeu: 6.645 ± 0.036
1.597IleMet: 1.597 ± 0.016
4.871IleAsn: 4.871 ± 0.03
3.131IlePro: 3.131 ± 0.022
2.746IleGln: 2.746 ± 0.022
3.055IleArg: 3.055 ± 0.024
5.649IleSer: 5.649 ± 0.033
4.179IleThr: 4.179 ± 0.025
4.278IleVal: 4.278 ± 0.025
0.63IleTrp: 0.63 ± 0.009
2.48IleTyr: 2.48 ± 0.02
0.0IleXaa: 0.0 ± 0.0
Lys
2.884LysAla: 2.884 ± 0.023
1.704LysCys: 1.704 ± 0.024
3.322LysAsp: 3.322 ± 0.025
4.238LysGlu: 4.238 ± 0.034
3.077LysPhe: 3.077 ± 0.025
2.465LysGly: 2.465 ± 0.024
1.945LysHis: 1.945 ± 0.016
5.89LysIle: 5.89 ± 0.039
6.966LysLys: 6.966 ± 0.056
6.984LysLeu: 6.984 ± 0.042
1.833LysMet: 1.833 ± 0.015
5.391LysAsn: 5.391 ± 0.035
3.059LysPro: 3.059 ± 0.032
2.854LysGln: 2.854 ± 0.027
3.642LysArg: 3.642 ± 0.027
5.815LysSer: 5.815 ± 0.035
4.715LysThr: 4.715 ± 0.027
3.653LysVal: 3.653 ± 0.027
0.779LysTrp: 0.779 ± 0.01
3.022LysTyr: 3.022 ± 0.022
0.0LysXaa: 0.0 ± 0.0
Leu
4.511LeuAla: 4.511 ± 0.028
2.068LeuCys: 2.068 ± 0.018
4.632LeuAsp: 4.632 ± 0.029
5.283LeuGlu: 5.283 ± 0.035
4.101LeuPhe: 4.101 ± 0.027
3.714LeuGly: 3.714 ± 0.027
2.291LeuHis: 2.291 ± 0.02
6.105LeuIle: 6.105 ± 0.035
7.574LeuLys: 7.574 ± 0.043
9.086LeuLeu: 9.086 ± 0.054
2.196LeuMet: 2.196 ± 0.018
6.068LeuAsn: 6.068 ± 0.036
4.266LeuPro: 4.266 ± 0.026
3.973LeuGln: 3.973 ± 0.026
4.338LeuArg: 4.338 ± 0.026
7.604LeuSer: 7.604 ± 0.036
5.232LeuThr: 5.232 ± 0.028
5.078LeuVal: 5.078 ± 0.029
0.939LeuTrp: 0.939 ± 0.012
3.29LeuTyr: 3.29 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
1.386MetAla: 1.386 ± 0.015
0.558MetCys: 0.558 ± 0.009
1.334MetAsp: 1.334 ± 0.013
1.341MetGlu: 1.341 ± 0.014
1.12MetPhe: 1.12 ± 0.012
0.929MetGly: 0.929 ± 0.012
0.516MetHis: 0.516 ± 0.008
1.479MetIle: 1.479 ± 0.015
1.63MetLys: 1.63 ± 0.014
2.081MetLeu: 2.081 ± 0.019
0.635MetMet: 0.635 ± 0.01
1.41MetAsn: 1.41 ± 0.013
0.972MetPro: 0.972 ± 0.011
0.772MetGln: 0.772 ± 0.01
0.966MetArg: 0.966 ± 0.013
2.014MetSer: 2.014 ± 0.017
1.364MetThr: 1.364 ± 0.015
1.347MetVal: 1.347 ± 0.015
0.235MetTrp: 0.235 ± 0.006
0.938MetTyr: 0.938 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
2.708AsnAla: 2.708 ± 0.023
1.369AsnCys: 1.369 ± 0.016
3.577AsnAsp: 3.577 ± 0.026
3.867AsnGlu: 3.867 ± 0.032
2.801AsnPhe: 2.801 ± 0.02
3.057AsnGly: 3.057 ± 0.026
1.531AsnHis: 1.531 ± 0.017
5.408AsnIle: 5.408 ± 0.036
4.889AsnLys: 4.889 ± 0.033
5.636AsnLeu: 5.636 ± 0.037
1.474AsnMet: 1.474 ± 0.013
5.332AsnAsn: 5.332 ± 0.05
2.272AsnPro: 2.272 ± 0.026
2.273AsnGln: 2.273 ± 0.022
2.689AsnArg: 2.689 ± 0.021
5.153AsnSer: 5.153 ± 0.032
3.771AsnThr: 3.771 ± 0.024
3.821AsnVal: 3.821 ± 0.024
0.627AsnTrp: 0.627 ± 0.008
2.492AsnTyr: 2.492 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
2.253ProAla: 2.253 ± 0.025
0.788ProCys: 0.788 ± 0.04
2.098ProAsp: 2.098 ± 0.017
2.526ProGlu: 2.526 ± 0.028
1.695ProPhe: 1.695 ± 0.018
2.058ProGly: 2.058 ± 0.047
0.933ProHis: 0.933 ± 0.012
2.974ProIle: 2.974 ± 0.023
2.961ProLys: 2.961 ± 0.025
3.741ProLeu: 3.741 ± 0.024
0.906ProMet: 0.906 ± 0.01
2.605ProAsn: 2.605 ± 0.021
3.25ProPro: 3.25 ± 0.052
1.715ProGln: 1.715 ± 0.026
1.819ProArg: 1.819 ± 0.018
3.996ProSer: 3.996 ± 0.041
2.859ProThr: 2.859 ± 0.029
2.87ProVal: 2.87 ± 0.027
0.434ProTrp: 0.434 ± 0.009
1.512ProTyr: 1.512 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
1.624GlnAla: 1.624 ± 0.018
0.837GlnCys: 0.837 ± 0.018
1.575GlnAsp: 1.575 ± 0.017
1.985GlnGlu: 1.985 ± 0.019
1.632GlnPhe: 1.632 ± 0.015
1.367GlnGly: 1.367 ± 0.016
1.146GlnHis: 1.146 ± 0.015
2.683GlnIle: 2.683 ± 0.023
2.768GlnLys: 2.768 ± 0.023
3.855GlnLeu: 3.855 ± 0.025
0.964GlnMet: 0.964 ± 0.011
2.694GlnAsn: 2.694 ± 0.023
1.83GlnPro: 1.83 ± 0.026
3.123GlnGln: 3.123 ± 0.074
1.859GlnArg: 1.859 ± 0.016
2.936GlnSer: 2.936 ± 0.023
2.295GlnThr: 2.295 ± 0.02
1.96GlnVal: 1.96 ± 0.02
0.426GlnTrp: 0.426 ± 0.009
1.426GlnTyr: 1.426 ± 0.014
0.0GlnXaa: 0.0 ± 0.0
Arg
2.139ArgAla: 2.139 ± 0.022
1.046ArgCys: 1.046 ± 0.019
2.095ArgAsp: 2.095 ± 0.022
2.342ArgGlu: 2.342 ± 0.019
2.078ArgPhe: 2.078 ± 0.015
2.122ArgGly: 2.122 ± 0.02
1.346ArgHis: 1.346 ± 0.014
2.928ArgIle: 2.928 ± 0.02
3.561ArgLys: 3.561 ± 0.026
4.381ArgLeu: 4.381 ± 0.029
1.031ArgMet: 1.031 ± 0.013
2.694ArgAsn: 2.694 ± 0.02
2.212ArgPro: 2.212 ± 0.028
1.924ArgGln: 1.924 ± 0.017
3.235ArgArg: 3.235 ± 0.034
3.663ArgSer: 3.663 ± 0.026
2.564ArgThr: 2.564 ± 0.022
2.544ArgVal: 2.544 ± 0.019
0.595ArgTrp: 0.595 ± 0.009
1.674ArgTyr: 1.674 ± 0.016
0.0ArgXaa: 0.0 ± 0.0
Ser
3.921SerAla: 3.921 ± 0.024
1.625SerCys: 1.625 ± 0.035
4.408SerAsp: 4.408 ± 0.029
4.416SerGlu: 4.416 ± 0.026
3.482SerPhe: 3.482 ± 0.025
4.005SerGly: 4.005 ± 0.035
1.752SerHis: 1.752 ± 0.017
5.827SerIle: 5.827 ± 0.033
5.746SerLys: 5.746 ± 0.036
7.243SerLeu: 7.243 ± 0.037
1.803SerMet: 1.803 ± 0.016
5.27SerAsn: 5.27 ± 0.037
3.829SerPro: 3.829 ± 0.043
2.74SerGln: 2.74 ± 0.022
3.501SerArg: 3.501 ± 0.025
8.794SerSer: 8.794 ± 0.067
5.428SerThr: 5.428 ± 0.04
4.895SerVal: 4.895 ± 0.032
0.789SerTrp: 0.789 ± 0.009
2.812SerTyr: 2.812 ± 0.023
0.0SerXaa: 0.0 ± 0.0
Thr
3.34ThrAla: 3.34 ± 0.028
1.236ThrCys: 1.236 ± 0.021
3.104ThrAsp: 3.104 ± 0.025
3.265ThrGlu: 3.265 ± 0.031
2.685ThrPhe: 2.685 ± 0.02
2.787ThrGly: 2.787 ± 0.022
1.29ThrHis: 1.29 ± 0.014
4.421ThrIle: 4.421 ± 0.026
4.037ThrLys: 4.037 ± 0.027
5.313ThrLeu: 5.313 ± 0.027
1.341ThrMet: 1.341 ± 0.014
3.705ThrAsn: 3.705 ± 0.022
2.894ThrPro: 2.894 ± 0.025
1.996ThrGln: 1.996 ± 0.017
2.417ThrArg: 2.417 ± 0.017
5.196ThrSer: 5.196 ± 0.036
4.39ThrThr: 4.39 ± 0.044
3.929ThrVal: 3.929 ± 0.028
0.594ThrTrp: 0.594 ± 0.009
2.052ThrTyr: 2.052 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
3.281ValAla: 3.281 ± 0.026
1.4ValCys: 1.4 ± 0.022
3.434ValAsp: 3.434 ± 0.024
3.451ValGlu: 3.451 ± 0.029
2.718ValPhe: 2.718 ± 0.021
2.68ValGly: 2.68 ± 0.024
1.482ValHis: 1.482 ± 0.016
3.951ValIle: 3.951 ± 0.021
4.016ValLys: 4.016 ± 0.026
5.596ValLeu: 5.596 ± 0.029
1.348ValMet: 1.348 ± 0.012
3.306ValAsn: 3.306 ± 0.027
2.775ValPro: 2.775 ± 0.027
2.315ValGln: 2.315 ± 0.019
2.615ValArg: 2.615 ± 0.019
4.536ValSer: 4.536 ± 0.027
3.554ValThr: 3.554 ± 0.022
4.026ValVal: 4.026 ± 0.029
0.655ValTrp: 0.655 ± 0.01
2.156ValTyr: 2.156 ± 0.018
0.0ValXaa: 0.0 ± 0.0
Trp
0.487TrpAla: 0.487 ± 0.008
0.267TrpCys: 0.267 ± 0.006
0.52TrpAsp: 0.52 ± 0.009
0.506TrpGlu: 0.506 ± 0.008
0.476TrpPhe: 0.476 ± 0.009
0.446TrpGly: 0.446 ± 0.009
0.235TrpHis: 0.235 ± 0.005
0.699TrpIle: 0.699 ± 0.009
0.834TrpLys: 0.834 ± 0.012
1.066TrpLeu: 1.066 ± 0.011
0.273TrpMet: 0.273 ± 0.006
0.672TrpAsn: 0.672 ± 0.009
0.425TrpPro: 0.425 ± 0.007
0.375TrpGln: 0.375 ± 0.006
0.549TrpArg: 0.549 ± 0.009
0.871TrpSer: 0.871 ± 0.012
0.651TrpThr: 0.651 ± 0.009
0.559TrpVal: 0.559 ± 0.009
0.158TrpTrp: 0.158 ± 0.006
0.4TrpTyr: 0.4 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.561TyrAla: 1.561 ± 0.015
0.906TyrCys: 0.906 ± 0.01
1.957TyrAsp: 1.957 ± 0.019
1.959TyrGlu: 1.959 ± 0.017
1.871TyrPhe: 1.871 ± 0.016
1.835TyrGly: 1.835 ± 0.02
0.975TyrHis: 0.975 ± 0.013
2.559TyrIle: 2.559 ± 0.021
2.71TyrLys: 2.71 ± 0.022
3.547TyrLeu: 3.547 ± 0.025
0.807TyrMet: 0.807 ± 0.012
2.453TyrAsn: 2.453 ± 0.02
1.46TyrPro: 1.46 ± 0.023
1.365TyrGln: 1.365 ± 0.014
1.734TyrArg: 1.734 ± 0.016
2.897TyrSer: 2.897 ± 0.026
2.147TyrThr: 2.147 ± 0.016
2.036TyrVal: 2.036 ± 0.016
0.413TyrTrp: 0.413 ± 0.008
1.68TyrTyr: 1.68 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.002XaaXaa: 0.002 ± 0.001
Statistics based on 18358 proteins (8501140 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski