Amino acid dipeptide frequency for Acrocarpospora corrugata

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
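The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that sequences use one-letter codes, and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs never span two proteins, and non-standard residues
    are collapsed to 'X'.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            pair = "".join(c if c in STANDARD else "X" for c in seq[i:i + 2])
            counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if permille > UNIFORM_PERMILLE:
        return "over"
    if permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"
```

With the values from the table, classify(19.465) labels AlaAla as overrepresented and classify(0.104) labels CysCys as underrepresented.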

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
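As a quick illustration of the unit conversion, the helper below (a hypothetical name, not part of Proteome-pI) turns a per mille value from the table into a fraction and a percentage.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (divide by 1000)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0
```

For example, the AlaAla entry of 19.465‰ corresponds to a fraction of about 0.0195, or about 1.95%.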

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
19.465AlaAla: 19.465 ± 0.122
1.066AlaCys: 1.066 ± 0.023
7.988AlaAsp: 7.988 ± 0.052
8.465AlaGlu: 8.465 ± 0.071
3.623AlaPhe: 3.623 ± 0.041
12.524AlaGly: 12.524 ± 0.086
2.563AlaHis: 2.563 ± 0.032
4.909AlaIle: 4.909 ± 0.05
2.858AlaLys: 2.858 ± 0.041
13.914AlaLeu: 13.914 ± 0.092
2.579AlaMet: 2.579 ± 0.031
2.35AlaAsn: 2.35 ± 0.039
5.895AlaPro: 5.895 ± 0.044
3.633AlaGln: 3.633 ± 0.04
9.687AlaArg: 9.687 ± 0.072
5.693AlaSer: 5.693 ± 0.056
7.296AlaThr: 7.296 ± 0.05
11.135AlaVal: 11.135 ± 0.066
1.921AlaTrp: 1.921 ± 0.032
2.737AlaTyr: 2.737 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
1.038CysAla: 1.038 ± 0.019
0.104CysCys: 0.104 ± 0.006
0.492CysAsp: 0.492 ± 0.013
0.392CysGlu: 0.392 ± 0.013
0.223CysPhe: 0.223 ± 0.008
0.93CysGly: 0.93 ± 0.02
0.225CysHis: 0.225 ± 0.009
0.138CysIle: 0.138 ± 0.008
0.101CysLys: 0.101 ± 0.006
0.737CysLeu: 0.737 ± 0.019
0.118CysMet: 0.118 ± 0.007
0.126CysAsn: 0.126 ± 0.008
0.506CysPro: 0.506 ± 0.013
0.206CysGln: 0.206 ± 0.01
0.583CysArg: 0.583 ± 0.016
0.406CysSer: 0.406 ± 0.011
0.442CysThr: 0.442 ± 0.013
0.658CysVal: 0.658 ± 0.016
0.145CysTrp: 0.145 ± 0.009
0.167CysTyr: 0.167 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
6.751AspAla: 6.751 ± 0.056
0.409AspCys: 0.409 ± 0.013
3.415AspAsp: 3.415 ± 0.044
3.567AspGlu: 3.567 ± 0.04
1.679AspPhe: 1.679 ± 0.029
5.898AspGly: 5.898 ± 0.06
1.412AspHis: 1.412 ± 0.024
2.012AspIle: 2.012 ± 0.03
1.027AspLys: 1.027 ± 0.023
6.879AspLeu: 6.879 ± 0.058
0.833AspMet: 0.833 ± 0.014
1.032AspAsn: 1.032 ± 0.024
4.582AspPro: 4.582 ± 0.046
1.885AspGln: 1.885 ± 0.03
4.837AspArg: 4.837 ± 0.048
2.458AspSer: 2.458 ± 0.033
2.707AspThr: 2.707 ± 0.033
4.671AspVal: 4.671 ± 0.045
1.017AspTrp: 1.017 ± 0.021
1.25AspTyr: 1.25 ± 0.024
0.0AspXaa: 0.0 ± 0.0
Glu
6.475GluAla: 6.475 ± 0.054
0.346GluCys: 0.346 ± 0.011
2.551GluAsp: 2.551 ± 0.03
3.037GluGlu: 3.037 ± 0.039
1.679GluPhe: 1.679 ± 0.027
3.64GluGly: 3.64 ± 0.04
1.455GluHis: 1.455 ± 0.023
2.727GluIle: 2.727 ± 0.031
1.175GluLys: 1.175 ± 0.025
6.895GluLeu: 6.895 ± 0.06
0.949GluMet: 0.949 ± 0.019
1.133GluAsn: 1.133 ± 0.022
3.274GluPro: 3.274 ± 0.042
2.036GluGln: 2.036 ± 0.031
5.02GluArg: 5.02 ± 0.056
2.634GluSer: 2.634 ± 0.034
2.815GluThr: 2.815 ± 0.039
4.441GluVal: 4.441 ± 0.045
0.803GluTrp: 0.803 ± 0.017
1.093GluTyr: 1.093 ± 0.021
0.0GluXaa: 0.0 ± 0.0
Phe
3.955PheAla: 3.955 ± 0.041
0.282PheCys: 0.282 ± 0.011
2.155PheAsp: 2.155 ± 0.028
1.466PheGlu: 1.466 ± 0.026
0.94PhePhe: 0.94 ± 0.021
3.206PheGly: 3.206 ± 0.039
0.637PheHis: 0.637 ± 0.017
0.914PheIle: 0.914 ± 0.024
0.542PheLys: 0.542 ± 0.015
2.746PheLeu: 2.746 ± 0.04
0.447PheMet: 0.447 ± 0.012
0.692PheAsn: 0.692 ± 0.02
1.499PhePro: 1.499 ± 0.025
0.789PheGln: 0.789 ± 0.018
1.974PheArg: 1.974 ± 0.025
1.515PheSer: 1.515 ± 0.027
2.195PheThr: 2.195 ± 0.031
2.339PheVal: 2.339 ± 0.032
0.487PheTrp: 0.487 ± 0.014
0.632PheTyr: 0.632 ± 0.016
0.0PheXaa: 0.0 ± 0.0
Gly
9.714GlyAla: 9.714 ± 0.068
0.781GlyCys: 0.781 ± 0.019
5.246GlyAsp: 5.246 ± 0.053
5.029GlyGlu: 5.029 ± 0.046
3.026GlyPhe: 3.026 ± 0.037
8.323GlyGly: 8.323 ± 0.074
2.193GlyHis: 2.193 ± 0.032
3.543GlyIle: 3.543 ± 0.046
2.343GlyLys: 2.343 ± 0.034
9.737GlyLeu: 9.737 ± 0.076
1.979GlyMet: 1.979 ± 0.029
1.992GlyAsn: 1.992 ± 0.042
4.862GlyPro: 4.862 ± 0.049
2.896GlyGln: 2.896 ± 0.036
7.321GlyArg: 7.321 ± 0.063
5.163GlySer: 5.163 ± 0.051
5.708GlyThr: 5.708 ± 0.063
7.799GlyVal: 7.799 ± 0.065
1.759GlyTrp: 1.759 ± 0.029
2.436GlyTyr: 2.436 ± 0.036
0.0GlyXaa: 0.0 ± 0.0
His
2.568HisAla: 2.568 ± 0.033
0.201HisCys: 0.201 ± 0.009
1.317HisAsp: 1.317 ± 0.023
1.156HisGlu: 1.156 ± 0.022
0.606HisPhe: 0.606 ± 0.016
2.13HisGly: 2.13 ± 0.032
0.662HisHis: 0.662 ± 0.019
0.724HisIle: 0.724 ± 0.017
0.298HisLys: 0.298 ± 0.01
2.449HisLeu: 2.449 ± 0.027
0.315HisMet: 0.315 ± 0.01
0.401HisAsn: 0.401 ± 0.012
1.733HisPro: 1.733 ± 0.026
0.66HisGln: 0.66 ± 0.016
1.921HisArg: 1.921 ± 0.028
0.929HisSer: 0.929 ± 0.018
1.154HisThr: 1.154 ± 0.022
1.676HisVal: 1.676 ± 0.028
0.347HisTrp: 0.347 ± 0.011
0.475HisTyr: 0.475 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
5.562IleAla: 5.562 ± 0.045
0.336IleCys: 0.336 ± 0.011
2.617IleAsp: 2.617 ± 0.035
2.399IleGlu: 2.399 ± 0.031
1.026IlePhe: 1.026 ± 0.023
3.974IleGly: 3.974 ± 0.047
0.724IleHis: 0.724 ± 0.017
1.334IleIle: 1.334 ± 0.027
0.901IleLys: 0.901 ± 0.021
3.198IleLeu: 3.198 ± 0.043
0.684IleMet: 0.684 ± 0.017
0.884IleAsn: 0.884 ± 0.021
2.314IlePro: 2.314 ± 0.029
0.965IleGln: 0.965 ± 0.023
2.928IleArg: 2.928 ± 0.034
2.209IleSer: 2.209 ± 0.029
2.759IleThr: 2.759 ± 0.037
3.399IleVal: 3.399 ± 0.035
0.562IleTrp: 0.562 ± 0.016
0.744IleTyr: 0.744 ± 0.017
0.0IleXaa: 0.0 ± 0.0
Lys
2.593LysAla: 2.593 ± 0.035
0.133LysCys: 0.133 ± 0.008
1.078LysAsp: 1.078 ± 0.021
1.027LysGlu: 1.027 ± 0.019
0.494LysPhe: 0.494 ± 0.013
1.583LysGly: 1.583 ± 0.028
0.416LysHis: 0.416 ± 0.012
1.109LysIle: 1.109 ± 0.022
0.626LysLys: 0.626 ± 0.02
2.052LysLeu: 2.052 ± 0.03
0.386LysMet: 0.386 ± 0.011
0.513LysAsn: 0.513 ± 0.017
1.41LysPro: 1.41 ± 0.026
0.613LysGln: 0.613 ± 0.015
1.472LysArg: 1.472 ± 0.025
1.158LysSer: 1.158 ± 0.024
1.28LysThr: 1.28 ± 0.025
1.884LysVal: 1.884 ± 0.031
0.298LysTrp: 0.298 ± 0.011
0.454LysTyr: 0.454 ± 0.014
0.0LysXaa: 0.0 ± 0.0
Leu
15.92LeuAla: 15.92 ± 0.098
0.782LeuCys: 0.782 ± 0.02
6.655LeuAsp: 6.655 ± 0.062
4.501LeuGlu: 4.501 ± 0.047
2.775LeuPhe: 2.775 ± 0.038
9.378LeuGly: 9.378 ± 0.074
2.054LeuHis: 2.054 ± 0.026
4.317LeuIle: 4.317 ± 0.053
1.926LeuLys: 1.926 ± 0.031
11.191LeuLeu: 11.191 ± 0.099
1.697LeuMet: 1.697 ± 0.023
2.062LeuAsn: 2.062 ± 0.027
6.458LeuPro: 6.458 ± 0.053
2.147LeuGln: 2.147 ± 0.028
8.825LeuArg: 8.825 ± 0.073
5.691LeuSer: 5.691 ± 0.043
7.301LeuThr: 7.301 ± 0.059
8.844LeuVal: 8.844 ± 0.063
1.373LeuTrp: 1.373 ± 0.024
1.882LeuTyr: 1.882 ± 0.027
0.0LeuXaa: 0.0 ± 0.0
Met
2.399MetAla: 2.399 ± 0.031
0.119MetCys: 0.119 ± 0.007
0.895MetAsp: 0.895 ± 0.02
0.74MetGlu: 0.74 ± 0.017
0.548MetPhe: 0.548 ± 0.015
1.351MetGly: 1.351 ± 0.023
0.362MetHis: 0.362 ± 0.012
0.926MetIle: 0.926 ± 0.02
0.437MetLys: 0.437 ± 0.012
1.897MetLeu: 1.897 ± 0.027
0.313MetMet: 0.313 ± 0.011
0.494MetAsn: 0.494 ± 0.014
1.125MetPro: 1.125 ± 0.02
0.395MetGln: 0.395 ± 0.013
1.539MetArg: 1.539 ± 0.023
1.296MetSer: 1.296 ± 0.021
1.531MetThr: 1.531 ± 0.026
1.401MetVal: 1.401 ± 0.024
0.236MetTrp: 0.236 ± 0.009
0.315MetTyr: 0.315 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
2.403AsnAla: 2.403 ± 0.039
0.172AsnCys: 0.172 ± 0.007
1.055AsnAsp: 1.055 ± 0.022
0.908AsnGlu: 0.908 ± 0.021
0.558AsnPhe: 0.558 ± 0.015
2.162AsnGly: 2.162 ± 0.045
0.428AsnHis: 0.428 ± 0.012
0.764AsnIle: 0.764 ± 0.018
0.399AsnLys: 0.399 ± 0.013
2.098AsnLeu: 2.098 ± 0.029
0.336AsnMet: 0.336 ± 0.011
0.568AsnAsn: 0.568 ± 0.024
1.685AsnPro: 1.685 ± 0.028
0.681AsnGln: 0.681 ± 0.019
1.526AsnArg: 1.526 ± 0.028
1.051AsnSer: 1.051 ± 0.022
1.213AsnThr: 1.213 ± 0.028
1.639AsnVal: 1.639 ± 0.028
0.375AsnTrp: 0.375 ± 0.014
0.477AsnTyr: 0.477 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
8.229ProAla: 8.229 ± 0.068
0.352ProCys: 0.352 ± 0.012
4.37ProAsp: 4.37 ± 0.045
4.045ProGlu: 4.045 ± 0.04
1.685ProPhe: 1.685 ± 0.028
6.671ProGly: 6.671 ± 0.061
1.18ProHis: 1.18 ± 0.023
2.112ProIle: 2.112 ± 0.027
1.244ProLys: 1.244 ± 0.021
5.133ProLeu: 5.133 ± 0.047
1.155ProMet: 1.155 ± 0.023
1.141ProAsn: 1.141 ± 0.026
3.876ProPro: 3.876 ± 0.065
1.591ProGln: 1.591 ± 0.032
3.7ProArg: 3.7 ± 0.036
3.395ProSer: 3.395 ± 0.05
3.233ProThr: 3.233 ± 0.046
5.105ProVal: 5.105 ± 0.046
0.94ProTrp: 0.94 ± 0.016
1.475ProTyr: 1.475 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
3.966GlnAla: 3.966 ± 0.037
0.164GlnCys: 0.164 ± 0.008
1.335GlnAsp: 1.335 ± 0.023
1.388GlnGlu: 1.388 ± 0.022
0.787GlnPhe: 0.787 ± 0.017
2.108GlnGly: 2.108 ± 0.029
0.625GlnHis: 0.625 ± 0.018
1.373GlnIle: 1.373 ± 0.023
0.54GlnLys: 0.54 ± 0.016
3.098GlnLeu: 3.098 ± 0.04
0.488GlnMet: 0.488 ± 0.014
0.591GlnAsn: 0.591 ± 0.018
1.813GlnPro: 1.813 ± 0.031
1.061GlnGln: 1.061 ± 0.027
2.375GlnArg: 2.375 ± 0.033
1.348GlnSer: 1.348 ± 0.026
1.512GlnThr: 1.512 ± 0.025
2.706GlnVal: 2.706 ± 0.035
0.512GlnTrp: 0.512 ± 0.015
0.581GlnTyr: 0.581 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
9.31ArgAla: 9.31 ± 0.079
0.592ArgCys: 0.592 ± 0.015
4.405ArgAsp: 4.405 ± 0.044
4.593ArgGlu: 4.593 ± 0.05
2.549ArgPhe: 2.549 ± 0.031
5.549ArgGly: 5.549 ± 0.046
2.033ArgHis: 2.033 ± 0.029
3.374ArgIle: 3.374 ± 0.036
1.622ArgLys: 1.622 ± 0.027
9.185ArgLeu: 9.185 ± 0.084
1.805ArgMet: 1.805 ± 0.029
1.516ArgAsn: 1.516 ± 0.025
4.659ArgPro: 4.659 ± 0.054
2.496ArgGln: 2.496 ± 0.033
7.468ArgArg: 7.468 ± 0.071
3.895ArgSer: 3.895 ± 0.037
4.585ArgThr: 4.585 ± 0.046
6.076ArgVal: 6.076 ± 0.058
1.436ArgTrp: 1.436 ± 0.026
1.877ArgTyr: 1.877 ± 0.03
0.0ArgXaa: 0.0 ± 0.0
Ser
6.619SerAla: 6.619 ± 0.055
0.421SerCys: 0.421 ± 0.013
2.6SerAsp: 2.6 ± 0.033
2.271SerGlu: 2.271 ± 0.028
1.643SerPhe: 1.643 ± 0.024
5.954SerGly: 5.954 ± 0.054
1.037SerHis: 1.037 ± 0.021
1.945SerIle: 1.945 ± 0.028
0.914SerLys: 0.914 ± 0.02
4.799SerLeu: 4.799 ± 0.038
1.163SerMet: 1.163 ± 0.023
0.985SerAsn: 0.985 ± 0.023
3.58SerPro: 3.58 ± 0.055
1.284SerGln: 1.284 ± 0.024
3.792SerArg: 3.792 ± 0.04
2.768SerSer: 2.768 ± 0.041
3.072SerThr: 3.072 ± 0.039
4.177SerVal: 4.177 ± 0.043
1.025SerTrp: 1.025 ± 0.019
1.244SerTyr: 1.244 ± 0.026
0.0SerXaa: 0.0 ± 0.0
Thr
7.977ThrAla: 7.977 ± 0.06
0.477ThrCys: 0.477 ± 0.012
3.144ThrAsp: 3.144 ± 0.039
2.987ThrGlu: 2.987 ± 0.032
1.849ThrPhe: 1.849 ± 0.026
6.504ThrGly: 6.504 ± 0.059
1.15ThrHis: 1.15 ± 0.023
2.48ThrIle: 2.48 ± 0.033
1.206ThrLys: 1.206 ± 0.022
6.072ThrLeu: 6.072 ± 0.051
1.114ThrMet: 1.114 ± 0.023
1.177ThrAsn: 1.177 ± 0.029
4.217ThrPro: 4.217 ± 0.053
1.527ThrGln: 1.527 ± 0.026
4.082ThrArg: 4.082 ± 0.035
3.158ThrSer: 3.158 ± 0.041
3.824ThrThr: 3.824 ± 0.052
5.526ThrVal: 5.526 ± 0.055
1.026ThrTrp: 1.026 ± 0.022
1.309ThrTyr: 1.309 ± 0.027
0.0ThrXaa: 0.0 ± 0.0
Val
10.963ValAla: 10.963 ± 0.078
0.651ValCys: 0.651 ± 0.016
4.74ValAsp: 4.74 ± 0.043
4.399ValGlu: 4.399 ± 0.04
2.495ValPhe: 2.495 ± 0.03
6.574ValGly: 6.574 ± 0.06
1.72ValHis: 1.72 ± 0.025
3.599ValIle: 3.599 ± 0.04
1.721ValLys: 1.721 ± 0.026
9.268ValLeu: 9.268 ± 0.075
1.384ValMet: 1.384 ± 0.024
1.882ValAsn: 1.882 ± 0.029
5.018ValPro: 5.018 ± 0.046
2.09ValGln: 2.09 ± 0.029
6.606ValArg: 6.606 ± 0.055
4.485ValSer: 4.485 ± 0.048
5.871ValThr: 5.871 ± 0.048
7.656ValVal: 7.656 ± 0.068
1.122ValTrp: 1.122 ± 0.025
1.633ValTyr: 1.633 ± 0.026
0.0ValXaa: 0.0 ± 0.0
Trp
1.748TrpAla: 1.748 ± 0.029
0.159TrpCys: 0.159 ± 0.008
0.862TrpAsp: 0.862 ± 0.022
0.718TrpGlu: 0.718 ± 0.018
0.551TrpPhe: 0.551 ± 0.015
1.094TrpGly: 1.094 ± 0.022
0.404TrpHis: 0.404 ± 0.012
0.644TrpIle: 0.644 ± 0.015
0.338TrpLys: 0.338 ± 0.012
1.955TrpLeu: 1.955 ± 0.03
0.308TrpMet: 0.308 ± 0.008
0.484TrpAsn: 0.484 ± 0.014
0.934TrpPro: 0.934 ± 0.02
0.627TrpGln: 0.627 ± 0.015
1.465TrpArg: 1.465 ± 0.022
1.02TrpSer: 1.02 ± 0.02
1.099TrpThr: 1.099 ± 0.021
1.017TrpVal: 1.017 ± 0.021
0.383TrpTrp: 0.383 ± 0.013
0.347TrpTyr: 0.347 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.646TyrAla: 2.646 ± 0.036
0.191TyrCys: 0.191 ± 0.008
1.402TyrAsp: 1.402 ± 0.029
1.089TyrGlu: 1.089 ± 0.019
0.678TyrPhe: 0.678 ± 0.017
2.297TyrGly: 2.297 ± 0.034
0.442TyrHis: 0.442 ± 0.013
0.577TyrIle: 0.577 ± 0.016
0.36TyrLys: 0.36 ± 0.012
2.476TyrLeu: 2.476 ± 0.03
0.266TyrMet: 0.266 ± 0.011
0.473TyrAsn: 0.473 ± 0.017
1.191TyrPro: 1.191 ± 0.023
0.79TyrGln: 0.79 ± 0.017
1.902TyrArg: 1.902 ± 0.027
1.02TyrSer: 1.02 ± 0.02
1.208TyrThr: 1.208 ± 0.023
1.728TyrVal: 1.728 ± 0.026
0.392TyrTrp: 0.392 ± 0.012
0.499TyrTyr: 0.499 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8466 proteins (2731098 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
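A protein-level bootstrap of this kind could look roughly like the sketch below. The exact procedure used by Proteome-pI is not described here, so this is an assumption: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names and the fixed seed are illustrative only.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the dipeptide frequency over bootstrap replicates.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the error is estimated at the protein level.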

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski