Amino acid dipeptide frequency for Motilimonas pumilua

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
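As a rough illustration of how such frequencies can be computed, here is a minimal Python sketch. It assumes overlapping adjacent pairs are counted within each protein (never across protein boundaries) and that any non-standard one-letter code is mapped to X (Xaa). The function name `dipeptide_permille` is hypothetical, and this is not necessarily the exact procedure used to build the table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted within each sequence,
    and every non-standard residue is treated as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome.
freqs = dipeptide_permille(["MKLLA", "MKXA"])
```

In this toy input there are seven dipeptides in total and MK occurs twice, so its frequency is 2/7 of 1000‰.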

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
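The comparison against the uniform expectation can be sketched as follows. This assumes the baseline is simply 1000‰ divided by the number of possible dipeptides (2.5‰ for 20 amino acids, about 2.27‰ for 21). The exact threshold used for the red/blue colouring on the page is not stated, so the function `classify` is only a hypothetical illustration.

```python
def expected_permille(n_residues=20):
    """Uniform expectation in per mille: 1000 / (number of possible dipeptides)."""
    return 1000.0 / (n_residues ** 2)


def classify(permille, n_residues=20):
    """Label a dipeptide frequency relative to the uniform expectation."""
    expected = expected_permille(n_residues)
    if permille > expected:
        return "over"
    if permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla (8.708) and CysCys (0.188).
labels = {"AlaAla": classify(8.708), "CysCys": classify(0.188)}
```

With the 20-residue baseline of 2.5‰, AlaAla is classified as overrepresented and CysCys as underrepresented.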

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
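A small sketch of the unit conversion, using the LeuLeu value from the table as input. The helper names are hypothetical.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# LeuLeu is listed as 11.433 per mille in the table.
leu_leu_fraction = permille_to_fraction(11.433)
leu_leu_percent = permille_to_percent(11.433)
```

So a table entry of 11.433‰ corresponds to a fraction of about 0.0114, or roughly 1.14% of all dipeptides.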

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.708AlaAla: 8.708 ± 0.106
1.188AlaCys: 1.188 ± 0.032
4.91AlaAsp: 4.91 ± 0.057
5.698AlaGlu: 5.698 ± 0.087
3.675AlaPhe: 3.675 ± 0.058
6.262AlaGly: 6.262 ± 0.082
1.788AlaHis: 1.788 ± 0.043
6.017AlaIle: 6.017 ± 0.079
5.49AlaLys: 5.49 ± 0.08
10.274AlaLeu: 10.274 ± 0.098
2.804AlaMet: 2.804 ± 0.049
3.846AlaAsn: 3.846 ± 0.055
3.301AlaPro: 3.301 ± 0.059
4.979AlaGln: 4.979 ± 0.066
3.507AlaArg: 3.507 ± 0.056
5.827AlaSer: 5.827 ± 0.075
4.559AlaThr: 4.559 ± 0.066
5.934AlaVal: 5.934 ± 0.078
1.16AlaTrp: 1.16 ± 0.032
2.578AlaTyr: 2.578 ± 0.051
0.0AlaXaa: 0.0 ± 0.0
Cys
0.836CysAla: 0.836 ± 0.024
0.188CysCys: 0.188 ± 0.013
0.638CysAsp: 0.638 ± 0.026
0.607CysGlu: 0.607 ± 0.024
0.564CysPhe: 0.564 ± 0.024
0.856CysGly: 0.856 ± 0.023
0.431CysHis: 0.431 ± 0.02
0.643CysIle: 0.643 ± 0.024
0.428CysLys: 0.428 ± 0.019
1.224CysLeu: 1.224 ± 0.033
0.21CysMet: 0.21 ± 0.015
0.343CysAsn: 0.343 ± 0.018
0.467CysPro: 0.467 ± 0.019
0.743CysGln: 0.743 ± 0.028
0.552CysArg: 0.552 ± 0.022
0.745CysSer: 0.745 ± 0.025
0.426CysThr: 0.426 ± 0.017
0.635CysVal: 0.635 ± 0.023
0.162CysTrp: 0.162 ± 0.012
0.375CysTyr: 0.375 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
4.496AspAla: 4.496 ± 0.063
0.561AspCys: 0.561 ± 0.019
3.003AspAsp: 3.003 ± 0.061
3.585AspGlu: 3.585 ± 0.061
2.61AspPhe: 2.61 ± 0.05
3.559AspGly: 3.559 ± 0.075
1.043AspHis: 1.043 ± 0.031
4.143AspIle: 4.143 ± 0.061
3.269AspLys: 3.269 ± 0.052
5.369AspLeu: 5.369 ± 0.073
1.331AspMet: 1.331 ± 0.032
2.505AspAsn: 2.505 ± 0.05
2.131AspPro: 2.131 ± 0.04
2.084AspGln: 2.084 ± 0.041
1.938AspArg: 1.938 ± 0.043
3.154AspSer: 3.154 ± 0.06
2.783AspThr: 2.783 ± 0.054
3.658AspVal: 3.658 ± 0.062
0.856AspTrp: 0.856 ± 0.025
2.124AspTyr: 2.124 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
5.008GluAla: 5.008 ± 0.084
0.462GluCys: 0.462 ± 0.018
2.614GluAsp: 2.614 ± 0.054
3.013GluGlu: 3.013 ± 0.068
2.392GluPhe: 2.392 ± 0.049
3.227GluGly: 3.227 ± 0.052
1.596GluHis: 1.596 ± 0.038
3.517GluIle: 3.517 ± 0.051
3.118GluLys: 3.118 ± 0.058
6.884GluLeu: 6.884 ± 0.081
1.561GluMet: 1.561 ± 0.036
2.159GluAsn: 2.159 ± 0.04
2.222GluPro: 2.222 ± 0.054
4.785GluGln: 4.785 ± 0.071
2.884GluArg: 2.884 ± 0.045
3.312GluSer: 3.312 ± 0.049
2.804GluThr: 2.804 ± 0.052
4.353GluVal: 4.353 ± 0.062
0.65GluTrp: 0.65 ± 0.024
1.648GluTyr: 1.648 ± 0.036
0.0GluXaa: 0.0 ± 0.0
Phe
3.857PheAla: 3.857 ± 0.06
0.571PheCys: 0.571 ± 0.021
2.742PheAsp: 2.742 ± 0.047
2.59PheGlu: 2.59 ± 0.041
1.74PhePhe: 1.74 ± 0.043
2.763PheGly: 2.763 ± 0.052
0.839PheHis: 0.839 ± 0.025
2.741PheIle: 2.741 ± 0.043
2.078PheLys: 2.078 ± 0.042
3.332PheLeu: 3.332 ± 0.057
0.964PheMet: 0.964 ± 0.026
2.128PheAsn: 2.128 ± 0.043
1.368PhePro: 1.368 ± 0.035
1.45PheGln: 1.45 ± 0.035
1.448PheArg: 1.448 ± 0.031
3.463PheSer: 3.463 ± 0.053
2.287PheThr: 2.287 ± 0.043
2.707PheVal: 2.707 ± 0.051
0.592PheTrp: 0.592 ± 0.024
1.425PheTyr: 1.425 ± 0.033
0.0PheXaa: 0.0 ± 0.0
Gly
5.198GlyAla: 5.198 ± 0.079
0.889GlyCys: 0.889 ± 0.031
3.584GlyAsp: 3.584 ± 0.059
4.26GlyGlu: 4.26 ± 0.064
3.18GlyPhe: 3.18 ± 0.046
4.432GlyGly: 4.432 ± 0.067
1.627GlyHis: 1.627 ± 0.037
4.397GlyIle: 4.397 ± 0.066
3.843GlyLys: 3.843 ± 0.063
7.011GlyLeu: 7.011 ± 0.077
1.782GlyMet: 1.782 ± 0.039
2.518GlyAsn: 2.518 ± 0.05
1.809GlyPro: 1.809 ± 0.038
3.568GlyGln: 3.568 ± 0.064
2.835GlyArg: 2.835 ± 0.051
3.955GlySer: 3.955 ± 0.061
2.947GlyThr: 2.947 ± 0.054
4.876GlyVal: 4.876 ± 0.073
0.913GlyTrp: 0.913 ± 0.027
2.413GlyTyr: 2.413 ± 0.04
0.0GlyXaa: 0.0 ± 0.0
His
1.727HisAla: 1.727 ± 0.035
0.383HisCys: 0.383 ± 0.017
1.178HisAsp: 1.178 ± 0.04
1.085HisGlu: 1.085 ± 0.03
1.247HisPhe: 1.247 ± 0.035
1.672HisGly: 1.672 ± 0.041
0.842HisHis: 0.842 ± 0.031
1.424HisIle: 1.424 ± 0.033
1.082HisLys: 1.082 ± 0.028
2.548HisLeu: 2.548 ± 0.049
0.495HisMet: 0.495 ± 0.021
0.911HisAsn: 0.911 ± 0.03
1.201HisPro: 1.201 ± 0.035
1.639HisGln: 1.639 ± 0.045
1.016HisArg: 1.016 ± 0.03
1.497HisSer: 1.497 ± 0.035
1.117HisThr: 1.117 ± 0.033
1.267HisVal: 1.267 ± 0.032
0.425HisTrp: 0.425 ± 0.02
1.015HisTyr: 1.015 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
6.363IleAla: 6.363 ± 0.078
0.726IleCys: 0.726 ± 0.024
3.977IleAsp: 3.977 ± 0.064
4.251IleGlu: 4.251 ± 0.062
2.135IlePhe: 2.135 ± 0.046
4.04IleGly: 4.04 ± 0.065
1.219IleHis: 1.219 ± 0.027
3.448IleIle: 3.448 ± 0.065
3.429IleLys: 3.429 ± 0.058
4.983IleLeu: 4.983 ± 0.067
1.222IleMet: 1.222 ± 0.035
3.035IleAsn: 3.035 ± 0.053
2.279IlePro: 2.279 ± 0.043
2.171IleGln: 2.171 ± 0.034
2.501IleArg: 2.501 ± 0.039
4.371IleSer: 4.371 ± 0.063
3.49IleThr: 3.49 ± 0.055
3.682IleVal: 3.682 ± 0.06
0.628IleTrp: 0.628 ± 0.022
1.723IleTyr: 1.723 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
4.998LysAla: 4.998 ± 0.076
0.347LysCys: 0.347 ± 0.015
2.667LysAsp: 2.667 ± 0.045
2.788LysGlu: 2.788 ± 0.055
1.527LysPhe: 1.527 ± 0.035
3.308LysGly: 3.308 ± 0.052
1.372LysHis: 1.372 ± 0.032
2.657LysIle: 2.657 ± 0.052
2.795LysLys: 2.795 ± 0.063
5.326LysLeu: 5.326 ± 0.079
1.309LysMet: 1.309 ± 0.033
1.867LysAsn: 1.867 ± 0.04
2.437LysPro: 2.437 ± 0.046
3.736LysGln: 3.736 ± 0.067
2.572LysArg: 2.572 ± 0.05
3.035LysSer: 3.035 ± 0.052
2.75LysThr: 2.75 ± 0.049
4.058LysVal: 4.058 ± 0.059
0.565LysTrp: 0.565 ± 0.021
1.408LysTyr: 1.408 ± 0.031
0.0LysXaa: 0.0 ± 0.0
Leu
11.55LeuAla: 11.55 ± 0.114
1.17LeuCys: 1.17 ± 0.033
5.694LeuAsp: 5.694 ± 0.07
5.628LeuGlu: 5.628 ± 0.079
4.123LeuPhe: 4.123 ± 0.067
6.73LeuGly: 6.73 ± 0.076
2.213LeuHis: 2.213 ± 0.035
5.878LeuIle: 5.878 ± 0.074
5.403LeuLys: 5.403 ± 0.072
11.433LeuLeu: 11.433 ± 0.149
2.584LeuMet: 2.584 ± 0.04
4.569LeuAsn: 4.569 ± 0.066
5.108LeuPro: 5.108 ± 0.078
5.104LeuGln: 5.104 ± 0.081
4.104LeuArg: 4.104 ± 0.064
8.069LeuSer: 8.069 ± 0.082
6.421LeuThr: 6.421 ± 0.09
7.176LeuVal: 7.176 ± 0.088
1.177LeuTrp: 1.177 ± 0.033
2.71LeuTyr: 2.71 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
2.764MetAla: 2.764 ± 0.052
0.213MetCys: 0.213 ± 0.014
1.157MetAsp: 1.157 ± 0.032
1.095MetGlu: 1.095 ± 0.033
0.846MetPhe: 0.846 ± 0.028
1.508MetGly: 1.508 ± 0.038
0.472MetHis: 0.472 ± 0.018
1.328MetIle: 1.328 ± 0.031
1.427MetLys: 1.427 ± 0.034
2.747MetLeu: 2.747 ± 0.053
0.726MetMet: 0.726 ± 0.024
0.966MetAsn: 0.966 ± 0.03
1.247MetPro: 1.247 ± 0.032
1.369MetGln: 1.369 ± 0.035
1.091MetArg: 1.091 ± 0.029
1.736MetSer: 1.736 ± 0.043
1.452MetThr: 1.452 ± 0.032
1.751MetVal: 1.751 ± 0.041
0.211MetTrp: 0.211 ± 0.014
0.501MetTyr: 0.501 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.457AsnAla: 3.457 ± 0.058
0.381AsnCys: 0.381 ± 0.017
2.189AsnAsp: 2.189 ± 0.04
2.217AsnGlu: 2.217 ± 0.038
1.536AsnPhe: 1.536 ± 0.036
2.865AsnGly: 2.865 ± 0.053
0.961AsnHis: 0.961 ± 0.028
2.745AsnIle: 2.745 ± 0.046
2.367AsnLys: 2.367 ± 0.043
3.932AsnLeu: 3.932 ± 0.058
1.004AsnMet: 1.004 ± 0.03
1.904AsnAsn: 1.904 ± 0.045
1.974AsnPro: 1.974 ± 0.04
2.409AsnGln: 2.409 ± 0.046
1.723AsnArg: 1.723 ± 0.031
2.408AsnSer: 2.408 ± 0.049
2.235AsnThr: 2.235 ± 0.043
2.414AsnVal: 2.414 ± 0.043
0.62AsnTrp: 0.62 ± 0.02
1.278AsnTyr: 1.278 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
3.759ProAla: 3.759 ± 0.067
0.37ProCys: 0.37 ± 0.017
2.449ProAsp: 2.449 ± 0.042
3.348ProGlu: 3.348 ± 0.059
1.701ProPhe: 1.701 ± 0.039
2.54ProGly: 2.54 ± 0.04
0.934ProHis: 0.934 ± 0.024
2.236ProIle: 2.236 ± 0.038
1.924ProLys: 1.924 ± 0.048
4.335ProLeu: 4.335 ± 0.063
0.974ProMet: 0.974 ± 0.026
1.649ProAsn: 1.649 ± 0.037
1.259ProPro: 1.259 ± 0.038
1.994ProGln: 1.994 ± 0.048
1.308ProArg: 1.308 ± 0.034
2.708ProSer: 2.708 ± 0.051
2.309ProThr: 2.309 ± 0.079
3.109ProVal: 3.109 ± 0.052
0.606ProTrp: 0.606 ± 0.023
1.247ProTyr: 1.247 ± 0.037
0.0ProXaa: 0.0 ± 0.0
Gln
6.397GlnAla: 6.397 ± 0.091
0.537GlnCys: 0.537 ± 0.021
2.768GlnAsp: 2.768 ± 0.055
2.634GlnGlu: 2.634 ± 0.051
1.966GlnPhe: 1.966 ± 0.042
4.03GlnGly: 4.03 ± 0.067
1.748GlnHis: 1.748 ± 0.044
2.605GlnIle: 2.605 ± 0.049
2.211GlnLys: 2.211 ± 0.05
6.677GlnLeu: 6.677 ± 0.096
1.165GlnMet: 1.165 ± 0.031
1.701GlnAsn: 1.701 ± 0.04
2.249GlnPro: 2.249 ± 0.05
5.787GlnGln: 5.787 ± 0.125
2.699GlnArg: 2.699 ± 0.051
3.192GlnSer: 3.192 ± 0.062
2.657GlnThr: 2.657 ± 0.048
4.266GlnVal: 4.266 ± 0.064
0.909GlnTrp: 0.909 ± 0.028
1.835GlnTyr: 1.835 ± 0.043
0.0GlnXaa: 0.0 ± 0.0
Arg
3.258ArgAla: 3.258 ± 0.05
0.444ArgCys: 0.444 ± 0.019
2.177ArgAsp: 2.177 ± 0.041
2.407ArgGlu: 2.407 ± 0.047
2.134ArgPhe: 2.134 ± 0.038
2.394ArgGly: 2.394 ± 0.048
1.228ArgHis: 1.228 ± 0.034
2.599ArgIle: 2.599 ± 0.051
2.084ArgLys: 2.084 ± 0.044
4.941ArgLeu: 4.941 ± 0.065
1.03ArgMet: 1.03 ± 0.029
1.658ArgAsn: 1.658 ± 0.039
1.584ArgPro: 1.584 ± 0.033
2.656ArgGln: 2.656 ± 0.05
2.148ArgArg: 2.148 ± 0.054
2.456ArgSer: 2.456 ± 0.044
1.836ArgThr: 1.836 ± 0.041
2.837ArgVal: 2.837 ± 0.048
0.674ArgTrp: 0.674 ± 0.026
1.64ArgTyr: 1.64 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
5.701SerAla: 5.701 ± 0.066
0.724SerCys: 0.724 ± 0.027
3.479SerAsp: 3.479 ± 0.064
3.911SerGlu: 3.911 ± 0.055
2.807SerPhe: 2.807 ± 0.049
4.788SerGly: 4.788 ± 0.071
1.744SerHis: 1.744 ± 0.039
3.746SerIle: 3.746 ± 0.062
3.065SerLys: 3.065 ± 0.047
7.312SerLeu: 7.312 ± 0.086
1.543SerMet: 1.543 ± 0.033
2.545SerAsn: 2.545 ± 0.046
2.585SerPro: 2.585 ± 0.05
4.024SerGln: 4.024 ± 0.06
2.749SerArg: 2.749 ± 0.049
4.34SerSer: 4.34 ± 0.093
2.925SerThr: 2.925 ± 0.049
4.348SerVal: 4.348 ± 0.064
0.932SerTrp: 0.932 ± 0.027
2.015SerTyr: 2.015 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
4.603ThrAla: 4.603 ± 0.062
0.531ThrCys: 0.531 ± 0.021
2.883ThrAsp: 2.883 ± 0.059
2.985ThrGlu: 2.985 ± 0.053
2.039ThrPhe: 2.039 ± 0.039
3.988ThrGly: 3.988 ± 0.069
1.136ThrHis: 1.136 ± 0.031
2.872ThrIle: 2.872 ± 0.05
2.147ThrLys: 2.147 ± 0.042
6.132ThrLeu: 6.132 ± 0.072
1.082ThrMet: 1.082 ± 0.029
1.694ThrAsn: 1.694 ± 0.046
2.844ThrPro: 2.844 ± 0.072
2.89ThrGln: 2.89 ± 0.053
2.091ThrArg: 2.091 ± 0.04
3.244ThrSer: 3.244 ± 0.055
2.627ThrThr: 2.627 ± 0.046
3.612ThrVal: 3.612 ± 0.072
0.656ThrTrp: 0.656 ± 0.025
1.411ThrTyr: 1.411 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
6.77ValAla: 6.77 ± 0.083
0.793ValCys: 0.793 ± 0.026
3.856ValAsp: 3.856 ± 0.058
4.196ValGlu: 4.196 ± 0.062
2.803ValPhe: 2.803 ± 0.049
4.254ValGly: 4.254 ± 0.06
1.2ValHis: 1.2 ± 0.031
4.496ValIle: 4.496 ± 0.064
3.691ValLys: 3.691 ± 0.058
6.779ValLeu: 6.779 ± 0.077
1.925ValMet: 1.925 ± 0.042
3.058ValAsn: 3.058 ± 0.051
2.722ValPro: 2.722 ± 0.049
2.523ValGln: 2.523 ± 0.044
2.598ValArg: 2.598 ± 0.043
4.992ValSer: 4.992 ± 0.072
4.064ValThr: 4.064 ± 0.075
5.031ValVal: 5.031 ± 0.072
0.734ValTrp: 0.734 ± 0.025
1.854ValTyr: 1.854 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
0.853TrpAla: 0.853 ± 0.026
0.163TrpCys: 0.163 ± 0.013
0.518TrpAsp: 0.518 ± 0.023
0.483TrpGlu: 0.483 ± 0.025
0.651TrpPhe: 0.651 ± 0.022
0.735TrpGly: 0.735 ± 0.025
0.482TrpHis: 0.482 ± 0.022
0.535TrpIle: 0.535 ± 0.021
0.4TrpLys: 0.4 ± 0.019
2.075TrpLeu: 2.075 ± 0.052
0.272TrpMet: 0.272 ± 0.013
0.464TrpAsn: 0.464 ± 0.02
0.587TrpPro: 0.587 ± 0.02
1.503TrpGln: 1.503 ± 0.036
0.734TrpArg: 0.734 ± 0.024
0.818TrpSer: 0.818 ± 0.03
0.481TrpThr: 0.481 ± 0.019
0.845TrpVal: 0.845 ± 0.026
0.223TrpTrp: 0.223 ± 0.014
0.372TrpTyr: 0.372 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.203TyrAla: 2.203 ± 0.047
0.431TyrCys: 0.431 ± 0.019
1.72TyrAsp: 1.72 ± 0.041
1.43TyrGlu: 1.43 ± 0.033
1.411TyrPhe: 1.411 ± 0.041
2.084TyrGly: 2.084 ± 0.045
0.894TyrHis: 0.894 ± 0.03
1.645TyrIle: 1.645 ± 0.036
1.272TyrLys: 1.272 ± 0.033
3.531TyrLeu: 3.531 ± 0.058
0.621TyrMet: 0.621 ± 0.024
1.099TyrAsn: 1.099 ± 0.034
1.386TyrPro: 1.386 ± 0.036
2.617TyrGln: 2.617 ± 0.05
1.689TyrArg: 1.689 ± 0.036
1.96TyrSer: 1.96 ± 0.04
1.297TyrThr: 1.297 ± 0.033
1.778TyrVal: 1.778 ± 0.038
0.503TyrTrp: 0.503 ± 0.023
1.02TyrTyr: 1.02 ± 0.03
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4087 proteins (1321219 amino acids)
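Each data row of the table above has the form `8.708AlaAla: 8.708 ± 0.106`, that is, the cell value followed by the dipeptide name, the value again and its error. A minimal sketch for reading such rows back into structured data is shown here. The regular expression and the function `parse_row` are assumptions based only on the layout visible on this page, not on the format of the downloadable CSV file.

```python
import re

# Pattern: <cell value><First><Second>: <value> ± <error>,
# where First/Second are three-letter residue codes such as Ala or Xaa.
ROW = re.compile(
    r"^([\d.]+)([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)$"
)


def parse_row(line):
    """Parse one table row into (first, second, value, error).

    Returns None for lines that are not data rows, such as the row labels.
    """
    match = ROW.match(line.strip())
    if match is None:
        return None
    _cell, first, second, value, error = match.groups()
    return first, second, float(value), float(error)


row = parse_row("8.708AlaAla: 8.708 ± 0.106")
label = parse_row("Ala")
```

Row labels such as `Ala` do not match the pattern and are returned as `None`, so they can be skipped when looping over the table.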

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
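Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. This is a minimal sketch under those assumptions. The function names, the fixed seed and the use of the sample standard deviation are illustrative choices, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(sequences, pair, n_boot=100, seed=0):
    """Estimate mean and standard deviation (per mille) of one dipeptide.

    Proteins, not individual dipeptides, are resampled with replacement.
    n_boot must be at least 2 for the standard deviation to be defined.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not taken from the proteome.
mean_ll, sd_ll = bootstrap_permille(["MKLLA", "LLLK", "MAAL", "KLLM"], "LL")
```

Resampling whole proteins keeps the dipeptides of one protein together, which matters because pairs within a protein are not independent of each other.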

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski