Amino acid dipeptide frequency for Bombella sp. KACC 21507

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
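The counting and the comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the code used to build the table: it assumes one-letter sequences, overlapping pairs counted within each protein, and a simple above/below 2.5‰ rule for the colouring. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as Xaa ("X").
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to "X" (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed colouring rule)."""
    if freq_per_mille > expected:
        return "over"   # would be shown in red
    if freq_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"


toy = ["MALLAK", "MKLAA"]  # made-up sequences, for illustration only
freqs = dipeptide_per_mille(toy)
```

With real data, `proteins` would be the sequences of the proteome; pairs are never counted across protein boundaries in this sketch.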

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
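As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and into percent. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 8.792  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.008792
percent = per_mille_to_percent(ala_ala)    # about 0.8792
```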

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.792 ± 0.164
AlaCys: 1.078 ± 0.052
AlaAsp: 4.331 ± 0.088
AlaGlu: 5.497 ± 0.116
AlaPhe: 3.646 ± 0.092
AlaGly: 7.081 ± 0.151
AlaHis: 2.353 ± 0.068
AlaIle: 5.746 ± 0.113
AlaLys: 4.311 ± 0.103
AlaLeu: 12.05 ± 0.165
AlaMet: 2.19 ± 0.067
AlaAsn: 2.668 ± 0.085
AlaPro: 4.03 ± 0.104
AlaGln: 4.397 ± 0.097
AlaArg: 5.584 ± 0.136
AlaSer: 5.913 ± 0.124
AlaThr: 4.293 ± 0.094
AlaVal: 5.728 ± 0.125
AlaTrp: 1.04 ± 0.051
AlaTyr: 2.309 ± 0.074
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.984 ± 0.041
CysCys: 0.155 ± 0.018
CysAsp: 0.57 ± 0.033
CysGlu: 0.526 ± 0.037
CysPhe: 0.45 ± 0.033
CysGly: 0.942 ± 0.045
CysHis: 0.259 ± 0.025
CysIle: 0.526 ± 0.033
CysLys: 0.317 ± 0.025
CysLeu: 1.247 ± 0.051
CysMet: 0.149 ± 0.016
CysAsn: 0.317 ± 0.027
CysPro: 0.49 ± 0.033
CysGln: 0.39 ± 0.03
CysArg: 0.498 ± 0.029
CysSer: 0.707 ± 0.041
CysThr: 0.508 ± 0.033
CysVal: 0.789 ± 0.046
CysTrp: 0.122 ± 0.015
CysTyr: 0.253 ± 0.023
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.068 ± 0.105
AspCys: 0.47 ± 0.032
AspAsp: 2.636 ± 0.088
AspGlu: 3.277 ± 0.09
AspPhe: 2.114 ± 0.069
AspGly: 3.785 ± 0.1
AspHis: 1.494 ± 0.052
AspIle: 3.905 ± 0.098
AspLys: 2.582 ± 0.085
AspLeu: 5.555 ± 0.113
AspMet: 1.225 ± 0.046
AspAsn: 1.927 ± 0.084
AspPro: 2.727 ± 0.082
AspGln: 2.016 ± 0.067
AspArg: 2.825 ± 0.07
AspSer: 2.863 ± 0.078
AspThr: 2.209 ± 0.069
AspVal: 3.614 ± 0.088
AspTrp: 0.719 ± 0.043
AspTyr: 1.67 ± 0.058
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.308 ± 0.149
GluCys: 0.42 ± 0.032
GluAsp: 3.028 ± 0.086
GluGlu: 3.985 ± 0.129
GluPhe: 1.729 ± 0.06
GluGly: 4.393 ± 0.106
GluHis: 1.271 ± 0.049
GluIle: 3.943 ± 0.108
GluLys: 4.106 ± 0.111
GluLeu: 5.23 ± 0.109
GluMet: 1.405 ± 0.053
GluAsn: 2.753 ± 0.078
GluPro: 2.06 ± 0.075
GluGln: 2.606 ± 0.088
GluArg: 3.989 ± 0.109
GluSer: 3.016 ± 0.091
GluThr: 3.628 ± 0.097
GluVal: 3.228 ± 0.095
GluTrp: 0.735 ± 0.038
GluTyr: 0.98 ± 0.042
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.285 ± 0.095
PheCys: 0.604 ± 0.034
PheAsp: 2.357 ± 0.071
PheGlu: 1.901 ± 0.063
PhePhe: 1.917 ± 0.084
PheGly: 3.064 ± 0.089
PheHis: 0.954 ± 0.045
PheIle: 2.548 ± 0.09
PheLys: 1.707 ± 0.065
PheLeu: 4.164 ± 0.1
PheMet: 0.907 ± 0.042
PheAsn: 1.482 ± 0.055
PhePro: 1.666 ± 0.059
PheGln: 1.203 ± 0.041
PheArg: 1.901 ± 0.064
PheSer: 3.361 ± 0.077
PheThr: 2.012 ± 0.06
PheVal: 2.463 ± 0.089
PheTrp: 0.592 ± 0.036
PheTyr: 1.183 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.481 ± 0.121
GlyCys: 0.885 ± 0.039
GlyAsp: 3.61 ± 0.087
GlyGlu: 3.891 ± 0.083
GlyPhe: 3.265 ± 0.08
GlyGly: 5.828 ± 0.156
GlyHis: 2.032 ± 0.073
GlyIle: 4.493 ± 0.096
GlyLys: 3.941 ± 0.1
GlyLeu: 7.852 ± 0.154
GlyMet: 1.861 ± 0.067
GlyAsn: 2.588 ± 0.096
GlyPro: 2.698 ± 0.079
GlyGln: 3.078 ± 0.086
GlyArg: 4.543 ± 0.101
GlySer: 4.604 ± 0.121
GlyThr: 4.114 ± 0.108
GlyVal: 4.969 ± 0.108
GlyTrp: 1.132 ± 0.059
GlyTyr: 2.09 ± 0.078
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.156 ± 0.071
HisCys: 0.237 ± 0.022
HisAsp: 1.401 ± 0.051
HisGlu: 1.231 ± 0.051
HisPhe: 1.189 ± 0.051
HisGly: 1.763 ± 0.06
HisHis: 0.901 ± 0.047
HisIle: 1.819 ± 0.064
HisLys: 1.295 ± 0.051
HisLeu: 2.604 ± 0.088
HisMet: 0.526 ± 0.036
HisAsn: 1.205 ± 0.048
HisPro: 1.429 ± 0.052
HisGln: 0.851 ± 0.04
HisArg: 1.269 ± 0.053
HisSer: 1.57 ± 0.058
HisThr: 1.058 ± 0.046
HisVal: 1.486 ± 0.052
HisTrp: 0.357 ± 0.029
HisTyr: 0.934 ± 0.04
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.433 ± 0.122
IleCys: 0.813 ± 0.045
IleAsp: 3.554 ± 0.077
IleGlu: 4.262 ± 0.099
IlePhe: 2.482 ± 0.076
IleGly: 5.007 ± 0.119
IleHis: 1.333 ± 0.051
IleIle: 4.548 ± 0.112
IleLys: 3.186 ± 0.08
IleLeu: 6.465 ± 0.135
IleMet: 1.391 ± 0.062
IleAsn: 2.277 ± 0.083
IlePro: 2.574 ± 0.075
IleGln: 1.715 ± 0.057
IleArg: 3.238 ± 0.074
IleSer: 4.7 ± 0.114
IleThr: 3.728 ± 0.1
IleVal: 4.007 ± 0.102
IleTrp: 0.721 ± 0.044
IleTyr: 1.498 ± 0.055
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.68 ± 0.113
LysCys: 0.323 ± 0.024
LysAsp: 2.849 ± 0.075
LysGlu: 3.847 ± 0.097
LysPhe: 1.327 ± 0.049
LysGly: 3.754 ± 0.105
LysHis: 1.086 ± 0.048
LysIle: 3.598 ± 0.097
LysLys: 3.746 ± 0.082
LysLeu: 4.859 ± 0.112
LysMet: 1.132 ± 0.046
LysAsn: 2.504 ± 0.079
LysPro: 2.273 ± 0.07
LysGln: 1.853 ± 0.065
LysArg: 3.373 ± 0.093
LysSer: 3.03 ± 0.076
LysThr: 3.247 ± 0.081
LysVal: 2.566 ± 0.089
LysTrp: 0.57 ± 0.034
LysTyr: 0.853 ± 0.047
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.27 ± 0.166
LeuCys: 1.349 ± 0.053
LeuAsp: 5.511 ± 0.113
LeuGlu: 6.206 ± 0.126
LeuPhe: 4.036 ± 0.105
LeuGly: 7.304 ± 0.127
LeuHis: 2.435 ± 0.079
LeuIle: 6.579 ± 0.14
LeuLys: 6.264 ± 0.13
LeuLeu: 10.593 ± 0.208
LeuMet: 2.377 ± 0.073
LeuAsn: 4.062 ± 0.109
LeuPro: 5.967 ± 0.107
LeuGln: 3.526 ± 0.097
LeuArg: 6.11 ± 0.132
LeuSer: 8.647 ± 0.179
LeuThr: 6.346 ± 0.128
LeuVal: 5.953 ± 0.112
LeuTrp: 1.359 ± 0.047
LeuTyr: 2.566 ± 0.079
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.488 ± 0.063
MetCys: 0.155 ± 0.018
MetAsp: 0.948 ± 0.046
MetGlu: 0.97 ± 0.045
MetPhe: 0.568 ± 0.036
MetGly: 1.721 ± 0.061
MetHis: 0.482 ± 0.034
MetIle: 1.401 ± 0.052
MetLys: 1.195 ± 0.057
MetLeu: 2.196 ± 0.067
MetMet: 0.6 ± 0.042
MetAsn: 0.833 ± 0.044
MetPro: 1.301 ± 0.045
MetGln: 0.841 ± 0.042
MetArg: 1.492 ± 0.062
MetSer: 1.853 ± 0.058
MetThr: 1.564 ± 0.057
MetVal: 1.419 ± 0.059
MetTrp: 0.167 ± 0.018
MetTyr: 0.273 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.379 ± 0.109
AsnCys: 0.339 ± 0.027
AsnAsp: 2.032 ± 0.064
AsnGlu: 2.05 ± 0.067
AsnPhe: 1.506 ± 0.049
AsnGly: 3.0 ± 0.114
AsnHis: 0.934 ± 0.044
AsnIle: 2.761 ± 0.083
AsnLys: 1.849 ± 0.067
AsnLeu: 3.857 ± 0.107
AsnMet: 0.857 ± 0.042
AsnAsn: 1.815 ± 0.099
AsnPro: 2.006 ± 0.067
AsnGln: 1.367 ± 0.051
AsnArg: 1.889 ± 0.065
AsnSer: 2.407 ± 0.098
AsnThr: 1.642 ± 0.084
AsnVal: 2.415 ± 0.071
AsnTrp: 0.566 ± 0.04
AsnTyr: 1.108 ± 0.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.947 ± 0.091
ProCys: 0.414 ± 0.031
ProAsp: 2.775 ± 0.074
ProGlu: 3.349 ± 0.078
ProPhe: 2.058 ± 0.056
ProGly: 3.198 ± 0.076
ProHis: 1.46 ± 0.054
ProIle: 2.614 ± 0.075
ProLys: 2.182 ± 0.063
ProLeu: 4.957 ± 0.118
ProMet: 0.887 ± 0.039
ProAsn: 1.618 ± 0.055
ProPro: 2.225 ± 0.083
ProGln: 2.144 ± 0.065
ProArg: 2.102 ± 0.069
ProSer: 3.481 ± 0.084
ProThr: 2.289 ± 0.066
ProVal: 3.271 ± 0.089
ProTrp: 0.709 ± 0.043
ProTyr: 1.355 ± 0.05
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.248 ± 0.1
GlnCys: 0.227 ± 0.022
GlnAsp: 1.853 ± 0.057
GlnGlu: 2.443 ± 0.072
GlnPhe: 1.293 ± 0.047
GlnGly: 2.498 ± 0.076
GlnHis: 0.909 ± 0.05
GlnIle: 2.449 ± 0.07
GlnLys: 2.771 ± 0.085
GlnLeu: 3.524 ± 0.091
GlnMet: 0.809 ± 0.038
GlnAsn: 2.014 ± 0.068
GlnPro: 1.634 ± 0.059
GlnGln: 1.66 ± 0.076
GlnArg: 2.245 ± 0.072
GlnSer: 2.606 ± 0.07
GlnThr: 2.271 ± 0.071
GlnVal: 2.026 ± 0.066
GlnTrp: 0.402 ± 0.027
GlnTyr: 0.809 ± 0.046
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.903 ± 0.117
ArgCys: 0.47 ± 0.03
ArgAsp: 3.092 ± 0.082
ArgGlu: 3.331 ± 0.104
ArgPhe: 2.656 ± 0.076
ArgGly: 3.524 ± 0.085
ArgHis: 1.693 ± 0.065
ArgIle: 3.271 ± 0.082
ArgLys: 2.815 ± 0.08
ArgLeu: 6.911 ± 0.147
ArgMet: 1.14 ± 0.046
ArgAsn: 2.106 ± 0.063
ArgPro: 2.718 ± 0.077
ArgGln: 2.437 ± 0.08
ArgArg: 3.748 ± 0.125
ArgSer: 3.582 ± 0.1
ArgThr: 2.554 ± 0.072
ArgVal: 3.69 ± 0.084
ArgTrp: 0.64 ± 0.04
ArgTyr: 1.823 ± 0.068
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.885 ± 0.127
SerCys: 0.667 ± 0.039
SerAsp: 3.433 ± 0.09
SerGlu: 3.805 ± 0.107
SerPhe: 3.313 ± 0.082
SerGly: 5.322 ± 0.142
SerHis: 1.893 ± 0.065
SerIle: 3.646 ± 0.092
SerLys: 2.803 ± 0.072
SerLeu: 8.336 ± 0.159
SerMet: 1.417 ± 0.051
SerAsn: 2.126 ± 0.081
SerPro: 3.118 ± 0.09
SerGln: 2.783 ± 0.085
SerArg: 3.596 ± 0.098
SerSer: 5.011 ± 0.132
SerThr: 3.244 ± 0.106
SerVal: 4.365 ± 0.096
SerTrp: 1.02 ± 0.047
SerTyr: 2.056 ± 0.077
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.752 ± 0.113
ThrCys: 0.412 ± 0.03
ThrAsp: 2.737 ± 0.078
ThrGlu: 2.759 ± 0.076
ThrPhe: 1.743 ± 0.059
ThrGly: 4.321 ± 0.106
ThrHis: 1.371 ± 0.049
ThrIle: 3.55 ± 0.087
ThrLys: 2.285 ± 0.081
ThrLeu: 6.872 ± 0.137
ThrMet: 1.162 ± 0.047
ThrAsn: 1.865 ± 0.086
ThrPro: 3.052 ± 0.083
ThrGln: 1.97 ± 0.068
ThrArg: 2.716 ± 0.076
ThrSer: 3.459 ± 0.104
ThrThr: 2.654 ± 0.085
ThrVal: 3.52 ± 0.084
ThrTrp: 0.53 ± 0.034
ThrTyr: 1.229 ± 0.052
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.72 ± 0.114
ValCys: 0.646 ± 0.034
ValAsp: 2.935 ± 0.084
ValGlu: 3.453 ± 0.094
ValPhe: 2.393 ± 0.084
ValGly: 4.379 ± 0.095
ValHis: 1.269 ± 0.053
ValIle: 4.481 ± 0.089
ValLys: 3.337 ± 0.092
ValLeu: 6.045 ± 0.115
ValMet: 1.674 ± 0.055
ValAsn: 2.138 ± 0.069
ValPro: 3.242 ± 0.085
ValGln: 2.15 ± 0.067
ValArg: 3.465 ± 0.085
ValSer: 4.648 ± 0.098
ValThr: 4.05 ± 0.085
ValVal: 4.128 ± 0.112
ValTrp: 0.673 ± 0.039
ValTyr: 1.261 ± 0.049
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.952 ± 0.05
TrpCys: 0.167 ± 0.017
TrpAsp: 0.508 ± 0.033
TrpGlu: 0.604 ± 0.04
TrpPhe: 0.52 ± 0.033
TrpGly: 0.863 ± 0.048
TrpHis: 0.432 ± 0.031
TrpIle: 0.634 ± 0.033
TrpLys: 0.598 ± 0.033
TrpLeu: 1.66 ± 0.068
TrpMet: 0.275 ± 0.021
TrpAsn: 0.484 ± 0.031
TrpPro: 0.602 ± 0.033
TrpGln: 0.632 ± 0.033
TrpArg: 0.954 ± 0.049
TrpSer: 0.827 ± 0.044
TrpThr: 0.552 ± 0.036
TrpVal: 0.847 ± 0.049
TrpTrp: 0.197 ± 0.021
TrpTyr: 0.303 ± 0.028
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.305 ± 0.098
TyrCys: 0.369 ± 0.029
TyrAsp: 1.522 ± 0.059
TyrGlu: 1.427 ± 0.058
TyrPhe: 1.064 ± 0.053
TyrGly: 2.206 ± 0.074
TyrHis: 0.719 ± 0.047
TyrIle: 1.421 ± 0.059
TyrLys: 1.179 ± 0.049
TyrLeu: 2.484 ± 0.077
TyrMet: 0.486 ± 0.032
TyrAsn: 0.996 ± 0.046
TyrPro: 1.223 ± 0.042
TyrGln: 1.072 ± 0.047
TyrArg: 1.568 ± 0.062
TyrSer: 1.556 ± 0.058
TyrThr: 0.996 ± 0.046
TyrVal: 1.602 ± 0.059
TyrTrp: 0.359 ± 0.028
TyrTyr: 0.775 ± 0.041
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1570 proteins (501926 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
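A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed in each of 100 rounds, and the spread is taken as the sample standard deviation. Whether the ± values in the table are standard deviations is an assumption; the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(proteins, pair, n_rounds=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse in every round
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MALLAK", "MKLAA", "MAAGLLA", "MLAKAL"]  # made-up sequences
mean, spread = bootstrap_dipeptide(toy, "LA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger spread.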

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski