Amino acid dipepetide frequency for Pantoea sp. Eser

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.881AlaAla: 10.881 ± 0.179
1.068AlaCys: 1.068 ± 0.046
5.494AlaAsp: 5.494 ± 0.09
6.382AlaGlu: 6.382 ± 0.103
3.471AlaPhe: 3.471 ± 0.084
7.605AlaGly: 7.605 ± 0.12
1.909AlaHis: 1.909 ± 0.064
5.827AlaIle: 5.827 ± 0.111
3.733AlaLys: 3.733 ± 0.092
12.21AlaLeu: 12.21 ± 0.151
2.968AlaMet: 2.968 ± 0.073
2.845AlaAsn: 2.845 ± 0.075
3.583AlaPro: 3.583 ± 0.092
4.754AlaGln: 4.754 ± 0.101
6.069AlaArg: 6.069 ± 0.113
5.67AlaSer: 5.67 ± 0.098
4.561AlaThr: 4.561 ± 0.084
6.702AlaVal: 6.702 ± 0.106
1.405AlaTrp: 1.405 ± 0.049
2.015AlaTyr: 2.015 ± 0.058
0.0AlaXaa: 0.0 ± 0.0
Cys
0.911CysAla: 0.911 ± 0.039
0.199CysCys: 0.199 ± 0.019
0.662CysAsp: 0.662 ± 0.034
0.639CysGlu: 0.639 ± 0.032
0.396CysPhe: 0.396 ± 0.024
1.07CysGly: 1.07 ± 0.039
0.333CysHis: 0.333 ± 0.025
0.552CysIle: 0.552 ± 0.037
0.328CysLys: 0.328 ± 0.021
0.95CysLeu: 0.95 ± 0.044
0.224CysMet: 0.224 ± 0.018
0.336CysAsn: 0.336 ± 0.022
0.479CysPro: 0.479 ± 0.029
0.46CysGln: 0.46 ± 0.025
0.67CysArg: 0.67 ± 0.031
0.712CysSer: 0.712 ± 0.035
0.459CysThr: 0.459 ± 0.028
0.734CysVal: 0.734 ± 0.039
0.173CysTrp: 0.173 ± 0.017
0.322CysTyr: 0.322 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
5.657AspAla: 5.657 ± 0.09
0.567AspCys: 0.567 ± 0.031
3.064AspAsp: 3.064 ± 0.084
3.747AspGlu: 3.747 ± 0.08
2.158AspPhe: 2.158 ± 0.067
3.768AspGly: 3.768 ± 0.095
1.144AspHis: 1.144 ± 0.044
3.374AspIle: 3.374 ± 0.082
2.47AspLys: 2.47 ± 0.071
5.079AspLeu: 5.079 ± 0.086
1.36AspMet: 1.36 ± 0.044
2.099AspAsn: 2.099 ± 0.061
2.347AspPro: 2.347 ± 0.067
2.164AspGln: 2.164 ± 0.065
3.106AspArg: 3.106 ± 0.077
3.106AspSer: 3.106 ± 0.07
2.419AspThr: 2.419 ± 0.064
3.935AspVal: 3.935 ± 0.075
0.861AspTrp: 0.861 ± 0.041
1.976AspTyr: 1.976 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
5.957GluAla: 5.957 ± 0.123
0.438GluCys: 0.438 ± 0.027
2.411GluAsp: 2.411 ± 0.069
3.67GluGlu: 3.67 ± 0.101
1.901GluPhe: 1.901 ± 0.054
3.627GluGly: 3.627 ± 0.085
1.441GluHis: 1.441 ± 0.049
3.456GluIle: 3.456 ± 0.085
3.193GluLys: 3.193 ± 0.076
6.02GluLeu: 6.02 ± 0.097
1.892GluMet: 1.892 ± 0.061
2.245GluAsn: 2.245 ± 0.073
2.097GluPro: 2.097 ± 0.063
3.702GluGln: 3.702 ± 0.089
3.725GluArg: 3.725 ± 0.09
3.049GluSer: 3.049 ± 0.069
3.061GluThr: 3.061 ± 0.083
4.285GluVal: 4.285 ± 0.084
0.7GluTrp: 0.7 ± 0.031
1.286GluTyr: 1.286 ± 0.047
0.0GluXaa: 0.0 ± 0.0
Phe
3.496PheAla: 3.496 ± 0.075
0.538PheCys: 0.538 ± 0.027
2.52PheAsp: 2.52 ± 0.066
1.906PheGlu: 1.906 ± 0.055
1.516PhePhe: 1.516 ± 0.057
2.988PheGly: 2.988 ± 0.078
0.857PheHis: 0.857 ± 0.039
2.332PheIle: 2.332 ± 0.07
1.262PheLys: 1.262 ± 0.043
3.206PheLeu: 3.206 ± 0.082
0.953PheMet: 0.953 ± 0.042
1.651PheAsn: 1.651 ± 0.048
1.522PhePro: 1.522 ± 0.044
1.236PheGln: 1.236 ± 0.044
1.948PheArg: 1.948 ± 0.047
2.965PheSer: 2.965 ± 0.073
2.156PheThr: 2.156 ± 0.057
2.417PheVal: 2.417 ± 0.066
0.571PheTrp: 0.571 ± 0.034
1.129PheTyr: 1.129 ± 0.051
0.0PheXaa: 0.0 ± 0.0
Gly
6.057GlyAla: 6.057 ± 0.12
1.017GlyCys: 1.017 ± 0.042
3.922GlyAsp: 3.922 ± 0.074
4.655GlyGlu: 4.655 ± 0.097
3.204GlyPhe: 3.204 ± 0.077
5.171GlyGly: 5.171 ± 0.108
1.732GlyHis: 1.732 ± 0.056
4.667GlyIle: 4.667 ± 0.089
3.971GlyLys: 3.971 ± 0.081
7.35GlyLeu: 7.35 ± 0.132
2.309GlyMet: 2.309 ± 0.068
2.585GlyAsn: 2.585 ± 0.066
2.086GlyPro: 2.086 ± 0.062
3.018GlyGln: 3.018 ± 0.07
4.061GlyArg: 4.061 ± 0.096
4.211GlySer: 4.211 ± 0.088
3.573GlyThr: 3.573 ± 0.094
5.572GlyVal: 5.572 ± 0.092
1.231GlyTrp: 1.231 ± 0.044
2.371GlyTyr: 2.371 ± 0.067
0.0GlyXaa: 0.0 ± 0.0
His
1.95HisAla: 1.95 ± 0.058
0.375HisCys: 0.375 ± 0.024
1.213HisAsp: 1.213 ± 0.052
1.101HisGlu: 1.101 ± 0.051
1.051HisPhe: 1.051 ± 0.044
1.853HisGly: 1.853 ± 0.058
0.812HisHis: 0.812 ± 0.04
1.278HisIle: 1.278 ± 0.039
0.799HisLys: 0.799 ± 0.039
2.438HisLeu: 2.438 ± 0.06
0.574HisMet: 0.574 ± 0.03
0.866HisAsn: 0.866 ± 0.035
1.469HisPro: 1.469 ± 0.049
1.379HisGln: 1.379 ± 0.052
1.41HisArg: 1.41 ± 0.046
1.371HisSer: 1.371 ± 0.045
1.101HisThr: 1.101 ± 0.041
1.306HisVal: 1.306 ± 0.048
0.423HisTrp: 0.423 ± 0.023
0.911HisTyr: 0.911 ± 0.042
0.0HisXaa: 0.0 ± 0.0
Ile
6.113IleAla: 6.113 ± 0.116
0.636IleCys: 0.636 ± 0.031
3.767IleAsp: 3.767 ± 0.073
3.538IleGlu: 3.538 ± 0.087
1.836IlePhe: 1.836 ± 0.061
4.488IleGly: 4.488 ± 0.09
1.143IleHis: 1.143 ± 0.046
3.153IleIle: 3.153 ± 0.074
2.377IleLys: 2.377 ± 0.059
4.655IleLeu: 4.655 ± 0.103
1.219IleMet: 1.219 ± 0.047
2.403IleAsn: 2.403 ± 0.064
2.503IlePro: 2.503 ± 0.053
1.926IleGln: 1.926 ± 0.055
3.154IleArg: 3.154 ± 0.072
3.644IleSer: 3.644 ± 0.085
3.296IleThr: 3.296 ± 0.068
3.809IleVal: 3.809 ± 0.092
0.625IleTrp: 0.625 ± 0.033
1.491IleTyr: 1.491 ± 0.046
0.0IleXaa: 0.0 ± 0.0
Lys
4.283LysAla: 4.283 ± 0.087
0.28LysCys: 0.28 ± 0.019
1.995LysAsp: 1.995 ± 0.06
2.327LysGlu: 2.327 ± 0.062
1.076LysPhe: 1.076 ± 0.041
2.93LysGly: 2.93 ± 0.084
0.871LysHis: 0.871 ± 0.039
2.259LysIle: 2.259 ± 0.066
2.267LysLys: 2.267 ± 0.071
4.28LysLeu: 4.28 ± 0.098
1.262LysMet: 1.262 ± 0.043
1.606LysAsn: 1.606 ± 0.062
2.088LysPro: 2.088 ± 0.059
2.128LysGln: 2.128 ± 0.062
2.747LysArg: 2.747 ± 0.068
2.344LysSer: 2.344 ± 0.073
2.358LysThr: 2.358 ± 0.06
3.016LysVal: 3.016 ± 0.069
0.432LysTrp: 0.432 ± 0.027
1.048LysTyr: 1.048 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
11.464LeuAla: 11.464 ± 0.139
1.168LeuCys: 1.168 ± 0.04
5.698LeuAsp: 5.698 ± 0.102
5.839LeuGlu: 5.839 ± 0.116
4.204LeuPhe: 4.204 ± 0.092
7.102LeuGly: 7.102 ± 0.11
2.408LeuHis: 2.408 ± 0.065
5.814LeuIle: 5.814 ± 0.119
4.613LeuLys: 4.613 ± 0.099
11.764LeuLeu: 11.764 ± 0.203
3.007LeuMet: 3.007 ± 0.076
4.213LeuAsn: 4.213 ± 0.084
5.693LeuPro: 5.693 ± 0.102
4.855LeuGln: 4.855 ± 0.093
6.578LeuArg: 6.578 ± 0.116
7.232LeuSer: 7.232 ± 0.119
6.209LeuThr: 6.209 ± 0.093
7.1LeuVal: 7.1 ± 0.125
1.315LeuTrp: 1.315 ± 0.054
2.511LeuTyr: 2.511 ± 0.06
0.0LeuXaa: 0.0 ± 0.0
Met
2.943MetAla: 2.943 ± 0.072
0.183MetCys: 0.183 ± 0.014
1.171MetAsp: 1.171 ± 0.046
1.284MetGlu: 1.284 ± 0.041
0.849MetPhe: 0.849 ± 0.041
1.834MetGly: 1.834 ± 0.057
0.516MetHis: 0.516 ± 0.031
1.427MetIle: 1.427 ± 0.051
1.457MetLys: 1.457 ± 0.049
3.087MetLeu: 3.087 ± 0.059
0.858MetMet: 0.858 ± 0.037
1.136MetAsn: 1.136 ± 0.04
1.36MetPro: 1.36 ± 0.044
1.286MetGln: 1.286 ± 0.038
1.618MetArg: 1.618 ± 0.047
1.828MetSer: 1.828 ± 0.054
1.719MetThr: 1.719 ± 0.053
1.943MetVal: 1.943 ± 0.049
0.235MetTrp: 0.235 ± 0.018
0.462MetTyr: 0.462 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
3.507AsnAla: 3.507 ± 0.08
0.323AsnCys: 0.323 ± 0.024
2.047AsnAsp: 2.047 ± 0.06
1.737AsnGlu: 1.737 ± 0.054
1.346AsnPhe: 1.346 ± 0.048
2.85AsnGly: 2.85 ± 0.06
0.875AsnHis: 0.875 ± 0.041
2.113AsnIle: 2.113 ± 0.057
1.482AsnLys: 1.482 ± 0.055
3.602AsnLeu: 3.602 ± 0.071
0.849AsnMet: 0.849 ± 0.036
1.433AsnAsn: 1.433 ± 0.058
2.113AsnPro: 2.113 ± 0.057
1.721AsnGln: 1.721 ± 0.053
2.024AsnArg: 2.024 ± 0.059
1.923AsnSer: 1.923 ± 0.054
1.758AsnThr: 1.758 ± 0.054
2.452AsnVal: 2.452 ± 0.069
0.557AsnTrp: 0.557 ± 0.029
1.001AsnTyr: 1.001 ± 0.046
0.0AsnXaa: 0.0 ± 0.0
Pro
4.846ProAla: 4.846 ± 0.102
0.414ProCys: 0.414 ± 0.025
2.929ProAsp: 2.929 ± 0.068
3.294ProGlu: 3.294 ± 0.083
1.802ProPhe: 1.802 ± 0.052
3.426ProGly: 3.426 ± 0.074
1.124ProHis: 1.124 ± 0.042
1.942ProIle: 1.942 ± 0.054
1.447ProLys: 1.447 ± 0.052
5.059ProLeu: 5.059 ± 0.104
1.094ProMet: 1.094 ± 0.044
1.234ProAsn: 1.234 ± 0.039
1.707ProPro: 1.707 ± 0.049
2.323ProGln: 2.323 ± 0.059
2.044ProArg: 2.044 ± 0.056
2.206ProSer: 2.206 ± 0.059
2.1ProThr: 2.1 ± 0.06
3.899ProVal: 3.899 ± 0.083
0.631ProTrp: 0.631 ± 0.03
1.155ProTyr: 1.155 ± 0.039
0.0ProXaa: 0.0 ± 0.0
Gln
5.206GlnAla: 5.206 ± 0.112
0.382GlnCys: 0.382 ± 0.021
2.051GlnAsp: 2.051 ± 0.059
2.096GlnGlu: 2.096 ± 0.065
1.513GlnPhe: 1.513 ± 0.053
3.293GlnGly: 3.293 ± 0.073
1.471GlnHis: 1.471 ± 0.055
2.327GlnIle: 2.327 ± 0.064
1.684GlnLys: 1.684 ± 0.058
5.783GlnLeu: 5.783 ± 0.113
1.36GlnMet: 1.36 ± 0.047
1.408GlnAsn: 1.408 ± 0.045
2.711GlnPro: 2.711 ± 0.066
4.278GlnGln: 4.278 ± 0.1
3.719GlnArg: 3.719 ± 0.094
2.346GlnSer: 2.346 ± 0.066
2.246GlnThr: 2.246 ± 0.064
3.485GlnVal: 3.485 ± 0.076
0.583GlnTrp: 0.583 ± 0.03
1.147GlnTyr: 1.147 ± 0.043
0.0GlnXaa: 0.0 ± 0.0
Arg
5.14ArgAla: 5.14 ± 0.098
0.636ArgCys: 0.636 ± 0.035
3.416ArgAsp: 3.416 ± 0.085
4.031ArgGlu: 4.031 ± 0.095
2.794ArgPhe: 2.794 ± 0.065
3.697ArgGly: 3.697 ± 0.082
1.743ArgHis: 1.743 ± 0.05
3.515ArgIle: 3.515 ± 0.075
2.386ArgLys: 2.386 ± 0.072
6.943ArgLeu: 6.943 ± 0.108
1.595ArgMet: 1.595 ± 0.049
2.072ArgAsn: 2.072 ± 0.06
2.48ArgPro: 2.48 ± 0.066
3.498ArgGln: 3.498 ± 0.084
3.897ArgArg: 3.897 ± 0.087
3.005ArgSer: 3.005 ± 0.081
2.581ArgThr: 2.581 ± 0.06
4.067ArgVal: 4.067 ± 0.092
0.984ArgTrp: 0.984 ± 0.039
2.153ArgTyr: 2.153 ± 0.062
0.0ArgXaa: 0.0 ± 0.0
Ser
5.789SerAla: 5.789 ± 0.096
0.574SerCys: 0.574 ± 0.028
3.33SerAsp: 3.33 ± 0.065
3.366SerGlu: 3.366 ± 0.071
2.125SerPhe: 2.125 ± 0.056
5.276SerGly: 5.276 ± 0.094
1.488SerHis: 1.488 ± 0.046
2.733SerIle: 2.733 ± 0.078
2.127SerLys: 2.127 ± 0.057
6.694SerLeu: 6.694 ± 0.109
1.452SerMet: 1.452 ± 0.043
1.974SerAsn: 1.974 ± 0.061
2.557SerPro: 2.557 ± 0.069
2.749SerGln: 2.749 ± 0.071
3.675SerArg: 3.675 ± 0.077
3.604SerSer: 3.604 ± 0.089
2.801SerThr: 2.801 ± 0.071
4.121SerVal: 4.121 ± 0.081
0.925SerTrp: 0.925 ± 0.041
1.519SerTyr: 1.519 ± 0.049
0.0SerXaa: 0.0 ± 0.0
Thr
4.878ThrAla: 4.878 ± 0.095
0.477ThrCys: 0.477 ± 0.028
2.621ThrAsp: 2.621 ± 0.069
2.601ThrGlu: 2.601 ± 0.059
1.82ThrPhe: 1.82 ± 0.059
4.19ThrGly: 4.19 ± 0.077
1.225ThrHis: 1.225 ± 0.043
2.576ThrIle: 2.576 ± 0.073
1.514ThrLys: 1.514 ± 0.05
7.417ThrLeu: 7.417 ± 0.103
1.084ThrMet: 1.084 ± 0.04
1.492ThrAsn: 1.492 ± 0.053
2.969ThrPro: 2.969 ± 0.074
2.183ThrGln: 2.183 ± 0.058
3.227ThrArg: 3.227 ± 0.064
2.766ThrSer: 2.766 ± 0.066
2.648ThrThr: 2.648 ± 0.067
3.711ThrVal: 3.711 ± 0.081
0.599ThrTrp: 0.599 ± 0.035
1.048ThrTyr: 1.048 ± 0.042
0.0ThrXaa: 0.0 ± 0.0
Val
6.966ValAla: 6.966 ± 0.116
0.732ValCys: 0.732 ± 0.036
3.95ValAsp: 3.95 ± 0.078
4.185ValGlu: 4.185 ± 0.088
2.444ValPhe: 2.444 ± 0.069
4.794ValGly: 4.794 ± 0.091
1.368ValHis: 1.368 ± 0.048
4.48ValIle: 4.48 ± 0.089
3.165ValLys: 3.165 ± 0.086
7.185ValLeu: 7.185 ± 0.118
2.175ValMet: 2.175 ± 0.06
2.764ValAsn: 2.764 ± 0.066
3.022ValPro: 3.022 ± 0.063
2.505ValGln: 2.505 ± 0.064
4.07ValArg: 4.07 ± 0.086
4.588ValSer: 4.588 ± 0.091
4.236ValThr: 4.236 ± 0.095
5.547ValVal: 5.547 ± 0.108
0.836ValTrp: 0.836 ± 0.035
1.625ValTyr: 1.625 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.852TrpAla: 0.852 ± 0.037
0.193TrpCys: 0.193 ± 0.017
0.619TrpAsp: 0.619 ± 0.035
0.561TrpGlu: 0.561 ± 0.033
0.62TrpPhe: 0.62 ± 0.032
0.735TrpGly: 0.735 ± 0.038
0.443TrpHis: 0.443 ± 0.026
0.712TrpIle: 0.712 ± 0.037
0.48TrpLys: 0.48 ± 0.027
2.136TrpLeu: 2.136 ± 0.067
0.395TrpMet: 0.395 ± 0.023
0.403TrpAsn: 0.403 ± 0.024
0.639TrpPro: 0.639 ± 0.031
1.281TrpGln: 1.281 ± 0.053
1.043TrpArg: 1.043 ± 0.041
0.693TrpSer: 0.693 ± 0.036
0.454TrpThr: 0.454 ± 0.029
0.857TrpVal: 0.857 ± 0.038
0.201TrpTrp: 0.201 ± 0.019
0.379TrpTyr: 0.379 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.256TyrAla: 2.256 ± 0.057
0.407TyrCys: 0.407 ± 0.027
1.524TyrAsp: 1.524 ± 0.049
1.197TyrGlu: 1.197 ± 0.043
1.043TyrPhe: 1.043 ± 0.039
2.063TyrGly: 2.063 ± 0.059
0.776TyrHis: 0.776 ± 0.036
1.239TyrIle: 1.239 ± 0.043
0.843TyrLys: 0.843 ± 0.038
2.927TyrLeu: 2.927 ± 0.072
0.567TyrMet: 0.567 ± 0.027
0.945TyrAsn: 0.945 ± 0.042
1.25TyrPro: 1.25 ± 0.05
1.646TyrGln: 1.646 ± 0.054
1.845TyrArg: 1.845 ± 0.051
1.617TyrSer: 1.617 ± 0.051
1.309TyrThr: 1.309 ± 0.04
1.682TyrVal: 1.682 ± 0.053
0.41TyrTrp: 0.41 ± 0.029
0.798TyrTyr: 0.798 ± 0.038
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2328 proteins (643241 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski