Amino acid dipeptide frequency for Gemmobacter caeni

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
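As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein, that any non-standard letter is mapped to Xaa (written `X`), and that frequencies are normalized over all dipeptides in the proteome. The page does not describe the actual Proteome-pI procedure, so the function name `dipeptide_permille` and these choices are assumptions.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows: residues i and i+1 within the same protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (not real Gemmobacter caeni sequences).
toy = ["AAGL", "ALA"]
freqs = dipeptide_permille(toy)
```

In this toy input there are five dipeptides in total (AA, AG, GL, AL, LA), each occurring once.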

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
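The arithmetic behind these numbers can be sketched as follows. The `classify` helper is hypothetical: it assumes the colouring simply compares each value with the uniform expectation, which the page does not state explicitly.

```python
N_STANDARD = 20

n_pairs = N_STANDARD ** 2                  # 20 x 20 ordered pairs = 400
n_pairs_with_xaa = (N_STANDARD + 1) ** 2   # 21 x 21 ordered pairs = 441

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected_permille = 1000.0 / n_pairs       # 2.5 per mille = 0.25 %


def classify(observed_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "as expected"


# Three values taken from the table on this page.
examples = {"AlaAla": 19.293, "CysCys": 0.105, "GluVal": 4.488}
labels = {name: classify(value) for name, value in examples.items()}
```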

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
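For example, the conversion from per mille to a plain fraction or to percent could be written like this (the helper name `permille_to_fraction` is only illustrative):

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction between 0 and 1."""
    return value_permille * 1e-3


# AlaAla value taken from the table on this page.
ala_ala_fraction = permille_to_fraction(19.293)
ala_ala_percent = ala_ala_fraction * 100.0
```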

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.293 ± 0.187
AlaCys: 1.193 ± 0.032
AlaAsp: 7.22 ± 0.061
AlaGlu: 9.299 ± 0.1
AlaPhe: 4.419 ± 0.052
AlaGly: 11.688 ± 0.109
AlaHis: 2.388 ± 0.042
AlaIle: 5.607 ± 0.062
AlaLys: 3.497 ± 0.056
AlaLeu: 15.213 ± 0.142
AlaMet: 3.94 ± 0.056
AlaAsn: 2.514 ± 0.041
AlaPro: 6.684 ± 0.093
AlaGln: 4.497 ± 0.057
AlaArg: 10.434 ± 0.106
AlaSer: 5.59 ± 0.066
AlaThr: 6.326 ± 0.075
AlaVal: 8.802 ± 0.082
AlaTrp: 1.544 ± 0.033
AlaTyr: 2.411 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.145 ± 0.025
CysCys: 0.105 ± 0.01
CysAsp: 0.587 ± 0.023
CysGlu: 0.437 ± 0.017
CysPhe: 0.324 ± 0.015
CysGly: 1.003 ± 0.03
CysHis: 0.251 ± 0.013
CysIle: 0.412 ± 0.017
CysLys: 0.19 ± 0.01
CysLeu: 0.915 ± 0.026
CysMet: 0.171 ± 0.011
CysAsn: 0.213 ± 0.011
CysPro: 0.545 ± 0.02
CysGln: 0.242 ± 0.013
CysArg: 0.604 ± 0.02
CysSer: 0.398 ± 0.017
CysThr: 0.456 ± 0.016
CysVal: 0.574 ± 0.022
CysTrp: 0.13 ± 0.009
CysTyr: 0.2 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.799 ± 0.073
AspCys: 0.532 ± 0.02
AspAsp: 2.883 ± 0.054
AspGlu: 3.119 ± 0.048
AspPhe: 2.219 ± 0.037
AspGly: 5.331 ± 0.081
AspHis: 1.286 ± 0.031
AspIle: 2.56 ± 0.037
AspLys: 1.369 ± 0.034
AspLeu: 6.926 ± 0.079
AspMet: 1.477 ± 0.031
AspAsn: 1.021 ± 0.032
AspPro: 3.871 ± 0.062
AspGln: 1.749 ± 0.038
AspArg: 4.895 ± 0.06
AspSer: 2.1 ± 0.04
AspThr: 2.68 ± 0.062
AspVal: 3.666 ± 0.051
AspTrp: 1.284 ± 0.033
AspTyr: 1.385 ± 0.031
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.046 ± 0.095
GluCys: 0.357 ± 0.015
GluAsp: 3.015 ± 0.05
GluGlu: 3.177 ± 0.051
GluPhe: 1.661 ± 0.034
GluGly: 5.179 ± 0.063
GluHis: 1.003 ± 0.026
GluIle: 3.282 ± 0.045
GluLys: 1.892 ± 0.037
GluLeu: 4.941 ± 0.064
GluMet: 1.752 ± 0.036
GluAsn: 1.422 ± 0.026
GluPro: 2.697 ± 0.045
GluGln: 1.721 ± 0.037
GluArg: 4.356 ± 0.065
GluSer: 2.041 ± 0.034
GluThr: 3.746 ± 0.044
GluVal: 4.488 ± 0.061
GluTrp: 0.728 ± 0.021
GluTyr: 0.985 ± 0.029
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.51 ± 0.055
PheCys: 0.404 ± 0.016
PheAsp: 2.605 ± 0.044
PheGlu: 1.874 ± 0.035
PhePhe: 1.273 ± 0.033
PheGly: 3.658 ± 0.052
PheHis: 0.725 ± 0.023
PheIle: 1.492 ± 0.033
PheLys: 0.728 ± 0.021
PheLeu: 3.439 ± 0.056
PheMet: 0.782 ± 0.023
PheAsn: 0.889 ± 0.026
PhePro: 1.582 ± 0.032
PheGln: 0.9 ± 0.021
PheArg: 2.534 ± 0.045
PheSer: 1.98 ± 0.036
PheThr: 2.114 ± 0.037
PheVal: 2.449 ± 0.042
PheTrp: 0.555 ± 0.02
PheTyr: 0.819 ± 0.024
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.858 ± 0.12
GlyCys: 0.893 ± 0.024
GlyAsp: 4.623 ± 0.084
GlyGlu: 4.622 ± 0.059
GlyPhe: 3.674 ± 0.054
GlyGly: 7.976 ± 0.138
GlyHis: 1.966 ± 0.037
GlyIle: 4.281 ± 0.059
GlyLys: 2.995 ± 0.054
GlyLeu: 9.846 ± 0.105
GlyMet: 2.722 ± 0.041
GlyAsn: 2.203 ± 0.077
GlyPro: 4.121 ± 0.052
GlyGln: 3.347 ± 0.046
GlyArg: 6.641 ± 0.064
GlySer: 4.339 ± 0.061
GlyThr: 4.731 ± 0.067
GlyVal: 6.485 ± 0.071
GlyTrp: 1.681 ± 0.031
GlyTyr: 2.2 ± 0.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.362 ± 0.046
HisCys: 0.238 ± 0.012
HisAsp: 1.246 ± 0.036
HisGlu: 1.061 ± 0.025
HisPhe: 0.77 ± 0.019
HisGly: 1.935 ± 0.038
HisHis: 0.53 ± 0.023
HisIle: 0.893 ± 0.023
HisLys: 0.427 ± 0.017
HisLeu: 2.282 ± 0.046
HisMet: 0.497 ± 0.018
HisAsn: 0.417 ± 0.017
HisPro: 1.481 ± 0.033
HisGln: 0.538 ± 0.018
HisArg: 1.493 ± 0.036
HisSer: 0.848 ± 0.022
HisThr: 0.745 ± 0.024
HisVal: 1.443 ± 0.034
HisTrp: 0.374 ± 0.016
HisTyr: 0.521 ± 0.016
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.731 ± 0.071
IleCys: 0.596 ± 0.021
IleAsp: 3.135 ± 0.053
IleGlu: 3.117 ± 0.046
IlePhe: 1.542 ± 0.031
IleGly: 4.845 ± 0.06
IleHis: 0.914 ± 0.025
IleIle: 1.808 ± 0.037
IleLys: 1.011 ± 0.028
IleLeu: 4.498 ± 0.062
IleMet: 0.911 ± 0.027
IleAsn: 1.144 ± 0.028
IlePro: 2.183 ± 0.035
IleGln: 0.996 ± 0.029
IleArg: 3.569 ± 0.05
IleSer: 2.676 ± 0.046
IleThr: 2.781 ± 0.048
IleVal: 3.371 ± 0.049
IleTrp: 0.687 ± 0.021
IleTyr: 1.004 ± 0.025
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.674 ± 0.063
LysCys: 0.158 ± 0.011
LysAsp: 1.45 ± 0.036
LysGlu: 1.316 ± 0.035
LysPhe: 0.796 ± 0.022
LysGly: 2.692 ± 0.05
LysHis: 0.484 ± 0.02
LysIle: 1.375 ± 0.033
LysLys: 0.954 ± 0.032
LysLeu: 2.683 ± 0.039
LysMet: 0.772 ± 0.026
LysAsn: 0.701 ± 0.023
LysPro: 1.717 ± 0.036
LysGln: 0.761 ± 0.025
LysArg: 2.013 ± 0.038
LysSer: 1.502 ± 0.037
LysThr: 1.684 ± 0.039
LysVal: 2.141 ± 0.044
LysTrp: 0.361 ± 0.013
LysTyr: 0.576 ± 0.02
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.728 ± 0.127
LeuCys: 0.998 ± 0.029
LeuAsp: 5.941 ± 0.067
LeuGlu: 5.194 ± 0.061
LeuPhe: 3.414 ± 0.046
LeuGly: 8.641 ± 0.091
LeuHis: 2.116 ± 0.041
LeuIle: 5.074 ± 0.059
LeuLys: 3.068 ± 0.047
LeuLeu: 9.584 ± 0.104
LeuMet: 2.767 ± 0.048
LeuAsn: 2.512 ± 0.043
LeuPro: 6.346 ± 0.079
LeuGln: 2.574 ± 0.04
LeuArg: 8.073 ± 0.081
LeuSer: 6.44 ± 0.073
LeuThr: 6.384 ± 0.074
LeuVal: 7.066 ± 0.079
LeuTrp: 1.432 ± 0.031
LeuTyr: 1.907 ± 0.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.67 ± 0.058
MetCys: 0.173 ± 0.01
MetAsp: 1.227 ± 0.033
MetGlu: 1.327 ± 0.03
MetPhe: 0.754 ± 0.023
MetGly: 2.248 ± 0.044
MetHis: 0.394 ± 0.016
MetIle: 1.497 ± 0.031
MetLys: 0.947 ± 0.027
MetLeu: 2.52 ± 0.042
MetMet: 0.745 ± 0.023
MetAsn: 0.75 ± 0.022
MetPro: 1.519 ± 0.037
MetGln: 1.025 ± 0.028
MetArg: 2.046 ± 0.036
MetSer: 1.598 ± 0.036
MetThr: 2.196 ± 0.035
MetVal: 1.794 ± 0.036
MetTrp: 0.224 ± 0.013
MetTyr: 0.287 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.881 ± 0.049
AsnCys: 0.213 ± 0.011
AsnAsp: 1.4 ± 0.066
AsnGlu: 1.031 ± 0.026
AsnPhe: 0.85 ± 0.024
AsnGly: 2.243 ± 0.042
AsnHis: 0.455 ± 0.018
AsnIle: 1.163 ± 0.03
AsnLys: 0.547 ± 0.02
AsnLeu: 2.387 ± 0.041
AsnMet: 0.618 ± 0.022
AsnAsn: 0.511 ± 0.018
AsnPro: 1.787 ± 0.039
AsnGln: 0.608 ± 0.019
AsnArg: 1.784 ± 0.036
AsnSer: 1.007 ± 0.029
AsnThr: 1.141 ± 0.029
AsnVal: 1.545 ± 0.037
AsnTrp: 0.409 ± 0.017
AsnTyr: 0.536 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.039 ± 0.096
ProCys: 0.412 ± 0.016
ProAsp: 4.07 ± 0.062
ProGlu: 4.637 ± 0.068
ProPhe: 2.017 ± 0.035
ProGly: 5.257 ± 0.067
ProHis: 1.083 ± 0.029
ProIle: 1.885 ± 0.035
ProLys: 1.548 ± 0.033
ProLeu: 5.074 ± 0.062
ProMet: 1.345 ± 0.032
ProAsn: 1.142 ± 0.032
ProPro: 2.726 ± 0.064
ProGln: 1.767 ± 0.034
ProArg: 3.218 ± 0.046
ProSer: 2.392 ± 0.038
ProThr: 2.264 ± 0.039
ProVal: 4.877 ± 0.055
ProTrp: 0.737 ± 0.025
ProTyr: 1.081 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.235 ± 0.055
GlnCys: 0.202 ± 0.012
GlnAsp: 1.491 ± 0.034
GlnGlu: 1.492 ± 0.033
GlnPhe: 0.953 ± 0.024
GlnGly: 2.764 ± 0.046
GlnHis: 0.575 ± 0.018
GlnIle: 1.973 ± 0.036
GlnLys: 0.959 ± 0.028
GlnLeu: 2.625 ± 0.049
GlnMet: 1.029 ± 0.025
GlnAsn: 0.8 ± 0.024
GlnPro: 1.751 ± 0.032
GlnGln: 0.971 ± 0.027
GlnArg: 2.187 ± 0.04
GlnSer: 1.608 ± 0.037
GlnThr: 1.673 ± 0.031
GlnVal: 2.407 ± 0.038
GlnTrp: 0.361 ± 0.015
GlnTyr: 0.488 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.609 ± 0.084
ArgCys: 0.523 ± 0.019
ArgAsp: 4.525 ± 0.056
ArgGlu: 3.942 ± 0.056
ArgPhe: 2.889 ± 0.048
ArgGly: 5.364 ± 0.06
ArgHis: 1.74 ± 0.035
ArgIle: 4.226 ± 0.044
ArgLys: 2.219 ± 0.038
ArgLeu: 8.914 ± 0.098
ArgMet: 2.142 ± 0.042
ArgAsn: 1.812 ± 0.037
ArgPro: 3.833 ± 0.057
ArgGln: 2.497 ± 0.044
ArgArg: 5.872 ± 0.082
ArgSer: 3.415 ± 0.052
ArgThr: 3.081 ± 0.047
ArgVal: 5.144 ± 0.057
ArgTrp: 1.045 ± 0.03
ArgTyr: 1.533 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.97 ± 0.073
SerCys: 0.422 ± 0.017
SerAsp: 2.898 ± 0.049
SerGlu: 2.466 ± 0.036
SerPhe: 2.058 ± 0.034
SerGly: 5.537 ± 0.076
SerHis: 1.039 ± 0.025
SerIle: 2.072 ± 0.037
SerLys: 1.202 ± 0.033
SerLeu: 4.892 ± 0.061
SerMet: 1.143 ± 0.025
SerAsn: 1.144 ± 0.031
SerPro: 2.666 ± 0.044
SerGln: 1.419 ± 0.033
SerArg: 3.547 ± 0.041
SerSer: 2.292 ± 0.043
SerThr: 2.295 ± 0.044
SerVal: 3.689 ± 0.049
SerTrp: 0.702 ± 0.023
SerTyr: 1.143 ± 0.029
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.757 ± 0.077
ThrCys: 0.509 ± 0.02
ThrAsp: 3.022 ± 0.047
ThrGlu: 3.148 ± 0.05
ThrPhe: 1.878 ± 0.034
ThrGly: 5.727 ± 0.071
ThrHis: 1.067 ± 0.03
ThrIle: 2.474 ± 0.051
ThrLys: 1.185 ± 0.033
ThrLeu: 5.979 ± 0.074
ThrMet: 1.183 ± 0.028
ThrAsn: 1.163 ± 0.031
ThrPro: 3.511 ± 0.048
ThrGln: 1.379 ± 0.033
ThrArg: 3.754 ± 0.051
ThrSer: 2.466 ± 0.046
ThrThr: 2.813 ± 0.056
ThrVal: 4.079 ± 0.054
ThrTrp: 0.678 ± 0.023
ThrTyr: 1.123 ± 0.03
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.247 ± 0.09
ValCys: 0.619 ± 0.02
ValAsp: 3.658 ± 0.044
ValGlu: 4.377 ± 0.062
ValPhe: 2.491 ± 0.042
ValGly: 5.04 ± 0.058
ValHis: 1.269 ± 0.03
ValIle: 4.007 ± 0.059
ValLys: 2.138 ± 0.047
ValLeu: 7.638 ± 0.077
ValMet: 2.097 ± 0.04
ValAsn: 1.861 ± 0.033
ValPro: 3.771 ± 0.048
ValGln: 2.254 ± 0.038
ValArg: 4.463 ± 0.054
ValSer: 4.034 ± 0.062
ValThr: 4.969 ± 0.061
ValVal: 5.452 ± 0.079
ValTrp: 1.032 ± 0.029
ValTyr: 1.378 ± 0.031
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.532 ± 0.035
TrpCys: 0.137 ± 0.01
TrpAsp: 0.707 ± 0.022
TrpGlu: 0.696 ± 0.022
TrpPhe: 0.568 ± 0.022
TrpGly: 1.075 ± 0.027
TrpHis: 0.379 ± 0.02
TrpIle: 0.644 ± 0.021
TrpLys: 0.433 ± 0.02
TrpLeu: 1.789 ± 0.041
TrpMet: 0.415 ± 0.018
TrpAsn: 0.412 ± 0.018
TrpPro: 0.734 ± 0.021
TrpGln: 0.661 ± 0.02
TrpArg: 1.233 ± 0.031
TrpSer: 0.843 ± 0.024
TrpThr: 0.776 ± 0.025
TrpVal: 0.976 ± 0.026
TrpTrp: 0.237 ± 0.014
TrpTyr: 0.292 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.477 ± 0.043
TyrCys: 0.216 ± 0.013
TyrAsp: 1.452 ± 0.031
TyrGlu: 1.157 ± 0.029
TyrPhe: 0.761 ± 0.021
TyrGly: 2.002 ± 0.041
TyrHis: 0.439 ± 0.019
TyrIle: 0.793 ± 0.026
TyrLys: 0.502 ± 0.018
TyrLeu: 2.109 ± 0.041
TyrMet: 0.432 ± 0.017
TyrAsn: 0.484 ± 0.017
TyrPro: 1.044 ± 0.024
TyrGln: 0.641 ± 0.022
TyrArg: 1.609 ± 0.033
TyrSer: 1.03 ± 0.031
TyrThr: 1.038 ± 0.027
TyrVal: 1.342 ± 0.03
TyrTrp: 0.341 ± 0.017
TyrTyr: 0.497 ± 0.016
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5113 proteins (1552672 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
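A protein-level bootstrap of this kind could be sketched as below. The note only says that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies are assumptions, and the function names are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together (assumed meaning of "protein level").
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported "±" is the standard deviation over rounds.
    return stdev(estimates)


# Toy input (not real Gemmobacter caeni sequences).
toy_proteins = ["AAGL", "ALAAA", "GGLA", "AAAA"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Fixing the seed makes the estimate reproducible; with a real proteome the list would hold all protein sequences rather than four toy strings.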

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski