Amino acid dipepetide frequency for Polaribacter glomeratus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.231AlaAla: 4.231 ± 0.073
0.494AlaCys: 0.494 ± 0.02
3.359AlaAsp: 3.359 ± 0.062
3.594AlaGlu: 3.594 ± 0.069
3.404AlaPhe: 3.404 ± 0.059
3.874AlaGly: 3.874 ± 0.069
0.97AlaHis: 0.97 ± 0.031
5.851AlaIle: 5.851 ± 0.087
4.857AlaLys: 4.857 ± 0.072
5.575AlaLeu: 5.575 ± 0.075
1.299AlaMet: 1.299 ± 0.038
3.715AlaAsn: 3.715 ± 0.068
1.722AlaPro: 1.722 ± 0.039
2.168AlaGln: 2.168 ± 0.045
1.8AlaArg: 1.8 ± 0.039
4.218AlaSer: 4.218 ± 0.074
3.939AlaThr: 3.939 ± 0.071
3.891AlaVal: 3.891 ± 0.065
0.538AlaTrp: 0.538 ± 0.02
2.188AlaTyr: 2.188 ± 0.044
0.0AlaXaa: 0.0 ± 0.0
Cys
0.416CysAla: 0.416 ± 0.02
0.091CysCys: 0.091 ± 0.009
0.407CysAsp: 0.407 ± 0.022
0.411CysGlu: 0.411 ± 0.019
0.432CysPhe: 0.432 ± 0.02
0.534CysGly: 0.534 ± 0.023
0.164CysHis: 0.164 ± 0.014
0.545CysIle: 0.545 ± 0.021
0.513CysLys: 0.513 ± 0.021
0.573CysLeu: 0.573 ± 0.025
0.127CysMet: 0.127 ± 0.01
0.444CysAsn: 0.444 ± 0.022
0.266CysPro: 0.266 ± 0.018
0.189CysGln: 0.189 ± 0.013
0.169CysArg: 0.169 ± 0.012
0.475CysSer: 0.475 ± 0.027
0.405CysThr: 0.405 ± 0.019
0.405CysVal: 0.405 ± 0.024
0.062CysTrp: 0.062 ± 0.007
0.281CysTyr: 0.281 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.787AspAla: 3.787 ± 0.071
0.386AspCys: 0.386 ± 0.02
2.694AspAsp: 2.694 ± 0.059
3.697AspGlu: 3.697 ± 0.059
3.85AspPhe: 3.85 ± 0.051
3.486AspGly: 3.486 ± 0.108
0.68AspHis: 0.68 ± 0.024
4.452AspIle: 4.452 ± 0.073
4.517AspLys: 4.517 ± 0.072
5.074AspLeu: 5.074 ± 0.078
0.904AspMet: 0.904 ± 0.032
2.983AspAsn: 2.983 ± 0.059
1.327AspPro: 1.327 ± 0.035
1.291AspGln: 1.291 ± 0.032
1.593AspArg: 1.593 ± 0.037
3.041AspSer: 3.041 ± 0.057
2.834AspThr: 2.834 ± 0.064
3.729AspVal: 3.729 ± 0.066
0.718AspTrp: 0.718 ± 0.027
2.428AspTyr: 2.428 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
3.953GluAla: 3.953 ± 0.066
0.301GluCys: 0.301 ± 0.017
3.331GluAsp: 3.331 ± 0.062
4.427GluGlu: 4.427 ± 0.075
3.076GluPhe: 3.076 ± 0.055
3.254GluGly: 3.254 ± 0.06
1.008GluHis: 1.008 ± 0.03
6.217GluIle: 6.217 ± 0.085
6.38GluLys: 6.38 ± 0.095
6.038GluLeu: 6.038 ± 0.094
1.522GluMet: 1.522 ± 0.034
5.475GluAsn: 5.475 ± 0.077
1.312GluPro: 1.312 ± 0.032
1.987GluGln: 1.987 ± 0.046
2.208GluArg: 2.208 ± 0.048
3.271GluSer: 3.271 ± 0.046
3.895GluThr: 3.895 ± 0.066
4.103GluVal: 4.103 ± 0.078
0.58GluTrp: 0.58 ± 0.025
2.371GluTyr: 2.371 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.029PheAla: 3.029 ± 0.053
0.426PheCys: 0.426 ± 0.019
3.355PheAsp: 3.355 ± 0.054
3.367PheGlu: 3.367 ± 0.053
3.017PhePhe: 3.017 ± 0.065
3.721PheGly: 3.721 ± 0.058
0.798PheHis: 0.798 ± 0.027
4.597PheIle: 4.597 ± 0.086
4.46PheLys: 4.46 ± 0.075
5.382PheLeu: 5.382 ± 0.1
1.115PheMet: 1.115 ± 0.035
3.717PheAsn: 3.717 ± 0.063
1.573PhePro: 1.573 ± 0.039
1.491PheGln: 1.491 ± 0.037
1.564PheArg: 1.564 ± 0.043
4.565PheSer: 4.565 ± 0.082
3.578PheThr: 3.578 ± 0.065
3.113PheVal: 3.113 ± 0.063
0.615PheTrp: 0.615 ± 0.025
2.398PheTyr: 2.398 ± 0.054
0.0PheXaa: 0.0 ± 0.0
Gly
3.937GlyAla: 3.937 ± 0.077
0.484GlyCys: 0.484 ± 0.027
3.046GlyAsp: 3.046 ± 0.072
3.256GlyGlu: 3.256 ± 0.059
3.93GlyPhe: 3.93 ± 0.061
4.211GlyGly: 4.211 ± 0.082
0.928GlyHis: 0.928 ± 0.034
5.311GlyIle: 5.311 ± 0.078
5.004GlyLys: 5.004 ± 0.087
5.364GlyLeu: 5.364 ± 0.074
1.379GlyMet: 1.379 ± 0.04
3.807GlyAsn: 3.807 ± 0.066
1.069GlyPro: 1.069 ± 0.033
1.624GlyGln: 1.624 ± 0.035
1.835GlyArg: 1.835 ± 0.045
3.877GlySer: 3.877 ± 0.065
3.923GlyThr: 3.923 ± 0.085
4.177GlyVal: 4.177 ± 0.069
0.654GlyTrp: 0.654 ± 0.024
2.513GlyTyr: 2.513 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
0.814HisAla: 0.814 ± 0.031
0.146HisCys: 0.146 ± 0.01
0.703HisAsp: 0.703 ± 0.026
0.81HisGlu: 0.81 ± 0.03
1.098HisPhe: 1.098 ± 0.034
0.909HisGly: 0.909 ± 0.033
0.431HisHis: 0.431 ± 0.021
1.351HisIle: 1.351 ± 0.035
1.308HisLys: 1.308 ± 0.038
1.643HisLeu: 1.643 ± 0.036
0.263HisMet: 0.263 ± 0.015
0.853HisAsn: 0.853 ± 0.026
0.797HisPro: 0.797 ± 0.03
0.786HisGln: 0.786 ± 0.028
0.589HisArg: 0.589 ± 0.027
0.946HisSer: 0.946 ± 0.03
0.91HisThr: 0.91 ± 0.028
0.788HisVal: 0.788 ± 0.026
0.169HisTrp: 0.169 ± 0.012
0.707HisTyr: 0.707 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
5.907IleAla: 5.907 ± 0.077
0.662IleCys: 0.662 ± 0.027
5.096IleAsp: 5.096 ± 0.078
5.831IleGlu: 5.831 ± 0.084
4.099IlePhe: 4.099 ± 0.067
5.11IleGly: 5.11 ± 0.081
1.337IleHis: 1.337 ± 0.041
7.065IleIle: 7.065 ± 0.113
6.954IleLys: 6.954 ± 0.087
7.733IleLeu: 7.733 ± 0.104
1.352IleMet: 1.352 ± 0.037
5.553IleAsn: 5.553 ± 0.083
3.131IlePro: 3.131 ± 0.054
2.568IleGln: 2.568 ± 0.046
2.546IleArg: 2.546 ± 0.052
6.41IleSer: 6.41 ± 0.083
5.232IleThr: 5.232 ± 0.077
5.023IleVal: 5.023 ± 0.07
0.753IleTrp: 0.753 ± 0.026
3.041IleTyr: 3.041 ± 0.064
0.0IleXaa: 0.0 ± 0.0
Lys
4.904LysAla: 4.904 ± 0.08
0.348LysCys: 0.348 ± 0.018
4.585LysAsp: 4.585 ± 0.079
7.267LysGlu: 7.267 ± 0.107
3.271LysPhe: 3.271 ± 0.051
4.67LysGly: 4.67 ± 0.07
1.388LysHis: 1.388 ± 0.032
7.571LysIle: 7.571 ± 0.088
8.278LysLys: 8.278 ± 0.112
6.979LysLeu: 6.979 ± 0.097
2.187LysMet: 2.187 ± 0.05
6.703LysAsn: 6.703 ± 0.082
2.444LysPro: 2.444 ± 0.051
2.787LysGln: 2.787 ± 0.053
2.828LysArg: 2.828 ± 0.052
5.091LysSer: 5.091 ± 0.077
5.334LysThr: 5.334 ± 0.079
5.062LysVal: 5.062 ± 0.077
0.829LysTrp: 0.829 ± 0.027
3.354LysTyr: 3.354 ± 0.057
0.0LysXaa: 0.0 ± 0.0
Leu
5.442LeuAla: 5.442 ± 0.079
0.6LeuCys: 0.6 ± 0.024
4.918LeuAsp: 4.918 ± 0.073
6.491LeuGlu: 6.491 ± 0.078
5.315LeuPhe: 5.315 ± 0.098
5.604LeuGly: 5.604 ± 0.075
1.487LeuHis: 1.487 ± 0.036
7.416LeuIle: 7.416 ± 0.118
8.535LeuLys: 8.535 ± 0.106
8.617LeuLeu: 8.617 ± 0.123
1.902LeuMet: 1.902 ± 0.048
6.103LeuAsn: 6.103 ± 0.08
3.179LeuPro: 3.179 ± 0.052
3.317LeuGln: 3.317 ± 0.056
2.825LeuArg: 2.825 ± 0.044
6.483LeuSer: 6.483 ± 0.075
5.11LeuThr: 5.11 ± 0.074
5.305LeuVal: 5.305 ± 0.079
0.746LeuTrp: 0.746 ± 0.028
3.017LeuTyr: 3.017 ± 0.057
0.0LeuXaa: 0.0 ± 0.0
Met
1.375MetAla: 1.375 ± 0.041
0.124MetCys: 0.124 ± 0.011
1.034MetAsp: 1.034 ± 0.036
1.122MetGlu: 1.122 ± 0.033
0.907MetPhe: 0.907 ± 0.03
1.158MetGly: 1.158 ± 0.033
0.379MetHis: 0.379 ± 0.015
1.555MetIle: 1.555 ± 0.042
2.164MetLys: 2.164 ± 0.044
1.83MetLeu: 1.83 ± 0.042
0.556MetMet: 0.556 ± 0.023
1.371MetAsn: 1.371 ± 0.035
0.747MetPro: 0.747 ± 0.022
0.755MetGln: 0.755 ± 0.031
0.745MetArg: 0.745 ± 0.028
1.339MetSer: 1.339 ± 0.034
0.987MetThr: 0.987 ± 0.029
1.158MetVal: 1.158 ± 0.037
0.151MetTrp: 0.151 ± 0.011
0.69MetTyr: 0.69 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
4.04AsnAla: 4.04 ± 0.076
0.518AsnCys: 0.518 ± 0.029
3.271AsnAsp: 3.271 ± 0.069
3.902AsnGlu: 3.902 ± 0.055
3.647AsnPhe: 3.647 ± 0.065
4.248AsnGly: 4.248 ± 0.089
1.073AsnHis: 1.073 ± 0.029
5.457AsnIle: 5.457 ± 0.074
5.193AsnLys: 5.193 ± 0.076
6.171AsnLeu: 6.171 ± 0.089
1.264AsnMet: 1.264 ± 0.035
4.56AsnAsn: 4.56 ± 0.1
2.804AsnPro: 2.804 ± 0.056
2.444AsnGln: 2.444 ± 0.051
2.052AsnArg: 2.052 ± 0.041
4.647AsnSer: 4.647 ± 0.077
4.184AsnThr: 4.184 ± 0.098
3.638AsnVal: 3.638 ± 0.061
0.876AsnTrp: 0.876 ± 0.028
3.165AsnTyr: 3.165 ± 0.056
0.0AsnXaa: 0.0 ± 0.0
Pro
1.705ProAla: 1.705 ± 0.04
0.19ProCys: 0.19 ± 0.012
1.661ProAsp: 1.661 ± 0.04
2.29ProGlu: 2.29 ± 0.047
1.941ProPhe: 1.941 ± 0.05
1.546ProGly: 1.546 ± 0.036
0.522ProHis: 0.522 ± 0.025
2.739ProIle: 2.739 ± 0.052
2.574ProLys: 2.574 ± 0.053
2.734ProLeu: 2.734 ± 0.05
0.587ProMet: 0.587 ± 0.024
2.196ProAsn: 2.196 ± 0.049
0.674ProPro: 0.674 ± 0.027
0.91ProGln: 0.91 ± 0.033
0.829ProArg: 0.829 ± 0.028
1.935ProSer: 1.935 ± 0.041
1.998ProThr: 1.998 ± 0.045
1.923ProVal: 1.923 ± 0.045
0.293ProTrp: 0.293 ± 0.016
1.169ProTyr: 1.169 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
1.659GlnAla: 1.659 ± 0.038
0.147GlnCys: 0.147 ± 0.01
1.516GlnAsp: 1.516 ± 0.037
2.4GlnGlu: 2.4 ± 0.046
1.743GlnPhe: 1.743 ± 0.041
1.628GlnGly: 1.628 ± 0.041
0.523GlnHis: 0.523 ± 0.021
2.833GlnIle: 2.833 ± 0.048
3.213GlnLys: 3.213 ± 0.062
3.29GlnLeu: 3.29 ± 0.054
0.713GlnMet: 0.713 ± 0.023
2.338GlnAsn: 2.338 ± 0.047
0.892GlnPro: 0.892 ± 0.03
1.446GlnGln: 1.446 ± 0.039
1.067GlnArg: 1.067 ± 0.029
1.753GlnSer: 1.753 ± 0.04
1.873GlnThr: 1.873 ± 0.046
1.829GlnVal: 1.829 ± 0.042
0.29GlnTrp: 0.29 ± 0.019
1.189GlnTyr: 1.189 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
1.917ArgAla: 1.917 ± 0.042
0.171ArgCys: 0.171 ± 0.013
1.523ArgAsp: 1.523 ± 0.034
1.983ArgGlu: 1.983 ± 0.038
1.788ArgPhe: 1.788 ± 0.039
1.719ArgGly: 1.719 ± 0.044
0.507ArgHis: 0.507 ± 0.023
2.761ArgIle: 2.761 ± 0.048
2.772ArgLys: 2.772 ± 0.061
2.828ArgLeu: 2.828 ± 0.05
0.757ArgMet: 0.757 ± 0.025
2.059ArgAsn: 2.059 ± 0.05
0.884ArgPro: 0.884 ± 0.026
0.921ArgGln: 0.921 ± 0.028
1.093ArgArg: 1.093 ± 0.031
1.728ArgSer: 1.728 ± 0.035
1.803ArgThr: 1.803 ± 0.039
1.89ArgVal: 1.89 ± 0.048
0.299ArgTrp: 0.299 ± 0.015
1.297ArgTyr: 1.297 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
3.775SerAla: 3.775 ± 0.068
0.589SerCys: 0.589 ± 0.024
3.525SerAsp: 3.525 ± 0.065
4.228SerGlu: 4.228 ± 0.056
4.262SerPhe: 4.262 ± 0.076
4.307SerGly: 4.307 ± 0.072
0.957SerHis: 0.957 ± 0.031
5.627SerIle: 5.627 ± 0.073
5.695SerLys: 5.695 ± 0.072
6.294SerLeu: 6.294 ± 0.083
1.193SerMet: 1.193 ± 0.037
4.203SerAsn: 4.203 ± 0.084
1.823SerPro: 1.823 ± 0.045
1.977SerGln: 1.977 ± 0.048
1.919SerArg: 1.919 ± 0.046
4.714SerSer: 4.714 ± 0.088
3.796SerThr: 3.796 ± 0.065
4.052SerVal: 4.052 ± 0.055
0.752SerTrp: 0.752 ± 0.03
2.783SerTyr: 2.783 ± 0.058
0.0SerXaa: 0.0 ± 0.0
Thr
3.914ThrAla: 3.914 ± 0.076
0.342ThrCys: 0.342 ± 0.017
3.43ThrAsp: 3.43 ± 0.069
3.447ThrGlu: 3.447 ± 0.059
3.44ThrPhe: 3.44 ± 0.067
3.786ThrGly: 3.786 ± 0.063
0.942ThrHis: 0.942 ± 0.029
5.651ThrIle: 5.651 ± 0.079
4.655ThrLys: 4.655 ± 0.072
5.435ThrLeu: 5.435 ± 0.075
0.911ThrMet: 0.911 ± 0.03
3.957ThrAsn: 3.957 ± 0.083
2.25ThrPro: 2.25 ± 0.047
1.859ThrGln: 1.859 ± 0.042
1.549ThrArg: 1.549 ± 0.035
4.235ThrSer: 4.235 ± 0.075
3.787ThrThr: 3.787 ± 0.076
3.69ThrVal: 3.69 ± 0.071
0.605ThrTrp: 0.605 ± 0.023
2.434ThrTyr: 2.434 ± 0.056
0.0ThrXaa: 0.0 ± 0.0
Val
4.09ValAla: 4.09 ± 0.068
0.478ValCys: 0.478 ± 0.023
3.455ValAsp: 3.455 ± 0.058
3.563ValGlu: 3.563 ± 0.064
3.555ValPhe: 3.555 ± 0.073
3.628ValGly: 3.628 ± 0.067
0.96ValHis: 0.96 ± 0.032
4.869ValIle: 4.869 ± 0.069
4.602ValLys: 4.602 ± 0.063
6.106ValLeu: 6.106 ± 0.077
1.124ValMet: 1.124 ± 0.034
3.658ValAsn: 3.658 ± 0.064
1.888ValPro: 1.888 ± 0.045
1.711ValGln: 1.711 ± 0.041
1.757ValArg: 1.757 ± 0.044
4.444ValSer: 4.444 ± 0.075
3.701ValThr: 3.701 ± 0.073
3.922ValVal: 3.922 ± 0.069
0.571ValTrp: 0.571 ± 0.023
2.191ValTyr: 2.191 ± 0.045
0.0ValXaa: 0.0 ± 0.0
Trp
0.536TrpAla: 0.536 ± 0.022
0.089TrpCys: 0.089 ± 0.008
0.54TrpAsp: 0.54 ± 0.023
0.6TrpGlu: 0.6 ± 0.025
0.625TrpPhe: 0.625 ± 0.027
0.638TrpGly: 0.638 ± 0.028
0.201TrpHis: 0.201 ± 0.013
0.783TrpIle: 0.783 ± 0.026
0.778TrpLys: 0.778 ± 0.027
0.901TrpLeu: 0.901 ± 0.028
0.308TrpMet: 0.308 ± 0.016
0.727TrpAsn: 0.727 ± 0.028
0.192TrpPro: 0.192 ± 0.014
0.419TrpGln: 0.419 ± 0.022
0.399TrpArg: 0.399 ± 0.02
0.7TrpSer: 0.7 ± 0.028
0.511TrpThr: 0.511 ± 0.02
0.586TrpVal: 0.586 ± 0.026
0.129TrpTrp: 0.129 ± 0.011
0.423TrpTyr: 0.423 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.252TyrAla: 2.252 ± 0.052
0.322TyrCys: 0.322 ± 0.018
2.021TyrAsp: 2.021 ± 0.05
2.025TyrGlu: 2.025 ± 0.047
2.483TyrPhe: 2.483 ± 0.048
2.299TyrGly: 2.299 ± 0.046
0.767TyrHis: 0.767 ± 0.025
2.753TyrIle: 2.753 ± 0.051
3.424TyrLys: 3.424 ± 0.056
3.842TyrLeu: 3.842 ± 0.059
0.683TyrMet: 0.683 ± 0.025
2.734TyrAsn: 2.734 ± 0.053
1.44TyrPro: 1.44 ± 0.034
1.693TyrGln: 1.693 ± 0.043
1.333TyrArg: 1.333 ± 0.034
2.6TyrSer: 2.6 ± 0.05
2.519TyrThr: 2.519 ± 0.059
1.992TyrVal: 1.992 ± 0.041
0.458TyrTrp: 0.458 ± 0.021
1.746TyrTyr: 1.746 ± 0.049
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3368 proteins (1169193 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski