Amino acid dipepetide frequency for Pontibacter akesuensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.459AlaAla: 8.459 ± 0.115
0.69AlaCys: 0.69 ± 0.024
4.32AlaAsp: 4.32 ± 0.064
5.253AlaGlu: 5.253 ± 0.072
3.79AlaPhe: 3.79 ± 0.054
6.46AlaGly: 6.46 ± 0.087
1.52AlaHis: 1.52 ± 0.033
4.967AlaIle: 4.967 ± 0.07
4.376AlaLys: 4.376 ± 0.062
8.3AlaLeu: 8.3 ± 0.094
2.075AlaMet: 2.075 ± 0.035
3.433AlaAsn: 3.433 ± 0.056
3.42AlaPro: 3.42 ± 0.068
3.728AlaGln: 3.728 ± 0.06
3.414AlaArg: 3.414 ± 0.052
4.911AlaSer: 4.911 ± 0.064
4.902AlaThr: 4.902 ± 0.089
5.915AlaVal: 5.915 ± 0.063
0.853AlaTrp: 0.853 ± 0.022
3.135AlaTyr: 3.135 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
0.599CysAla: 0.599 ± 0.023
0.12CysCys: 0.12 ± 0.01
0.353CysAsp: 0.353 ± 0.017
0.387CysGlu: 0.387 ± 0.021
0.326CysPhe: 0.326 ± 0.018
0.651CysGly: 0.651 ± 0.035
0.172CysHis: 0.172 ± 0.012
0.452CysIle: 0.452 ± 0.016
0.313CysLys: 0.313 ± 0.016
0.701CysLeu: 0.701 ± 0.025
0.154CysMet: 0.154 ± 0.011
0.31CysAsn: 0.31 ± 0.015
0.323CysPro: 0.323 ± 0.019
0.243CysGln: 0.243 ± 0.014
0.314CysArg: 0.314 ± 0.017
0.506CysSer: 0.506 ± 0.022
0.446CysThr: 0.446 ± 0.024
0.446CysVal: 0.446 ± 0.02
0.063CysTrp: 0.063 ± 0.006
0.245CysTyr: 0.245 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
4.122AspAla: 4.122 ± 0.06
0.342AspCys: 0.342 ± 0.016
2.26AspAsp: 2.26 ± 0.048
3.451AspGlu: 3.451 ± 0.055
2.815AspPhe: 2.815 ± 0.047
3.39AspGly: 3.39 ± 0.064
0.83AspHis: 0.83 ± 0.028
3.248AspIle: 3.248 ± 0.057
3.41AspLys: 3.41 ± 0.054
4.786AspLeu: 4.786 ± 0.073
1.322AspMet: 1.322 ± 0.03
2.337AspAsn: 2.337 ± 0.051
2.031AspPro: 2.031 ± 0.045
1.703AspGln: 1.703 ± 0.034
2.314AspArg: 2.314 ± 0.049
2.741AspSer: 2.741 ± 0.046
2.676AspThr: 2.676 ± 0.045
3.665AspVal: 3.665 ± 0.058
0.69AspTrp: 0.69 ± 0.023
2.34AspTyr: 2.34 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
5.891GluAla: 5.891 ± 0.09
0.263GluCys: 0.263 ± 0.016
3.119GluAsp: 3.119 ± 0.049
5.319GluGlu: 5.319 ± 0.078
2.291GluPhe: 2.291 ± 0.039
4.283GluGly: 4.283 ± 0.06
1.334GluHis: 1.334 ± 0.034
3.608GluIle: 3.608 ± 0.053
4.362GluLys: 4.362 ± 0.073
6.73GluLeu: 6.73 ± 0.09
1.615GluMet: 1.615 ± 0.04
3.085GluAsn: 3.085 ± 0.052
2.173GluPro: 2.173 ± 0.047
3.633GluGln: 3.633 ± 0.059
3.382GluArg: 3.382 ± 0.064
2.996GluSer: 2.996 ± 0.044
3.137GluThr: 3.137 ± 0.05
5.129GluVal: 5.129 ± 0.065
0.737GluTrp: 0.737 ± 0.025
2.036GluTyr: 2.036 ± 0.043
0.0GluXaa: 0.0 ± 0.0
Phe
3.474PheAla: 3.474 ± 0.052
0.365PheCys: 0.365 ± 0.019
2.543PheAsp: 2.543 ± 0.044
2.731PheGlu: 2.731 ± 0.049
2.178PhePhe: 2.178 ± 0.046
3.243PheGly: 3.243 ± 0.059
0.778PheHis: 0.778 ± 0.026
2.731PheIle: 2.731 ± 0.054
2.358PheLys: 2.358 ± 0.047
4.234PheLeu: 4.234 ± 0.071
1.12PheMet: 1.12 ± 0.026
2.2PheAsn: 2.2 ± 0.046
1.642PhePro: 1.642 ± 0.032
1.412PheGln: 1.412 ± 0.034
2.095PheArg: 2.095 ± 0.045
3.296PheSer: 3.296 ± 0.05
2.89PheThr: 2.89 ± 0.055
3.031PheVal: 3.031 ± 0.051
0.623PheTrp: 0.623 ± 0.022
1.886PheTyr: 1.886 ± 0.033
0.0PheXaa: 0.0 ± 0.0
Gly
5.401GlyAla: 5.401 ± 0.077
0.645GlyCys: 0.645 ± 0.037
3.299GlyAsp: 3.299 ± 0.054
4.134GlyGlu: 4.134 ± 0.06
3.441GlyPhe: 3.441 ± 0.052
5.267GlyGly: 5.267 ± 0.099
1.365GlyHis: 1.365 ± 0.032
4.696GlyIle: 4.696 ± 0.071
4.64GlyLys: 4.64 ± 0.06
6.549GlyLeu: 6.549 ± 0.073
1.938GlyMet: 1.938 ± 0.042
3.333GlyAsn: 3.333 ± 0.058
1.92GlyPro: 1.92 ± 0.054
2.665GlyGln: 2.665 ± 0.048
2.977GlyArg: 2.977 ± 0.049
4.377GlySer: 4.377 ± 0.082
4.458GlyThr: 4.458 ± 0.097
4.867GlyVal: 4.867 ± 0.079
0.925GlyTrp: 0.925 ± 0.03
2.995GlyTyr: 2.995 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
1.382HisAla: 1.382 ± 0.029
0.178HisCys: 0.178 ± 0.012
0.931HisAsp: 0.931 ± 0.026
1.165HisGlu: 1.165 ± 0.031
1.064HisPhe: 1.064 ± 0.03
1.322HisGly: 1.322 ± 0.032
0.567HisHis: 0.567 ± 0.021
1.219HisIle: 1.219 ± 0.029
0.985HisLys: 0.985 ± 0.028
2.194HisLeu: 2.194 ± 0.041
0.464HisMet: 0.464 ± 0.018
0.943HisAsn: 0.943 ± 0.026
1.114HisPro: 1.114 ± 0.03
0.869HisGln: 0.869 ± 0.027
0.931HisArg: 0.931 ± 0.031
1.101HisSer: 1.101 ± 0.03
1.228HisThr: 1.228 ± 0.033
1.238HisVal: 1.238 ± 0.029
0.232HisTrp: 0.232 ± 0.013
0.867HisTyr: 0.867 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.064IleAla: 5.064 ± 0.064
0.511IleCys: 0.511 ± 0.021
3.216IleAsp: 3.216 ± 0.052
3.448IleGlu: 3.448 ± 0.053
2.463IlePhe: 2.463 ± 0.049
4.455IleGly: 4.455 ± 0.067
1.075IleHis: 1.075 ± 0.024
3.648IleIle: 3.648 ± 0.068
3.475IleLys: 3.475 ± 0.06
5.696IleLeu: 5.696 ± 0.082
1.215IleMet: 1.215 ± 0.028
2.988IleAsn: 2.988 ± 0.051
2.644IlePro: 2.644 ± 0.045
1.97IleGln: 1.97 ± 0.039
2.809IleArg: 2.809 ± 0.05
4.085IleSer: 4.085 ± 0.058
3.771IleThr: 3.771 ± 0.063
3.777IleVal: 3.777 ± 0.063
0.565IleTrp: 0.565 ± 0.022
2.061IleTyr: 2.061 ± 0.045
0.0IleXaa: 0.0 ± 0.0
Lys
5.076LysAla: 5.076 ± 0.074
0.237LysCys: 0.237 ± 0.014
3.082LysAsp: 3.082 ± 0.062
4.302LysGlu: 4.302 ± 0.071
1.894LysPhe: 1.894 ± 0.041
3.921LysGly: 3.921 ± 0.058
1.145LysHis: 1.145 ± 0.032
3.221LysIle: 3.221 ± 0.059
3.865LysLys: 3.865 ± 0.073
5.766LysLeu: 5.766 ± 0.068
1.473LysMet: 1.473 ± 0.036
2.713LysAsn: 2.713 ± 0.049
2.645LysPro: 2.645 ± 0.048
2.677LysGln: 2.677 ± 0.044
2.754LysArg: 2.754 ± 0.044
3.149LysSer: 3.149 ± 0.053
3.076LysThr: 3.076 ± 0.05
4.211LysVal: 4.211 ± 0.064
0.605LysTrp: 0.605 ± 0.022
2.57LysTyr: 2.57 ± 0.046
0.0LysXaa: 0.0 ± 0.0
Leu
8.368LeuAla: 8.368 ± 0.097
0.793LeuCys: 0.793 ± 0.026
4.733LeuAsp: 4.733 ± 0.06
6.272LeuGlu: 6.272 ± 0.074
4.522LeuPhe: 4.522 ± 0.07
6.375LeuGly: 6.375 ± 0.084
2.226LeuHis: 2.226 ± 0.043
5.313LeuIle: 5.313 ± 0.078
5.862LeuLys: 5.862 ± 0.066
11.963LeuLeu: 11.963 ± 0.158
2.335LeuMet: 2.335 ± 0.044
4.67LeuAsn: 4.67 ± 0.056
4.699LeuPro: 4.699 ± 0.061
5.391LeuGln: 5.391 ± 0.08
4.755LeuArg: 4.755 ± 0.056
6.4LeuSer: 6.4 ± 0.071
5.434LeuThr: 5.434 ± 0.074
6.828LeuVal: 6.828 ± 0.086
1.001LeuTrp: 1.001 ± 0.029
3.702LeuTyr: 3.702 ± 0.065
0.0LeuXaa: 0.0 ± 0.0
Met
2.305MetAla: 2.305 ± 0.042
0.116MetCys: 0.116 ± 0.01
1.171MetAsp: 1.171 ± 0.031
1.551MetGlu: 1.551 ± 0.036
0.82MetPhe: 0.82 ± 0.027
1.658MetGly: 1.658 ± 0.037
0.541MetHis: 0.541 ± 0.021
1.287MetIle: 1.287 ± 0.033
1.74MetLys: 1.74 ± 0.04
2.567MetLeu: 2.567 ± 0.049
0.629MetMet: 0.629 ± 0.021
1.12MetAsn: 1.12 ± 0.028
1.176MetPro: 1.176 ± 0.027
1.227MetGln: 1.227 ± 0.032
1.269MetArg: 1.269 ± 0.026
1.331MetSer: 1.331 ± 0.031
1.094MetThr: 1.094 ± 0.03
1.631MetVal: 1.631 ± 0.035
0.194MetTrp: 0.194 ± 0.012
0.674MetTyr: 0.674 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.508AsnAla: 3.508 ± 0.055
0.323AsnCys: 0.323 ± 0.017
2.13AsnAsp: 2.13 ± 0.045
2.643AsnGlu: 2.643 ± 0.048
2.158AsnPhe: 2.158 ± 0.046
3.409AsnGly: 3.409 ± 0.079
0.793AsnHis: 0.793 ± 0.025
3.139AsnIle: 3.139 ± 0.051
2.635AsnLys: 2.635 ± 0.04
4.616AsnLeu: 4.616 ± 0.065
1.1AsnMet: 1.1 ± 0.027
2.381AsnAsn: 2.381 ± 0.055
2.527AsnPro: 2.527 ± 0.052
1.729AsnGln: 1.729 ± 0.042
2.329AsnArg: 2.329 ± 0.038
2.865AsnSer: 2.865 ± 0.053
2.771AsnThr: 2.771 ± 0.048
2.891AsnVal: 2.891 ± 0.056
0.672AsnTrp: 0.672 ± 0.025
2.189AsnTyr: 2.189 ± 0.043
0.0AsnXaa: 0.0 ± 0.0
Pro
3.991ProAla: 3.991 ± 0.073
0.22ProCys: 0.22 ± 0.014
2.707ProAsp: 2.707 ± 0.039
3.652ProGlu: 3.652 ± 0.057
1.86ProPhe: 1.86 ± 0.036
2.78ProGly: 2.78 ± 0.051
0.846ProHis: 0.846 ± 0.028
2.196ProIle: 2.196 ± 0.038
2.069ProLys: 2.069 ± 0.036
3.838ProLeu: 3.838 ± 0.05
0.873ProMet: 0.873 ± 0.022
1.943ProAsn: 1.943 ± 0.04
1.259ProPro: 1.259 ± 0.038
1.717ProGln: 1.717 ± 0.04
1.4ProArg: 1.4 ± 0.03
2.252ProSer: 2.252 ± 0.039
2.236ProThr: 2.236 ± 0.047
3.385ProVal: 3.385 ± 0.053
0.435ProTrp: 0.435 ± 0.018
1.707ProTyr: 1.707 ± 0.044
0.0ProXaa: 0.0 ± 0.0
Gln
3.605GlnAla: 3.605 ± 0.06
0.186GlnCys: 0.186 ± 0.013
2.087GlnAsp: 2.087 ± 0.039
3.507GlnGlu: 3.507 ± 0.06
1.502GlnPhe: 1.502 ± 0.034
2.549GlnGly: 2.549 ± 0.05
1.139GlnHis: 1.139 ± 0.032
2.036GlnIle: 2.036 ± 0.039
2.517GlnLys: 2.517 ± 0.049
4.875GlnLeu: 4.875 ± 0.075
0.946GlnMet: 0.946 ± 0.027
2.022GlnAsn: 2.022 ± 0.041
1.959GlnPro: 1.959 ± 0.039
3.296GlnGln: 3.296 ± 0.069
2.189GlnArg: 2.189 ± 0.042
1.997GlnSer: 1.997 ± 0.035
2.14GlnThr: 2.14 ± 0.042
3.485GlnVal: 3.485 ± 0.054
0.393GlnTrp: 0.393 ± 0.016
1.403GlnTyr: 1.403 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
3.072ArgAla: 3.072 ± 0.049
0.259ArgCys: 0.259 ± 0.014
2.303ArgAsp: 2.303 ± 0.044
3.232ArgGlu: 3.232 ± 0.053
2.246ArgPhe: 2.246 ± 0.039
2.646ArgGly: 2.646 ± 0.046
1.117ArgHis: 1.117 ± 0.027
3.038ArgIle: 3.038 ± 0.053
2.887ArgLys: 2.887 ± 0.048
4.712ArgLeu: 4.712 ± 0.058
1.334ArgMet: 1.334 ± 0.035
2.453ArgAsn: 2.453 ± 0.043
1.637ArgPro: 1.637 ± 0.044
2.231ArgGln: 2.231 ± 0.045
2.254ArgArg: 2.254 ± 0.048
2.506ArgSer: 2.506 ± 0.043
2.372ArgThr: 2.372 ± 0.043
3.13ArgVal: 3.13 ± 0.05
0.593ArgTrp: 0.593 ± 0.024
1.972ArgTyr: 1.972 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
4.489SerAla: 4.489 ± 0.067
0.569SerCys: 0.569 ± 0.023
3.01SerAsp: 3.01 ± 0.056
3.439SerGlu: 3.439 ± 0.055
3.183SerPhe: 3.183 ± 0.048
4.694SerGly: 4.694 ± 0.082
1.084SerHis: 1.084 ± 0.028
4.007SerIle: 4.007 ± 0.063
3.243SerLys: 3.243 ± 0.052
5.822SerLeu: 5.822 ± 0.065
1.531SerMet: 1.531 ± 0.034
2.768SerAsn: 2.768 ± 0.06
2.385SerPro: 2.385 ± 0.044
2.043SerGln: 2.043 ± 0.043
2.691SerArg: 2.691 ± 0.045
3.929SerSer: 3.929 ± 0.075
3.43SerThr: 3.43 ± 0.058
3.951SerVal: 3.951 ± 0.06
0.688SerTrp: 0.688 ± 0.02
2.512SerTyr: 2.512 ± 0.046
0.0SerXaa: 0.0 ± 0.0
Thr
5.123ThrAla: 5.123 ± 0.096
0.372ThrCys: 0.372 ± 0.026
3.083ThrAsp: 3.083 ± 0.049
3.254ThrGlu: 3.254 ± 0.046
2.651ThrPhe: 2.651 ± 0.05
4.541ThrGly: 4.541 ± 0.092
1.079ThrHis: 1.079 ± 0.03
3.472ThrIle: 3.472 ± 0.053
2.688ThrLys: 2.688 ± 0.042
5.632ThrLeu: 5.632 ± 0.067
1.11ThrMet: 1.11 ± 0.026
2.329ThrAsn: 2.329 ± 0.05
2.857ThrPro: 2.857 ± 0.048
1.98ThrGln: 1.98 ± 0.041
2.203ThrArg: 2.203 ± 0.044
3.529ThrSer: 3.529 ± 0.067
3.48ThrThr: 3.48 ± 0.092
4.238ThrVal: 4.238 ± 0.082
0.65ThrTrp: 0.65 ± 0.026
2.469ThrTyr: 2.469 ± 0.054
0.0ThrXaa: 0.0 ± 0.0
Val
6.267ValAla: 6.267 ± 0.076
0.56ValCys: 0.56 ± 0.022
3.42ValAsp: 3.42 ± 0.039
4.502ValGlu: 4.502 ± 0.063
3.008ValPhe: 3.008 ± 0.05
4.608ValGly: 4.608 ± 0.065
1.301ValHis: 1.301 ± 0.033
3.937ValIle: 3.937 ± 0.061
3.972ValLys: 3.972 ± 0.061
7.388ValLeu: 7.388 ± 0.079
1.722ValMet: 1.722 ± 0.04
3.123ValAsn: 3.123 ± 0.048
3.032ValPro: 3.032 ± 0.047
3.155ValGln: 3.155 ± 0.051
3.213ValArg: 3.213 ± 0.052
4.299ValSer: 4.299 ± 0.061
3.944ValThr: 3.944 ± 0.079
5.442ValVal: 5.442 ± 0.072
0.814ValTrp: 0.814 ± 0.028
2.67ValTyr: 2.67 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.858TrpAla: 0.858 ± 0.024
0.092TrpCys: 0.092 ± 0.007
0.567TrpAsp: 0.567 ± 0.021
0.648TrpGlu: 0.648 ± 0.023
0.531TrpPhe: 0.531 ± 0.02
0.78TrpGly: 0.78 ± 0.025
0.258TrpHis: 0.258 ± 0.014
0.572TrpIle: 0.572 ± 0.023
0.619TrpLys: 0.619 ± 0.021
1.339TrpLeu: 1.339 ± 0.039
0.32TrpMet: 0.32 ± 0.016
0.57TrpAsn: 0.57 ± 0.02
0.358TrpPro: 0.358 ± 0.016
0.611TrpGln: 0.611 ± 0.021
0.609TrpArg: 0.609 ± 0.02
0.664TrpSer: 0.664 ± 0.023
0.559TrpThr: 0.559 ± 0.024
0.807TrpVal: 0.807 ± 0.028
0.177TrpTrp: 0.177 ± 0.012
0.44TrpTyr: 0.44 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.865TyrAla: 2.865 ± 0.05
0.283TyrCys: 0.283 ± 0.015
2.14TyrAsp: 2.14 ± 0.045
2.233TyrGlu: 2.233 ± 0.043
2.088TyrPhe: 2.088 ± 0.047
2.858TyrGly: 2.858 ± 0.057
0.825TyrHis: 0.825 ± 0.028
2.149TyrIle: 2.149 ± 0.038
2.367TyrLys: 2.367 ± 0.044
3.937TyrLeu: 3.937 ± 0.06
0.825TyrMet: 0.825 ± 0.024
2.089TyrAsn: 2.089 ± 0.042
1.565TyrPro: 1.565 ± 0.036
1.554TyrGln: 1.554 ± 0.032
2.094TyrArg: 2.094 ± 0.043
2.531TyrSer: 2.531 ± 0.044
2.695TyrThr: 2.695 ± 0.056
2.309TyrVal: 2.309 ± 0.041
0.467TyrTrp: 0.467 ± 0.02
1.732TyrTyr: 1.732 ± 0.048
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4073 proteins (1372960 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski