Amino acid dipeptide frequency for Aquabacter spiritensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
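As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal sketch under assumptions, not the database's own pipeline: the function name `dipeptide_per_mille`, the one-letter input sequences, the mapping of every non-standard letter to `X` (Xaa), and the choice not to count pairs across protein boundaries are all illustrative.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected value if all 400 dipeptides were equally likely: 2.5 per mille (0.25%).
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MAAGL", "MAAK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), "vs uniform", UNIFORM_PER_MILLE)
```

With real data the input would be the protein sequences of the whole proteome, and the resulting values would sum to 1000‰.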

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
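To make the unit concrete, the short sketch below converts a table value from per mille to a plain fraction and compares it with the 2.5‰ expected if all 400 dipeptides were equally likely. The helper names are hypothetical, and comparing directly against the uniform expectation is an assumption; the exact rule used for the red and blue marking is not stated here.

```python
# Expected value if all 400 dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def representation(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table: AlaAla = 22.734, CysCys = 0.089.
print(per_mille_to_fraction(22.734))
print(representation(22.734))  # over
print(representation(0.089))   # under
```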

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.734 ± 0.226
AlaCys: 1.255 ± 0.034
AlaAsp: 7.557 ± 0.091
AlaGlu: 7.838 ± 0.091
AlaPhe: 5.129 ± 0.06
AlaGly: 13.224 ± 0.158
AlaHis: 2.545 ± 0.05
AlaIle: 6.332 ± 0.072
AlaLys: 3.569 ± 0.071
AlaLeu: 15.594 ± 0.137
AlaMet: 3.454 ± 0.059
AlaAsn: 2.674 ± 0.052
AlaPro: 7.724 ± 0.11
AlaGln: 4.357 ± 0.072
AlaArg: 10.76 ± 0.115
AlaSer: 6.31 ± 0.089
AlaThr: 6.53 ± 0.078
AlaVal: 10.261 ± 0.09
AlaTrp: 1.493 ± 0.035
AlaTyr: 2.766 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.193 ± 0.031
CysCys: 0.089 ± 0.008
CysAsp: 0.503 ± 0.018
CysGlu: 0.388 ± 0.015
CysPhe: 0.301 ± 0.012
CysGly: 0.951 ± 0.028
CysHis: 0.213 ± 0.012
CysIle: 0.359 ± 0.016
CysLys: 0.132 ± 0.009
CysLeu: 0.868 ± 0.023
CysMet: 0.128 ± 0.01
CysAsn: 0.164 ± 0.01
CysPro: 0.477 ± 0.017
CysGln: 0.189 ± 0.011
CysArg: 0.624 ± 0.022
CysSer: 0.364 ± 0.014
CysThr: 0.423 ± 0.017
CysVal: 0.646 ± 0.02
CysTrp: 0.104 ± 0.008
CysTyr: 0.183 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.597 ± 0.085
AspCys: 0.431 ± 0.018
AspAsp: 2.515 ± 0.043
AspGlu: 2.675 ± 0.049
AspPhe: 2.11 ± 0.039
AspGly: 5.294 ± 0.075
AspHis: 1.133 ± 0.033
AspIle: 2.64 ± 0.043
AspLys: 1.293 ± 0.031
AspLeu: 6.539 ± 0.076
AspMet: 1.214 ± 0.03
AspAsn: 0.956 ± 0.029
AspPro: 3.851 ± 0.055
AspGln: 1.427 ± 0.03
AspArg: 4.21 ± 0.065
AspSer: 1.822 ± 0.036
AspThr: 2.53 ± 0.041
AspVal: 4.211 ± 0.052
AspTrp: 0.852 ± 0.027
AspTyr: 1.376 ± 0.029
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.833 ± 0.087
GluCys: 0.297 ± 0.018
GluAsp: 2.618 ± 0.05
GluGlu: 2.693 ± 0.06
GluPhe: 1.369 ± 0.031
GluGly: 4.134 ± 0.058
GluHis: 0.953 ± 0.025
GluIle: 3.111 ± 0.047
GluLys: 1.756 ± 0.038
GluLeu: 4.237 ± 0.065
GluMet: 1.425 ± 0.029
GluAsn: 1.16 ± 0.028
GluPro: 2.518 ± 0.049
GluGln: 1.51 ± 0.036
GluArg: 4.579 ± 0.075
GluSer: 2.002 ± 0.037
GluThr: 3.376 ± 0.059
GluVal: 3.859 ± 0.059
GluTrp: 0.545 ± 0.021
GluTyr: 0.711 ± 0.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.183 ± 0.06
PheCys: 0.387 ± 0.015
PheAsp: 2.502 ± 0.043
PheGlu: 1.952 ± 0.036
PhePhe: 1.358 ± 0.032
PheGly: 3.86 ± 0.051
PheHis: 0.683 ± 0.022
PheIle: 1.466 ± 0.028
PheLys: 0.87 ± 0.024
PheLeu: 3.53 ± 0.059
PheMet: 0.668 ± 0.02
PheAsn: 0.915 ± 0.022
PhePro: 1.725 ± 0.037
PheGln: 0.941 ± 0.026
PheArg: 2.2 ± 0.037
PheSer: 2.128 ± 0.045
PheThr: 2.012 ± 0.039
PheVal: 2.976 ± 0.048
PheTrp: 0.482 ± 0.017
PheTyr: 0.847 ± 0.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.266 ± 0.154
GlyCys: 0.842 ± 0.025
GlyAsp: 4.348 ± 0.069
GlyGlu: 4.451 ± 0.055
GlyPhe: 3.776 ± 0.055
GlyGly: 9.692 ± 0.469
GlyHis: 1.879 ± 0.036
GlyIle: 4.676 ± 0.065
GlyLys: 2.528 ± 0.047
GlyLeu: 9.982 ± 0.1
GlyMet: 2.288 ± 0.045
GlyAsn: 2.002 ± 0.075
GlyPro: 4.432 ± 0.059
GlyGln: 2.55 ± 0.049
GlyArg: 6.771 ± 0.076
GlySer: 4.779 ± 0.15
GlyThr: 5.96 ± 0.231
GlyVal: 6.451 ± 0.086
GlyTrp: 1.366 ± 0.033
GlyTyr: 2.308 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.49 ± 0.047
HisCys: 0.198 ± 0.011
HisAsp: 1.088 ± 0.027
HisGlu: 0.807 ± 0.026
HisPhe: 0.77 ± 0.029
HisGly: 1.862 ± 0.041
HisHis: 0.51 ± 0.022
HisIle: 0.813 ± 0.023
HisLys: 0.36 ± 0.017
HisLeu: 2.201 ± 0.039
HisMet: 0.449 ± 0.016
HisAsn: 0.363 ± 0.016
HisPro: 1.367 ± 0.031
HisGln: 0.473 ± 0.018
HisArg: 1.357 ± 0.033
HisSer: 0.75 ± 0.022
HisThr: 0.741 ± 0.027
HisVal: 1.605 ± 0.034
HisTrp: 0.302 ± 0.013
HisTyr: 0.477 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.508 ± 0.079
IleCys: 0.443 ± 0.019
IleAsp: 2.976 ± 0.044
IleGlu: 2.943 ± 0.051
IlePhe: 1.452 ± 0.035
IleGly: 5.022 ± 0.077
IleHis: 0.796 ± 0.024
IleIle: 1.684 ± 0.041
IleLys: 1.14 ± 0.031
IleLeu: 4.597 ± 0.051
IleMet: 0.776 ± 0.024
IleAsn: 1.117 ± 0.03
IlePro: 2.286 ± 0.04
IleGln: 1.092 ± 0.029
IleArg: 2.779 ± 0.041
IleSer: 2.491 ± 0.059
IleThr: 2.393 ± 0.045
IleVal: 4.121 ± 0.061
IleTrp: 0.505 ± 0.018
IleTyr: 1.06 ± 0.025
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.709 ± 0.066
LysCys: 0.124 ± 0.009
LysAsp: 1.463 ± 0.033
LysGlu: 1.229 ± 0.036
LysPhe: 0.698 ± 0.022
LysGly: 2.264 ± 0.045
LysHis: 0.417 ± 0.019
LysIle: 1.339 ± 0.037
LysLys: 0.897 ± 0.03
LysLeu: 2.524 ± 0.048
LysMet: 0.606 ± 0.021
LysAsn: 0.567 ± 0.021
LysPro: 1.718 ± 0.047
LysGln: 0.709 ± 0.024
LysArg: 1.836 ± 0.038
LysSer: 1.393 ± 0.038
LysThr: 1.644 ± 0.034
LysVal: 2.345 ± 0.048
LysTrp: 0.311 ± 0.015
LysTyr: 0.495 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.151 ± 0.16
LeuCys: 0.938 ± 0.026
LeuAsp: 6.222 ± 0.073
LeuGlu: 4.837 ± 0.061
LeuPhe: 3.768 ± 0.056
LeuGly: 9.088 ± 0.089
LeuHis: 1.733 ± 0.037
LeuIle: 4.63 ± 0.053
LeuLys: 3.129 ± 0.059
LeuLeu: 10.08 ± 0.126
LeuMet: 2.339 ± 0.037
LeuAsn: 2.158 ± 0.038
LeuPro: 5.996 ± 0.085
LeuGln: 2.412 ± 0.048
LeuArg: 7.027 ± 0.088
LeuSer: 6.391 ± 0.084
LeuThr: 5.655 ± 0.104
LeuVal: 8.486 ± 0.085
LeuTrp: 1.126 ± 0.027
LeuTyr: 1.95 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.086 ± 0.047
MetCys: 0.142 ± 0.009
MetAsp: 1.04 ± 0.026
MetGlu: 0.974 ± 0.027
MetPhe: 0.66 ± 0.02
MetGly: 1.754 ± 0.036
MetHis: 0.39 ± 0.015
MetIle: 1.146 ± 0.03
MetLys: 0.754 ± 0.03
MetLeu: 2.235 ± 0.04
MetMet: 0.604 ± 0.023
MetAsn: 0.584 ± 0.021
MetPro: 1.485 ± 0.032
MetGln: 0.64 ± 0.019
MetArg: 1.836 ± 0.037
MetSer: 1.574 ± 0.03
MetThr: 1.654 ± 0.036
MetVal: 1.706 ± 0.036
MetTrp: 0.195 ± 0.011
MetTyr: 0.284 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.933 ± 0.047
AsnCys: 0.198 ± 0.012
AsnAsp: 1.088 ± 0.032
AsnGlu: 0.867 ± 0.023
AsnPhe: 0.811 ± 0.029
AsnGly: 2.18 ± 0.077
AsnHis: 0.359 ± 0.017
AsnIle: 1.064 ± 0.027
AsnLys: 0.501 ± 0.021
AsnLeu: 2.243 ± 0.046
AsnMet: 0.488 ± 0.019
AsnAsn: 0.539 ± 0.022
AsnPro: 1.614 ± 0.035
AsnGln: 0.607 ± 0.024
AsnArg: 1.473 ± 0.032
AsnSer: 1.092 ± 0.042
AsnThr: 1.111 ± 0.036
AsnVal: 1.808 ± 0.041
AsnTrp: 0.353 ± 0.016
AsnTyr: 0.586 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.187 ± 0.119
ProCys: 0.349 ± 0.016
ProAsp: 4.083 ± 0.056
ProGlu: 3.545 ± 0.061
ProPhe: 2.195 ± 0.045
ProGly: 5.352 ± 0.069
ProHis: 1.168 ± 0.029
ProIle: 2.263 ± 0.042
ProLys: 1.478 ± 0.038
ProLeu: 5.287 ± 0.081
ProMet: 1.154 ± 0.026
ProAsn: 1.317 ± 0.03
ProPro: 3.605 ± 0.086
ProGln: 1.578 ± 0.037
ProArg: 3.622 ± 0.069
ProSer: 2.784 ± 0.05
ProThr: 2.645 ± 0.051
ProVal: 4.563 ± 0.064
ProTrp: 0.714 ± 0.023
ProTyr: 1.24 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.901 ± 0.055
GlnCys: 0.184 ± 0.011
GlnAsp: 1.357 ± 0.034
GlnGlu: 1.23 ± 0.033
GlnPhe: 0.885 ± 0.024
GlnGly: 2.175 ± 0.041
GlnHis: 0.501 ± 0.02
GlnIle: 1.707 ± 0.037
GlnLys: 0.85 ± 0.025
GlnLeu: 2.304 ± 0.039
GlnMet: 0.809 ± 0.023
GlnAsn: 0.693 ± 0.02
GlnPro: 1.531 ± 0.031
GlnGln: 0.91 ± 0.028
GlnArg: 2.078 ± 0.047
GlnSer: 1.686 ± 0.06
GlnThr: 1.51 ± 0.034
GlnVal: 2.242 ± 0.044
GlnTrp: 0.333 ± 0.018
GlnTyr: 0.507 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.595 ± 0.106
ArgCys: 0.528 ± 0.021
ArgAsp: 4.079 ± 0.063
ArgGlu: 3.673 ± 0.066
ArgPhe: 2.932 ± 0.051
ArgGly: 5.217 ± 0.065
ArgHis: 1.728 ± 0.036
ArgIle: 4.074 ± 0.063
ArgLys: 1.705 ± 0.041
ArgLeu: 8.599 ± 0.099
ArgMet: 1.78 ± 0.036
ArgAsn: 1.534 ± 0.031
ArgPro: 4.145 ± 0.068
ArgGln: 2.134 ± 0.039
ArgArg: 6.34 ± 0.098
ArgSer: 3.375 ± 0.048
ArgThr: 3.836 ± 0.048
ArgVal: 5.037 ± 0.072
ArgTrp: 0.833 ± 0.024
ArgTyr: 1.6 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.576 ± 0.094
SerCys: 0.392 ± 0.016
SerAsp: 2.707 ± 0.049
SerGlu: 2.297 ± 0.044
SerPhe: 2.037 ± 0.038
SerGly: 6.284 ± 0.189
SerHis: 0.931 ± 0.025
SerIle: 2.295 ± 0.058
SerLys: 1.195 ± 0.03
SerLeu: 5.159 ± 0.073
SerMet: 1.057 ± 0.027
SerAsn: 1.15 ± 0.036
SerPro: 2.852 ± 0.051
SerGln: 1.383 ± 0.032
SerArg: 3.238 ± 0.052
SerSer: 2.59 ± 0.077
SerThr: 2.58 ± 0.064
SerVal: 3.917 ± 0.074
SerTrp: 0.649 ± 0.023
SerTyr: 1.209 ± 0.032
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.588 ± 0.085
ThrCys: 0.456 ± 0.021
ThrAsp: 2.567 ± 0.05
ThrGlu: 2.393 ± 0.049
ThrPhe: 2.177 ± 0.045
ThrGly: 5.714 ± 0.132
ThrHis: 0.944 ± 0.025
ThrIle: 2.666 ± 0.063
ThrLys: 1.217 ± 0.031
ThrLeu: 6.591 ± 0.149
ThrMet: 1.079 ± 0.024
ThrAsn: 1.241 ± 0.042
ThrPro: 3.506 ± 0.056
ThrGln: 1.381 ± 0.034
ThrArg: 3.442 ± 0.057
ThrSer: 2.769 ± 0.06
ThrThr: 2.835 ± 0.082
ThrVal: 4.574 ± 0.075
ThrTrp: 0.621 ± 0.023
ThrTyr: 1.338 ± 0.037
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.642 ± 0.094
ValCys: 0.717 ± 0.021
ValAsp: 4.044 ± 0.054
ValGlu: 4.351 ± 0.059
ValPhe: 2.939 ± 0.054
ValGly: 6.111 ± 0.072
ValHis: 1.414 ± 0.03
ValIle: 3.602 ± 0.054
ValLys: 2.095 ± 0.044
ValLeu: 7.964 ± 0.086
ValMet: 1.728 ± 0.035
ValAsn: 1.888 ± 0.038
ValPro: 4.535 ± 0.06
ValGln: 1.987 ± 0.041
ValArg: 5.833 ± 0.077
ValSer: 4.315 ± 0.073
ValThr: 4.835 ± 0.071
ValVal: 6.489 ± 0.086
ValTrp: 0.888 ± 0.029
ValTyr: 1.407 ± 0.036
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.22 ± 0.03
TrpCys: 0.116 ± 0.009
TrpAsp: 0.578 ± 0.021
TrpGlu: 0.47 ± 0.02
TrpPhe: 0.485 ± 0.018
TrpGly: 0.951 ± 0.023
TrpHis: 0.32 ± 0.013
TrpIle: 0.61 ± 0.019
TrpLys: 0.322 ± 0.016
TrpLeu: 1.393 ± 0.033
TrpMet: 0.302 ± 0.015
TrpAsn: 0.372 ± 0.017
TrpPro: 0.674 ± 0.023
TrpGln: 0.462 ± 0.02
TrpArg: 1.105 ± 0.031
TrpSer: 0.775 ± 0.023
TrpThr: 0.776 ± 0.023
TrpVal: 0.802 ± 0.025
TrpTrp: 0.208 ± 0.013
TrpTyr: 0.247 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.713 ± 0.043
TyrCys: 0.214 ± 0.012
TyrAsp: 1.342 ± 0.035
TyrGlu: 1.075 ± 0.031
TyrPhe: 0.833 ± 0.028
TyrGly: 2.218 ± 0.043
TyrHis: 0.365 ± 0.017
TyrIle: 0.703 ± 0.023
TyrLys: 0.497 ± 0.021
TyrLeu: 2.188 ± 0.041
TyrMet: 0.394 ± 0.018
TyrAsn: 0.522 ± 0.019
TyrPro: 1.087 ± 0.029
TyrGln: 0.648 ± 0.022
TyrArg: 1.634 ± 0.034
TyrSer: 1.109 ± 0.03
TyrThr: 1.056 ± 0.034
TyrVal: 1.685 ± 0.036
TyrTrp: 0.308 ± 0.015
TyrTyr: 0.548 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4631 proteins (1533501 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
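A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only a minimal sketch under assumptions. The function names, the fixed seed, and the use of the standard deviation of the replicates as the reported ± value are illustrative and are not confirmed to match the procedure used for the table.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Resampling is done at the protein level: each replicate draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    # Pre-compute per-protein counts so each replicate only sums them up.
    per_protein = [pair_counts(s) for s in proteins]
    totals = [sum(c.values()) for c in per_protein]
    indices = range(len(proteins))

    estimates = []
    for _ in range(replicates):
        sample = rng.choices(indices, k=len(proteins))
        hits = sum(per_protein[i][pair] for i in sample)
        total = sum(totals[i] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
mean, err = bootstrap_error(["MAAGL", "MAAK", "GLLAA"], "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs within a protein together, so the estimate reflects variation between proteins.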

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski