Amino acid dipeptide frequency for Nitrosovibrio sp. Nv17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
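As an illustration of how such frequencies can be obtained, the following is a minimal sketch that counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It is not the code used by Proteome-pI. The function name `dipeptide_per_mille`, the one-letter residue codes, and the toy sequences are assumptions made for this example only.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code (the table uses three-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs are counted inside each protein only,
        # so no pair spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


# Hypothetical toy sequences, not taken from the Nitrosovibrio sp. Nv17 proteome.
toy = ["MAALLA", "MALA"]
freqs = dipeptide_per_mille(toy)
print(freqs["MA"])  # 2 of the 8 pairs are "MA"
```

Pairs containing a non-standard residue would still count towards the total in this sketch but are not reported, which is one of several possible ways to treat Xaa.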

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
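The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the sketch assumes that the baseline is the uniform 1/400 value; the exact threshold used for colouring the table is not stated on this page.

```python
# Uniform expectation for 400 dipeptides: 2.5 per mille, i.e. 0.25 %.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value into a plain fraction."""
    return value * 1e-3


def classify(value, baseline=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the assumed uniform baseline."""
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "as expected"


# Two values copied from the table on this page.
table_values = {"AlaAla": 15.058, "CysCys": 0.142}
for name, value in table_values.items():
    print(name, per_mille_to_fraction(value), classify(value))
```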

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.058 ± 0.207
AlaCys: 1.359 ± 0.045
AlaAsp: 6.043 ± 0.102
AlaGlu: 6.252 ± 0.101
AlaPhe: 3.805 ± 0.069
AlaGly: 10.679 ± 0.158
AlaHis: 2.724 ± 0.055
AlaIle: 5.616 ± 0.087
AlaLys: 2.58 ± 0.057
AlaLeu: 12.862 ± 0.161
AlaMet: 2.986 ± 0.06
AlaAsn: 2.582 ± 0.062
AlaPro: 4.984 ± 0.086
AlaGln: 4.279 ± 0.089
AlaArg: 9.281 ± 0.104
AlaSer: 6.303 ± 0.088
AlaThr: 5.245 ± 0.077
AlaVal: 7.732 ± 0.094
AlaTrp: 1.638 ± 0.049
AlaTyr: 2.717 ± 0.057
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.02 ± 0.037
CysCys: 0.142 ± 0.012
CysAsp: 0.504 ± 0.029
CysGlu: 0.47 ± 0.027
CysPhe: 0.308 ± 0.018
CysGly: 0.947 ± 0.036
CysHis: 0.356 ± 0.03
CysIle: 0.488 ± 0.025
CysLys: 0.213 ± 0.017
CysLeu: 0.914 ± 0.036
CysMet: 0.198 ± 0.016
CysAsn: 0.251 ± 0.018
CysPro: 0.48 ± 0.028
CysGln: 0.256 ± 0.019
CysArg: 0.803 ± 0.039
CysSer: 0.539 ± 0.026
CysThr: 0.467 ± 0.023
CysVal: 0.637 ± 0.03
CysTrp: 0.109 ± 0.012
CysTyr: 0.238 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.813 ± 0.111
AspCys: 0.44 ± 0.022
AspAsp: 2.856 ± 0.06
AspGlu: 3.502 ± 0.076
AspPhe: 2.156 ± 0.053
AspGly: 4.851 ± 0.095
AspHis: 1.347 ± 0.042
AspIle: 3.013 ± 0.074
AspLys: 1.46 ± 0.04
AspLeu: 5.615 ± 0.086
AspMet: 1.268 ± 0.04
AspAsn: 1.243 ± 0.044
AspPro: 3.277 ± 0.064
AspGln: 1.508 ± 0.04
AspArg: 3.987 ± 0.071
AspSer: 2.558 ± 0.059
AspThr: 2.823 ± 0.05
AspVal: 3.882 ± 0.072
AspTrp: 0.868 ± 0.037
AspTyr: 1.624 ± 0.05
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.96 ± 0.109
GluCys: 0.441 ± 0.026
GluAsp: 2.698 ± 0.064
GluGlu: 3.259 ± 0.075
GluPhe: 2.022 ± 0.056
GluGly: 4.184 ± 0.079
GluHis: 1.493 ± 0.046
GluIle: 3.686 ± 0.07
GluLys: 2.413 ± 0.063
GluLeu: 5.725 ± 0.086
GluMet: 1.454 ± 0.045
GluAsn: 1.68 ± 0.047
GluPro: 2.507 ± 0.057
GluGln: 2.502 ± 0.066
GluArg: 5.063 ± 0.082
GluSer: 2.997 ± 0.063
GluThr: 3.045 ± 0.062
GluVal: 3.83 ± 0.073
GluTrp: 0.789 ± 0.033
GluTyr: 1.339 ± 0.04
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.427 ± 0.074
PheCys: 0.385 ± 0.024
PheAsp: 2.498 ± 0.056
PheGlu: 2.108 ± 0.054
PhePhe: 1.447 ± 0.046
PheGly: 3.026 ± 0.064
PheHis: 0.952 ± 0.033
PheIle: 1.849 ± 0.056
PheLys: 0.865 ± 0.032
PheLeu: 3.339 ± 0.058
PheMet: 0.878 ± 0.033
PheAsn: 1.144 ± 0.047
PhePro: 1.752 ± 0.049
PheGln: 1.155 ± 0.036
PheArg: 2.455 ± 0.061
PheSer: 2.418 ± 0.062
PheThr: 1.837 ± 0.05
PheVal: 2.424 ± 0.056
PheTrp: 0.452 ± 0.026
PheTyr: 1.017 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.148 ± 0.132
GlyCys: 0.901 ± 0.037
GlyAsp: 4.355 ± 0.103
GlyGlu: 4.981 ± 0.083
GlyPhe: 3.269 ± 0.068
GlyGly: 7.244 ± 0.152
GlyHis: 2.123 ± 0.055
GlyIle: 5.024 ± 0.087
GlyLys: 3.255 ± 0.068
GlyLeu: 8.434 ± 0.101
GlyMet: 2.682 ± 0.065
GlyAsn: 2.47 ± 0.066
GlyPro: 2.695 ± 0.059
GlyGln: 2.797 ± 0.06
GlyArg: 6.568 ± 0.099
GlySer: 5.0 ± 0.098
GlyThr: 4.586 ± 0.08
GlyVal: 6.118 ± 0.097
GlyTrp: 1.292 ± 0.04
GlyTyr: 2.625 ± 0.054
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.273 ± 0.066
HisCys: 0.275 ± 0.017
HisAsp: 1.416 ± 0.046
HisGlu: 1.341 ± 0.041
HisPhe: 0.95 ± 0.037
HisGly: 2.481 ± 0.06
HisHis: 0.813 ± 0.035
HisIle: 1.087 ± 0.034
HisLys: 0.57 ± 0.028
HisLeu: 2.674 ± 0.056
HisMet: 0.531 ± 0.026
HisAsn: 0.51 ± 0.026
HisPro: 1.757 ± 0.048
HisGln: 0.78 ± 0.031
HisArg: 1.963 ± 0.056
HisSer: 1.175 ± 0.039
HisThr: 1.141 ± 0.038
HisVal: 1.791 ± 0.047
HisTrp: 0.343 ± 0.019
HisTyr: 0.722 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.931 ± 0.091
IleCys: 0.451 ± 0.025
IleAsp: 3.332 ± 0.071
IleGlu: 3.315 ± 0.071
IlePhe: 1.587 ± 0.044
IleGly: 4.579 ± 0.076
IleHis: 1.277 ± 0.043
IleIle: 2.129 ± 0.061
IleLys: 1.395 ± 0.055
IleLeu: 5.094 ± 0.089
IleMet: 0.971 ± 0.037
IleAsn: 1.44 ± 0.046
IlePro: 2.75 ± 0.053
IleGln: 1.567 ± 0.043
IleArg: 3.924 ± 0.058
IleSer: 2.944 ± 0.064
IleThr: 2.523 ± 0.063
IleVal: 3.883 ± 0.079
IleTrp: 0.527 ± 0.028
IleTyr: 1.181 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.144 ± 0.068
LysCys: 0.191 ± 0.016
LysAsp: 1.503 ± 0.047
LysGlu: 1.725 ± 0.054
LysPhe: 0.837 ± 0.033
LysGly: 2.202 ± 0.064
LysHis: 0.643 ± 0.03
LysIle: 1.506 ± 0.055
LysLys: 1.318 ± 0.051
LysLeu: 3.103 ± 0.071
LysMet: 0.788 ± 0.032
LysAsn: 0.982 ± 0.034
LysPro: 1.657 ± 0.046
LysGln: 1.194 ± 0.04
LysArg: 2.256 ± 0.056
LysSer: 1.632 ± 0.048
LysThr: 1.721 ± 0.046
LysVal: 2.108 ± 0.058
LysTrp: 0.373 ± 0.021
LysTyr: 0.658 ± 0.033
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.675 ± 0.173
LeuCys: 0.983 ± 0.035
LeuAsp: 6.479 ± 0.1
LeuGlu: 6.276 ± 0.092
LeuPhe: 3.473 ± 0.074
LeuGly: 8.565 ± 0.126
LeuHis: 2.622 ± 0.062
LeuIle: 4.752 ± 0.079
LeuLys: 3.411 ± 0.079
LeuLeu: 11.666 ± 0.142
LeuMet: 2.33 ± 0.058
LeuAsn: 2.753 ± 0.059
LeuPro: 6.265 ± 0.092
LeuGln: 3.87 ± 0.068
LeuArg: 8.23 ± 0.113
LeuSer: 6.299 ± 0.077
LeuThr: 5.18 ± 0.078
LeuVal: 7.136 ± 0.103
LeuTrp: 1.211 ± 0.038
LeuTyr: 2.365 ± 0.056
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.738 ± 0.065
MetCys: 0.134 ± 0.013
MetAsp: 1.254 ± 0.037
MetGlu: 1.498 ± 0.044
MetPhe: 0.648 ± 0.025
MetGly: 1.85 ± 0.054
MetHis: 0.614 ± 0.027
MetIle: 1.073 ± 0.038
MetLys: 0.995 ± 0.036
MetLeu: 2.861 ± 0.055
MetMet: 0.599 ± 0.029
MetAsn: 0.819 ± 0.03
MetPro: 1.446 ± 0.044
MetGln: 1.04 ± 0.032
MetArg: 1.963 ± 0.055
MetSer: 1.451 ± 0.04
MetThr: 1.353 ± 0.042
MetVal: 1.617 ± 0.047
MetTrp: 0.153 ± 0.015
MetTyr: 0.309 ± 0.022
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.722 ± 0.062
AsnCys: 0.232 ± 0.017
AsnAsp: 1.354 ± 0.042
AsnGlu: 1.249 ± 0.039
AsnPhe: 1.029 ± 0.039
AsnGly: 2.36 ± 0.053
AsnHis: 0.644 ± 0.028
AsnIle: 1.383 ± 0.045
AsnLys: 0.668 ± 0.03
AsnLeu: 3.061 ± 0.065
AsnMet: 0.543 ± 0.024
AsnAsn: 0.733 ± 0.038
AsnPro: 1.996 ± 0.051
AsnGln: 0.89 ± 0.039
AsnArg: 2.057 ± 0.044
AsnSer: 1.263 ± 0.049
AsnThr: 1.313 ± 0.037
AsnVal: 1.902 ± 0.054
AsnTrp: 0.396 ± 0.023
AsnTyr: 0.657 ± 0.023
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.37 ± 0.102
ProCys: 0.427 ± 0.021
ProAsp: 3.503 ± 0.066
ProGlu: 3.67 ± 0.063
ProPhe: 1.752 ± 0.046
ProGly: 5.094 ± 0.085
ProHis: 1.214 ± 0.042
ProIle: 2.083 ± 0.052
ProLys: 1.172 ± 0.044
ProLeu: 5.148 ± 0.079
ProMet: 1.229 ± 0.037
ProAsn: 1.178 ± 0.037
ProPro: 2.873 ± 0.068
ProGln: 1.75 ± 0.047
ProArg: 3.163 ± 0.065
ProSer: 2.662 ± 0.063
ProThr: 2.073 ± 0.056
ProVal: 4.185 ± 0.083
ProTrp: 0.683 ± 0.029
ProTyr: 1.315 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.867 ± 0.076
GlnCys: 0.26 ± 0.016
GlnAsp: 1.808 ± 0.052
GlnGlu: 2.013 ± 0.047
GlnPhe: 1.046 ± 0.038
GlnGly: 3.016 ± 0.081
GlnHis: 0.88 ± 0.033
GlnIle: 1.816 ± 0.041
GlnLys: 1.122 ± 0.043
GlnLeu: 3.611 ± 0.072
GlnMet: 0.86 ± 0.035
GlnAsn: 0.826 ± 0.035
GlnPro: 1.912 ± 0.048
GlnGln: 1.512 ± 0.051
GlnArg: 2.903 ± 0.072
GlnSer: 1.839 ± 0.046
GlnThr: 1.528 ± 0.045
GlnVal: 2.558 ± 0.051
GlnTrp: 0.516 ± 0.027
GlnTyr: 0.786 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.513 ± 0.121
ArgCys: 0.671 ± 0.028
ArgAsp: 4.059 ± 0.066
ArgGlu: 4.812 ± 0.083
ArgPhe: 3.142 ± 0.063
ArgGly: 4.998 ± 0.078
ArgHis: 2.589 ± 0.068
ArgIle: 4.845 ± 0.079
ArgLys: 2.5 ± 0.058
ArgLeu: 9.051 ± 0.125
ArgMet: 2.135 ± 0.054
ArgAsn: 2.317 ± 0.054
ArgPro: 3.607 ± 0.061
ArgGln: 3.323 ± 0.068
ArgArg: 6.286 ± 0.105
ArgSer: 3.971 ± 0.075
ArgThr: 3.637 ± 0.071
ArgVal: 5.045 ± 0.08
ArgTrp: 1.046 ± 0.04
ArgTyr: 2.385 ± 0.059
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.867 ± 0.086
SerCys: 0.511 ± 0.025
SerAsp: 2.681 ± 0.058
SerGlu: 2.689 ± 0.057
SerPhe: 2.025 ± 0.053
SerGly: 5.879 ± 0.091
SerHis: 1.425 ± 0.038
SerIle: 2.871 ± 0.056
SerLys: 1.387 ± 0.044
SerLeu: 5.835 ± 0.094
SerMet: 1.423 ± 0.045
SerAsn: 1.38 ± 0.041
SerPro: 2.905 ± 0.057
SerGln: 1.837 ± 0.046
SerArg: 4.434 ± 0.081
SerSer: 3.067 ± 0.065
SerThr: 2.687 ± 0.057
SerVal: 4.009 ± 0.077
SerTrp: 0.686 ± 0.028
SerTyr: 1.324 ± 0.043
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.249 ± 0.088
ThrCys: 0.461 ± 0.027
ThrAsp: 2.56 ± 0.058
ThrGlu: 2.437 ± 0.06
ThrPhe: 1.79 ± 0.047
ThrGly: 4.746 ± 0.074
ThrHis: 1.248 ± 0.042
ThrIle: 2.278 ± 0.053
ThrLys: 0.994 ± 0.033
ThrLeu: 6.487 ± 0.11
ThrMet: 1.015 ± 0.034
ThrAsn: 1.011 ± 0.038
ThrPro: 3.152 ± 0.055
ThrGln: 1.67 ± 0.044
ThrArg: 3.68 ± 0.068
ThrSer: 2.531 ± 0.055
ThrThr: 2.4 ± 0.064
ThrVal: 3.651 ± 0.069
ThrTrp: 0.652 ± 0.031
ThrTyr: 1.223 ± 0.04
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.136 ± 0.116
ValCys: 0.675 ± 0.028
ValAsp: 4.146 ± 0.079
ValGlu: 4.576 ± 0.079
ValPhe: 2.627 ± 0.066
ValGly: 4.929 ± 0.08
ValHis: 1.55 ± 0.04
ValIle: 3.589 ± 0.074
ValLys: 2.13 ± 0.057
ValLeu: 7.418 ± 0.1
ValMet: 1.743 ± 0.044
ValAsn: 1.991 ± 0.06
ValPro: 3.666 ± 0.065
ValGln: 2.199 ± 0.047
ValArg: 5.391 ± 0.078
ValSer: 4.145 ± 0.066
ValThr: 3.852 ± 0.078
ValVal: 5.603 ± 0.101
ValTrp: 0.789 ± 0.033
ValTyr: 1.617 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.028 ± 0.038
TrpCys: 0.141 ± 0.015
TrpAsp: 0.602 ± 0.026
TrpGlu: 0.649 ± 0.033
TrpPhe: 0.474 ± 0.028
TrpGly: 0.865 ± 0.035
TrpHis: 0.39 ± 0.022
TrpIle: 0.779 ± 0.03
TrpLys: 0.508 ± 0.028
TrpLeu: 1.799 ± 0.055
TrpMet: 0.368 ± 0.022
TrpAsn: 0.443 ± 0.023
TrpPro: 0.578 ± 0.028
TrpGln: 0.63 ± 0.03
TrpArg: 1.178 ± 0.044
TrpSer: 0.751 ± 0.034
TrpThr: 0.539 ± 0.026
TrpVal: 0.87 ± 0.035
TrpTrp: 0.215 ± 0.017
TrpTyr: 0.336 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.787 ± 0.066
TyrCys: 0.259 ± 0.019
TyrAsp: 1.44 ± 0.039
TyrGlu: 1.265 ± 0.04
TyrPhe: 1.042 ± 0.037
TyrGly: 2.073 ± 0.054
TyrHis: 0.689 ± 0.029
TyrIle: 0.933 ± 0.036
TyrLys: 0.584 ± 0.028
TyrLeu: 2.843 ± 0.063
TyrMet: 0.414 ± 0.024
TyrAsn: 0.701 ± 0.031
TyrPro: 1.284 ± 0.044
TyrGln: 0.923 ± 0.034
TyrArg: 2.343 ± 0.046
TyrSer: 1.38 ± 0.045
TyrThr: 1.284 ± 0.042
TyrVal: 1.791 ± 0.04
TyrTrp: 0.405 ± 0.022
TyrTyr: 0.756 ± 0.033
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2594 proteins (829186 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
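A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions, not the Proteome-pI implementation. The function names, the fixed seed, the use of the sample standard deviation as the error measure, and the toy sequences are all hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Whole proteins are resampled, so residues of one protein stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy sequences, not taken from the Nitrosovibrio sp. Nv17 proteome.
toy = ["MAALLA", "MALA", "MKKLLA"]
print(bootstrap_error(toy, "LL"))
```

Resampling proteins rather than individual residue pairs keeps the pairs of one protein together, so differences in composition between proteins contribute to the estimated error.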

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski