Amino acid dipepetide frequency for Verrucomicrobia bacterium IMCC26134

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.605AlaAla: 16.605 ± 0.225
1.138AlaCys: 1.138 ± 0.038
6.341AlaAsp: 6.341 ± 0.089
6.403AlaGlu: 6.403 ± 0.133
4.301AlaPhe: 4.301 ± 0.084
11.045AlaGly: 11.045 ± 0.159
2.232AlaHis: 2.232 ± 0.058
5.126AlaIle: 5.126 ± 0.076
4.511AlaLys: 4.511 ± 0.102
12.915AlaLeu: 12.915 ± 0.142
2.217AlaMet: 2.217 ± 0.055
3.626AlaAsn: 3.626 ± 0.145
6.132AlaPro: 6.132 ± 0.114
3.612AlaGln: 3.612 ± 0.067
7.681AlaArg: 7.681 ± 0.169
7.187AlaSer: 7.187 ± 0.13
7.632AlaThr: 7.632 ± 0.222
7.927AlaVal: 7.927 ± 0.112
1.763AlaTrp: 1.763 ± 0.058
2.62AlaTyr: 2.62 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
1.098CysAla: 1.098 ± 0.042
0.111CysCys: 0.111 ± 0.012
0.526CysAsp: 0.526 ± 0.024
0.461CysGlu: 0.461 ± 0.022
0.358CysPhe: 0.358 ± 0.022
0.861CysGly: 0.861 ± 0.034
0.269CysHis: 0.269 ± 0.024
0.443CysIle: 0.443 ± 0.02
0.229CysLys: 0.229 ± 0.016
0.903CysLeu: 0.903 ± 0.032
0.138CysMet: 0.138 ± 0.012
0.216CysAsn: 0.216 ± 0.016
0.474CysPro: 0.474 ± 0.024
0.215CysGln: 0.215 ± 0.017
0.646CysArg: 0.646 ± 0.03
0.565CysSer: 0.565 ± 0.027
0.499CysThr: 0.499 ± 0.026
0.678CysVal: 0.678 ± 0.032
0.147CysTrp: 0.147 ± 0.012
0.261CysTyr: 0.261 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
6.075AspAla: 6.075 ± 0.1
0.462AspCys: 0.462 ± 0.025
2.343AspAsp: 2.343 ± 0.059
2.824AspGlu: 2.824 ± 0.068
2.318AspPhe: 2.318 ± 0.051
4.693AspGly: 4.693 ± 0.116
1.176AspHis: 1.176 ± 0.033
2.435AspIle: 2.435 ± 0.06
1.683AspLys: 1.683 ± 0.052
5.665AspLeu: 5.665 ± 0.089
0.666AspMet: 0.666 ± 0.029
1.401AspAsn: 1.401 ± 0.041
2.999AspPro: 2.999 ± 0.06
1.479AspGln: 1.479 ± 0.043
3.218AspArg: 3.218 ± 0.072
2.712AspSer: 2.712 ± 0.065
2.998AspThr: 2.998 ± 0.069
3.373AspVal: 3.373 ± 0.064
0.971AspTrp: 0.971 ± 0.031
1.69AspTyr: 1.69 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
5.994GluAla: 5.994 ± 0.125
0.414GluCys: 0.414 ± 0.022
1.995GluAsp: 1.995 ± 0.057
2.374GluGlu: 2.374 ± 0.07
2.043GluPhe: 2.043 ± 0.051
3.455GluGly: 3.455 ± 0.074
1.136GluHis: 1.136 ± 0.043
2.986GluIle: 2.986 ± 0.071
2.416GluLys: 2.416 ± 0.079
5.902GluLeu: 5.902 ± 0.107
0.975GluMet: 0.975 ± 0.036
1.616GluAsn: 1.616 ± 0.041
2.196GluPro: 2.196 ± 0.058
1.845GluGln: 1.845 ± 0.047
3.602GluArg: 3.602 ± 0.087
2.683GluSer: 2.683 ± 0.064
2.953GluThr: 2.953 ± 0.064
3.673GluVal: 3.673 ± 0.077
0.711GluTrp: 0.711 ± 0.027
0.968GluTyr: 0.968 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
4.575PheAla: 4.575 ± 0.087
0.415PheCys: 0.415 ± 0.021
2.424PheAsp: 2.424 ± 0.051
1.795PheGlu: 1.795 ± 0.041
1.694PhePhe: 1.694 ± 0.043
3.383PheGly: 3.383 ± 0.073
0.763PheHis: 0.763 ± 0.03
1.923PheIle: 1.923 ± 0.048
1.354PheLys: 1.354 ± 0.041
3.54PheLeu: 3.54 ± 0.072
0.683PheMet: 0.683 ± 0.03
1.511PheAsn: 1.511 ± 0.041
1.745PhePro: 1.745 ± 0.047
1.1PheGln: 1.1 ± 0.031
2.208PheArg: 2.208 ± 0.056
2.701PheSer: 2.701 ± 0.083
3.036PheThr: 3.036 ± 0.103
2.681PheVal: 2.681 ± 0.066
0.578PheTrp: 0.578 ± 0.029
1.151PheTyr: 1.151 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
8.803GlyAla: 8.803 ± 0.16
0.852GlyCys: 0.852 ± 0.031
4.341GlyAsp: 4.341 ± 0.092
3.996GlyGlu: 3.996 ± 0.087
3.378GlyPhe: 3.378 ± 0.082
7.41GlyGly: 7.41 ± 0.171
1.755GlyHis: 1.755 ± 0.045
4.056GlyIle: 4.056 ± 0.088
3.461GlyLys: 3.461 ± 0.09
8.881GlyLeu: 8.881 ± 0.099
1.608GlyMet: 1.608 ± 0.047
2.799GlyAsn: 2.799 ± 0.128
3.072GlyPro: 3.072 ± 0.062
2.666GlyGln: 2.666 ± 0.054
5.345GlyArg: 5.345 ± 0.099
5.464GlySer: 5.464 ± 0.209
6.165GlyThr: 6.165 ± 0.306
6.256GlyVal: 6.256 ± 0.117
1.464GlyTrp: 1.464 ± 0.042
2.409GlyTyr: 2.409 ± 0.063
0.0GlyXaa: 0.0 ± 0.0
His
2.459HisAla: 2.459 ± 0.067
0.232HisCys: 0.232 ± 0.015
1.126HisAsp: 1.126 ± 0.038
1.075HisGlu: 1.075 ± 0.041
0.977HisPhe: 0.977 ± 0.035
1.87HisGly: 1.87 ± 0.055
0.61HisHis: 0.61 ± 0.029
0.814HisIle: 0.814 ± 0.034
0.548HisLys: 0.548 ± 0.025
2.335HisLeu: 2.335 ± 0.062
0.328HisMet: 0.328 ± 0.021
0.557HisAsn: 0.557 ± 0.023
1.452HisPro: 1.452 ± 0.045
0.622HisGln: 0.622 ± 0.025
1.395HisArg: 1.395 ± 0.044
1.057HisSer: 1.057 ± 0.041
1.149HisThr: 1.149 ± 0.038
1.381HisVal: 1.381 ± 0.048
0.366HisTrp: 0.366 ± 0.02
0.623HisTyr: 0.623 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
5.828IleAla: 5.828 ± 0.091
0.437IleCys: 0.437 ± 0.024
3.02IleAsp: 3.02 ± 0.067
2.969IleGlu: 2.969 ± 0.07
1.706IlePhe: 1.706 ± 0.056
4.121IleGly: 4.121 ± 0.086
0.981IleHis: 0.981 ± 0.034
2.186IleIle: 2.186 ± 0.058
1.672IleLys: 1.672 ± 0.048
4.284IleLeu: 4.284 ± 0.081
0.709IleMet: 0.709 ± 0.031
1.859IleAsn: 1.859 ± 0.074
2.395IlePro: 2.395 ± 0.045
1.367IleGln: 1.367 ± 0.05
2.785IleArg: 2.785 ± 0.063
2.977IleSer: 2.977 ± 0.085
3.518IleThr: 3.518 ± 0.116
3.468IleVal: 3.468 ± 0.065
0.495IleTrp: 0.495 ± 0.025
1.248IleTyr: 1.248 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
4.032LysAla: 4.032 ± 0.101
0.226LysCys: 0.226 ± 0.017
1.736LysAsp: 1.736 ± 0.048
1.619LysGlu: 1.619 ± 0.06
1.264LysPhe: 1.264 ± 0.041
2.425LysGly: 2.425 ± 0.072
0.791LysHis: 0.791 ± 0.031
2.034LysIle: 2.034 ± 0.053
1.865LysLys: 1.865 ± 0.066
3.919LysLeu: 3.919 ± 0.071
0.788LysMet: 0.788 ± 0.032
1.32LysAsn: 1.32 ± 0.043
2.261LysPro: 2.261 ± 0.065
1.072LysGln: 1.072 ± 0.034
2.155LysArg: 2.155 ± 0.063
2.209LysSer: 2.209 ± 0.057
2.481LysThr: 2.481 ± 0.064
2.559LysVal: 2.559 ± 0.064
0.534LysTrp: 0.534 ± 0.027
0.746LysTyr: 0.746 ± 0.034
0.0LysXaa: 0.0 ± 0.0
Leu
14.175LeuAla: 14.175 ± 0.188
1.098LeuCys: 1.098 ± 0.037
5.592LeuAsp: 5.592 ± 0.087
4.95LeuGlu: 4.95 ± 0.083
3.68LeuPhe: 3.68 ± 0.074
8.737LeuGly: 8.737 ± 0.116
2.309LeuHis: 2.309 ± 0.064
4.843LeuIle: 4.843 ± 0.082
3.86LeuLys: 3.86 ± 0.077
10.441LeuLeu: 10.441 ± 0.187
1.745LeuMet: 1.745 ± 0.047
3.308LeuAsn: 3.308 ± 0.094
6.074LeuPro: 6.074 ± 0.096
3.003LeuGln: 3.003 ± 0.065
7.64LeuArg: 7.64 ± 0.114
6.676LeuSer: 6.676 ± 0.13
6.993LeuThr: 6.993 ± 0.192
7.646LeuVal: 7.646 ± 0.103
1.261LeuTrp: 1.261 ± 0.044
2.147LeuTyr: 2.147 ± 0.046
0.0LeuXaa: 0.0 ± 0.0
Met
1.867MetAla: 1.867 ± 0.049
0.15MetCys: 0.15 ± 0.014
0.826MetAsp: 0.826 ± 0.031
0.8MetGlu: 0.8 ± 0.034
0.533MetPhe: 0.533 ± 0.023
1.267MetGly: 1.267 ± 0.04
0.346MetHis: 0.346 ± 0.019
0.95MetIle: 0.95 ± 0.032
0.919MetLys: 0.919 ± 0.029
1.805MetLeu: 1.805 ± 0.046
0.365MetMet: 0.365 ± 0.023
0.647MetAsn: 0.647 ± 0.025
1.182MetPro: 1.182 ± 0.039
0.584MetGln: 0.584 ± 0.025
1.17MetArg: 1.17 ± 0.042
1.324MetSer: 1.324 ± 0.039
1.067MetThr: 1.067 ± 0.035
1.121MetVal: 1.121 ± 0.037
0.138MetTrp: 0.138 ± 0.01
0.214MetTyr: 0.214 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.499AsnAla: 3.499 ± 0.091
0.244AsnCys: 0.244 ± 0.015
1.449AsnAsp: 1.449 ± 0.043
1.32AsnGlu: 1.32 ± 0.034
1.298AsnPhe: 1.298 ± 0.06
3.027AsnGly: 3.027 ± 0.142
0.631AsnHis: 0.631 ± 0.029
1.422AsnIle: 1.422 ± 0.051
0.871AsnLys: 0.871 ± 0.031
3.579AsnLeu: 3.579 ± 0.094
0.42AsnMet: 0.42 ± 0.021
1.172AsnAsn: 1.172 ± 0.06
2.285AsnPro: 2.285 ± 0.051
1.009AsnGln: 1.009 ± 0.031
1.796AsnArg: 1.796 ± 0.046
1.907AsnSer: 1.907 ± 0.078
2.202AsnThr: 2.202 ± 0.093
2.302AsnVal: 2.302 ± 0.108
0.555AsnTrp: 0.555 ± 0.03
1.033AsnTyr: 1.033 ± 0.051
0.0AsnXaa: 0.0 ± 0.0
Pro
7.83ProAla: 7.83 ± 0.151
0.383ProCys: 0.383 ± 0.022
3.245ProAsp: 3.245 ± 0.065
3.256ProGlu: 3.256 ± 0.067
1.92ProPhe: 1.92 ± 0.053
4.409ProGly: 4.409 ± 0.082
1.118ProHis: 1.118 ± 0.041
1.979ProIle: 1.979 ± 0.048
1.737ProLys: 1.737 ± 0.054
4.952ProLeu: 4.952 ± 0.091
0.931ProMet: 0.931 ± 0.035
1.377ProAsn: 1.377 ± 0.039
2.687ProPro: 2.687 ± 0.084
1.376ProGln: 1.376 ± 0.049
2.873ProArg: 2.873 ± 0.074
3.389ProSer: 3.389 ± 0.066
2.864ProThr: 2.864 ± 0.073
4.492ProVal: 4.492 ± 0.075
0.829ProTrp: 0.829 ± 0.039
1.124ProTyr: 1.124 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
3.426GlnAla: 3.426 ± 0.062
0.237GlnCys: 0.237 ± 0.016
1.19GlnAsp: 1.19 ± 0.035
1.328GlnGlu: 1.328 ± 0.043
1.12GlnPhe: 1.12 ± 0.033
2.185GlnGly: 2.185 ± 0.048
0.621GlnHis: 0.621 ± 0.025
1.754GlnIle: 1.754 ± 0.046
1.271GlnLys: 1.271 ± 0.045
3.333GlnLeu: 3.333 ± 0.059
0.587GlnMet: 0.587 ± 0.027
1.002GlnAsn: 1.002 ± 0.034
1.773GlnPro: 1.773 ± 0.048
1.08GlnGln: 1.08 ± 0.035
2.102GlnArg: 2.102 ± 0.056
1.877GlnSer: 1.877 ± 0.046
2.009GlnThr: 2.009 ± 0.078
2.177GlnVal: 2.177 ± 0.042
0.441GlnTrp: 0.441 ± 0.03
0.557GlnTyr: 0.557 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
7.013ArgAla: 7.013 ± 0.16
0.566ArgCys: 0.566 ± 0.027
3.386ArgAsp: 3.386 ± 0.085
3.835ArgGlu: 3.835 ± 0.105
2.742ArgPhe: 2.742 ± 0.056
4.094ArgGly: 4.094 ± 0.074
1.646ArgHis: 1.646 ± 0.05
3.616ArgIle: 3.616 ± 0.076
2.147ArgLys: 2.147 ± 0.061
7.488ArgLeu: 7.488 ± 0.132
1.314ArgMet: 1.314 ± 0.044
1.719ArgAsn: 1.719 ± 0.045
3.158ArgPro: 3.158 ± 0.074
2.071ArgGln: 2.071 ± 0.061
4.537ArgArg: 4.537 ± 0.109
3.464ArgSer: 3.464 ± 0.06
3.489ArgThr: 3.489 ± 0.065
4.52ArgVal: 4.52 ± 0.08
1.059ArgTrp: 1.059 ± 0.039
1.785ArgTyr: 1.785 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
7.756SerAla: 7.756 ± 0.186
0.487SerCys: 0.487 ± 0.026
3.007SerAsp: 3.007 ± 0.075
2.639SerGlu: 2.639 ± 0.056
2.53SerPhe: 2.53 ± 0.067
6.541SerGly: 6.541 ± 0.215
1.079SerHis: 1.079 ± 0.034
2.912SerIle: 2.912 ± 0.096
1.817SerLys: 1.817 ± 0.049
6.535SerLeu: 6.535 ± 0.138
1.025SerMet: 1.025 ± 0.033
1.89SerAsn: 1.89 ± 0.059
3.349SerPro: 3.349 ± 0.057
1.617SerGln: 1.617 ± 0.043
3.439SerArg: 3.439 ± 0.073
4.26SerSer: 4.26 ± 0.113
3.959SerThr: 3.959 ± 0.121
4.355SerVal: 4.355 ± 0.105
0.827SerTrp: 0.827 ± 0.031
1.56SerTyr: 1.56 ± 0.045
0.0SerXaa: 0.0 ± 0.0
Thr
7.664ThrAla: 7.664 ± 0.237
0.479ThrCys: 0.479 ± 0.023
3.092ThrAsp: 3.092 ± 0.083
2.673ThrGlu: 2.673 ± 0.061
2.618ThrPhe: 2.618 ± 0.102
6.472ThrGly: 6.472 ± 0.231
1.231ThrHis: 1.231 ± 0.042
3.028ThrIle: 3.028 ± 0.107
1.832ThrLys: 1.832 ± 0.043
8.039ThrLeu: 8.039 ± 0.232
0.884ThrMet: 0.884 ± 0.031
1.928ThrAsn: 1.928 ± 0.1
4.089ThrPro: 4.089 ± 0.091
1.795ThrGln: 1.795 ± 0.063
3.409ThrArg: 3.409 ± 0.062
3.801ThrSer: 3.801 ± 0.124
4.295ThrThr: 4.295 ± 0.192
5.19ThrVal: 5.19 ± 0.189
0.895ThrTrp: 0.895 ± 0.035
1.856ThrTyr: 1.856 ± 0.098
0.0ThrXaa: 0.0 ± 0.0
Val
8.27ValAla: 8.27 ± 0.13
0.755ValCys: 0.755 ± 0.033
3.422ValAsp: 3.422 ± 0.06
3.762ValGlu: 3.762 ± 0.079
3.046ValPhe: 3.046 ± 0.065
5.147ValGly: 5.147 ± 0.114
1.297ValHis: 1.297 ± 0.041
3.864ValIle: 3.864 ± 0.072
2.526ValLys: 2.526 ± 0.057
7.331ValLeu: 7.331 ± 0.101
1.24ValMet: 1.24 ± 0.038
2.66ValAsn: 2.66 ± 0.097
3.514ValPro: 3.514 ± 0.072
2.09ValGln: 2.09 ± 0.046
4.699ValArg: 4.699 ± 0.09
4.819ValSer: 4.819 ± 0.133
5.331ValThr: 5.331 ± 0.232
5.714ValVal: 5.714 ± 0.118
0.987ValTrp: 0.987 ± 0.033
1.598ValTyr: 1.598 ± 0.045
0.0ValXaa: 0.0 ± 0.0
Trp
1.267TrpAla: 1.267 ± 0.035
0.174TrpCys: 0.174 ± 0.011
0.695TrpAsp: 0.695 ± 0.029
0.662TrpGlu: 0.662 ± 0.031
0.623TrpPhe: 0.623 ± 0.035
1.007TrpGly: 1.007 ± 0.048
0.388TrpHis: 0.388 ± 0.022
0.708TrpIle: 0.708 ± 0.031
0.572TrpLys: 0.572 ± 0.029
1.818TrpLeu: 1.818 ± 0.063
0.326TrpMet: 0.326 ± 0.019
0.526TrpAsn: 0.526 ± 0.025
0.687TrpPro: 0.687 ± 0.033
0.584TrpGln: 0.584 ± 0.024
1.17TrpArg: 1.17 ± 0.038
1.041TrpSer: 1.041 ± 0.046
0.942TrpThr: 0.942 ± 0.034
0.9TrpVal: 0.9 ± 0.036
0.312TrpTrp: 0.312 ± 0.021
0.287TrpTyr: 0.287 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.777TyrAla: 2.777 ± 0.073
0.238TyrCys: 0.238 ± 0.016
1.425TyrAsp: 1.425 ± 0.046
1.195TyrGlu: 1.195 ± 0.04
1.111TyrPhe: 1.111 ± 0.04
2.032TyrGly: 2.032 ± 0.055
0.596TyrHis: 0.596 ± 0.037
0.947TyrIle: 0.947 ± 0.034
0.743TyrLys: 0.743 ± 0.035
2.512TyrLeu: 2.512 ± 0.052
0.328TyrMet: 0.328 ± 0.02
0.885TyrAsn: 0.885 ± 0.043
1.249TyrPro: 1.249 ± 0.041
0.886TyrGln: 0.886 ± 0.031
1.781TyrArg: 1.781 ± 0.047
1.47TyrSer: 1.47 ± 0.047
1.691TyrThr: 1.691 ± 0.065
1.657TyrVal: 1.657 ± 0.046
0.354TyrTrp: 0.354 ± 0.022
0.829TyrTyr: 0.829 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2622 proteins (1010659 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski