Amino acid dipeptide frequency for Dehalogenimonas formicexedens

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
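As an illustration, here is a minimal Python sketch of how such frequencies could be computed. It assumes that overlapping pairs of adjacent residues are counted within each protein (never across two proteins) and normalized by the total number of pairs. The function name `dipeptide_per_mille` and the toy sequences are hypothetical; the exact procedure used for this page is not described here.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein separately,
        # so no pair spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome in one-letter code (not real data).
toy_proteome = ["MAAL", "ALA"]
freqs = dipeptide_per_mille(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Because the frequencies are normalized by the total number of pairs, they always sum to 1000 ‰ for a non-empty input.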

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
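The uniform expectation and the per mille units can be checked with a few lines of Python. This is only a sketch: it assumes the comparison threshold is the uniform expectation of 1/400, and the names `classify` and `per_mille_to_fraction` are hypothetical helpers. The three example values are taken from the table on this page.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400
# With Xaa counted as a 21st residue there are 441 dipeptides.
EXPECTED_WITH_XAA = 1000.0 / 441


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille copied from the table on this page.
examples = {"AlaAla": 10.931, "CysCys": 0.198, "AspAsp": 2.493}
labels = {name: classify(value) for name, value in examples.items()}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), labels[name])
```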

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.931 ± 0.204
AlaCys: 1.115 ± 0.049
AlaAsp: 4.896 ± 0.094
AlaGlu: 6.219 ± 0.106
AlaPhe: 3.396 ± 0.083
AlaGly: 8.437 ± 0.142
AlaHis: 1.482 ± 0.053
AlaIle: 6.308 ± 0.115
AlaLys: 4.405 ± 0.092
AlaLeu: 9.29 ± 0.131
AlaMet: 2.512 ± 0.068
AlaAsn: 2.808 ± 0.071
AlaPro: 3.538 ± 0.086
AlaGln: 2.841 ± 0.071
AlaArg: 5.481 ± 0.106
AlaSer: 5.73 ± 0.108
AlaThr: 4.705 ± 0.122
AlaVal: 7.544 ± 0.132
AlaTrp: 1.072 ± 0.045
AlaTyr: 2.317 ± 0.071
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.955 ± 0.047
CysCys: 0.198 ± 0.018
CysAsp: 0.581 ± 0.033
CysGlu: 0.52 ± 0.027
CysPhe: 0.44 ± 0.032
CysGly: 1.372 ± 0.048
CysHis: 0.422 ± 0.032
CysIle: 0.527 ± 0.031
CysLys: 0.479 ± 0.031
CysLeu: 1.028 ± 0.04
CysMet: 0.216 ± 0.018
CysAsn: 0.361 ± 0.023
CysPro: 0.904 ± 0.046
CysGln: 0.431 ± 0.024
CysArg: 0.866 ± 0.044
CysSer: 0.632 ± 0.036
CysThr: 0.486 ± 0.027
CysVal: 0.693 ± 0.033
CysTrp: 0.14 ± 0.015
CysTyr: 0.354 ± 0.023
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.425 ± 0.088
AspCys: 0.591 ± 0.031
AspAsp: 2.493 ± 0.076
AspGlu: 3.477 ± 0.083
AspPhe: 2.459 ± 0.068
AspGly: 3.72 ± 0.09
AspHis: 0.896 ± 0.036
AspIle: 3.915 ± 0.088
AspLys: 2.642 ± 0.066
AspLeu: 5.369 ± 0.092
AspMet: 1.13 ± 0.044
AspAsn: 1.77 ± 0.053
AspPro: 2.65 ± 0.076
AspGln: 1.426 ± 0.047
AspArg: 2.884 ± 0.077
AspSer: 2.795 ± 0.059
AspThr: 2.586 ± 0.06
AspVal: 3.491 ± 0.084
AspTrp: 0.73 ± 0.036
AspTyr: 1.902 ± 0.066
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.077 ± 0.108
GluCys: 0.611 ± 0.034
GluAsp: 2.516 ± 0.069
GluGlu: 3.832 ± 0.1
GluPhe: 2.281 ± 0.058
GluGly: 3.813 ± 0.088
GluHis: 1.159 ± 0.051
GluIle: 4.526 ± 0.098
GluLys: 3.964 ± 0.094
GluLeu: 6.532 ± 0.131
GluMet: 1.723 ± 0.052
GluAsn: 2.266 ± 0.068
GluPro: 2.345 ± 0.067
GluGln: 2.049 ± 0.071
GluArg: 3.771 ± 0.097
GluSer: 3.59 ± 0.074
GluThr: 3.745 ± 0.098
GluVal: 4.313 ± 0.094
GluTrp: 0.649 ± 0.042
GluTyr: 1.761 ± 0.06
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.358 ± 0.072
PheCys: 0.543 ± 0.026
PheAsp: 2.47 ± 0.065
PheGlu: 2.302 ± 0.07
PhePhe: 1.68 ± 0.067
PheGly: 3.473 ± 0.077
PheHis: 0.805 ± 0.036
PheIle: 2.854 ± 0.073
PheLys: 2.072 ± 0.061
PheLeu: 3.58 ± 0.08
PheMet: 1.006 ± 0.039
PheAsn: 1.695 ± 0.051
PhePro: 1.749 ± 0.05
PheGln: 1.164 ± 0.044
PheArg: 2.088 ± 0.063
PheSer: 2.729 ± 0.077
PheThr: 2.482 ± 0.065
PheVal: 2.666 ± 0.067
PheTrp: 0.535 ± 0.035
PheTyr: 1.123 ± 0.042
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.006 ± 0.146
GlyCys: 1.097 ± 0.047
GlyAsp: 3.79 ± 0.08
GlyGlu: 4.606 ± 0.098
GlyPhe: 3.697 ± 0.089
GlyGly: 6.275 ± 0.146
GlyHis: 1.65 ± 0.054
GlyIle: 5.845 ± 0.106
GlyLys: 4.605 ± 0.086
GlyLeu: 8.037 ± 0.124
GlyMet: 2.273 ± 0.063
GlyAsn: 2.655 ± 0.078
GlyPro: 2.485 ± 0.074
GlyGln: 2.617 ± 0.063
GlyArg: 4.425 ± 0.084
GlySer: 4.735 ± 0.1
GlyThr: 4.476 ± 0.127
GlyVal: 5.916 ± 0.09
GlyTrp: 1.062 ± 0.043
GlyTyr: 2.77 ± 0.062
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.344 ± 0.045
HisCys: 0.305 ± 0.022
HisAsp: 0.926 ± 0.037
HisGlu: 0.982 ± 0.043
HisPhe: 0.876 ± 0.043
HisGly: 1.482 ± 0.05
HisHis: 0.484 ± 0.03
HisIle: 1.122 ± 0.049
HisLys: 0.784 ± 0.039
HisLeu: 2.101 ± 0.063
HisMet: 0.374 ± 0.025
HisAsn: 0.66 ± 0.036
HisPro: 1.303 ± 0.054
HisGln: 0.716 ± 0.036
HisArg: 1.225 ± 0.05
HisSer: 1.082 ± 0.039
HisThr: 0.901 ± 0.041
HisVal: 1.126 ± 0.052
HisTrp: 0.255 ± 0.023
HisTyr: 0.57 ± 0.029
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.665 ± 0.103
IleCys: 0.744 ± 0.039
IleAsp: 4.272 ± 0.082
IleGlu: 4.409 ± 0.089
IlePhe: 2.543 ± 0.065
IleGly: 5.514 ± 0.113
IleHis: 1.168 ± 0.046
IleIle: 4.621 ± 0.093
IleLys: 3.34 ± 0.082
IleLeu: 6.079 ± 0.098
IleMet: 1.462 ± 0.051
IleAsn: 2.306 ± 0.068
IlePro: 3.188 ± 0.083
IleGln: 1.726 ± 0.061
IleArg: 3.417 ± 0.071
IleSer: 4.279 ± 0.084
IleThr: 4.004 ± 0.099
IleVal: 4.76 ± 0.088
IleTrp: 0.664 ± 0.041
IleTyr: 1.596 ± 0.055
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.959 ± 0.105
LysCys: 0.616 ± 0.035
LysAsp: 2.431 ± 0.059
LysGlu: 3.348 ± 0.081
LysPhe: 1.625 ± 0.054
LysGly: 3.577 ± 0.089
LysHis: 0.891 ± 0.039
LysIle: 3.365 ± 0.067
LysLys: 3.004 ± 0.086
LysLeu: 5.015 ± 0.109
LysMet: 1.487 ± 0.058
LysAsn: 1.965 ± 0.057
LysPro: 2.729 ± 0.061
LysGln: 1.665 ± 0.046
LysArg: 2.903 ± 0.079
LysSer: 3.025 ± 0.076
LysThr: 3.348 ± 0.076
LysVal: 3.661 ± 0.08
LysTrp: 0.555 ± 0.03
LysTyr: 1.52 ± 0.055
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 10.316 ± 0.153
LeuCys: 0.97 ± 0.04
LeuAsp: 5.392 ± 0.096
LeuGlu: 6.046 ± 0.116
LeuPhe: 3.931 ± 0.09
LeuGly: 8.154 ± 0.128
LeuHis: 1.555 ± 0.047
LeuIle: 6.217 ± 0.112
LeuLys: 6.034 ± 0.113
LeuLeu: 8.944 ± 0.152
LeuMet: 1.998 ± 0.055
LeuAsn: 3.526 ± 0.077
LeuPro: 5.102 ± 0.099
LeuGln: 2.508 ± 0.07
LeuArg: 5.287 ± 0.106
LeuSer: 6.998 ± 0.107
LeuThr: 5.545 ± 0.107
LeuVal: 7.133 ± 0.12
LeuTrp: 0.998 ± 0.051
LeuTyr: 2.154 ± 0.057
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.787 ± 0.067
MetCys: 0.17 ± 0.02
MetAsp: 1.158 ± 0.037
MetGlu: 1.271 ± 0.044
MetPhe: 0.827 ± 0.041
MetGly: 1.756 ± 0.064
MetHis: 0.329 ± 0.024
MetIle: 1.357 ± 0.049
MetLys: 1.5 ± 0.051
MetLeu: 2.342 ± 0.067
MetMet: 0.71 ± 0.042
MetAsn: 0.986 ± 0.035
MetPro: 1.545 ± 0.047
MetGln: 0.571 ± 0.033
MetArg: 1.248 ± 0.043
MetSer: 1.56 ± 0.048
MetThr: 1.634 ± 0.053
MetVal: 2.006 ± 0.069
MetTrp: 0.209 ± 0.019
MetTyr: 0.473 ± 0.029
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.861 ± 0.074
AsnCys: 0.433 ± 0.031
AsnAsp: 1.581 ± 0.055
AsnGlu: 1.833 ± 0.06
AsnPhe: 1.313 ± 0.045
AsnGly: 2.656 ± 0.085
AsnHis: 0.692 ± 0.035
AsnIle: 2.477 ± 0.067
AsnLys: 1.507 ± 0.056
AsnLeu: 3.674 ± 0.074
AsnMet: 0.749 ± 0.032
AsnAsn: 1.281 ± 0.053
AsnPro: 2.398 ± 0.059
AsnGln: 1.057 ± 0.038
AsnArg: 2.197 ± 0.065
AsnSer: 1.988 ± 0.07
AsnThr: 1.957 ± 0.067
AsnVal: 2.291 ± 0.066
AsnTrp: 0.494 ± 0.03
AsnTyr: 1.143 ± 0.047
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.419 ± 0.105
ProCys: 0.435 ± 0.029
ProAsp: 3.012 ± 0.076
ProGlu: 3.944 ± 0.083
ProPhe: 1.904 ± 0.061
ProGly: 4.341 ± 0.133
ProHis: 0.772 ± 0.037
ProIle: 2.479 ± 0.061
ProLys: 2.101 ± 0.052
ProLeu: 4.132 ± 0.076
ProMet: 1.13 ± 0.047
ProAsn: 1.586 ± 0.053
ProPro: 2.437 ± 0.083
ProGln: 1.556 ± 0.051
ProArg: 2.156 ± 0.062
ProSer: 2.964 ± 0.072
ProThr: 2.51 ± 0.078
ProVal: 4.234 ± 0.087
ProTrp: 0.591 ± 0.047
ProTyr: 1.217 ± 0.044
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.412 ± 0.071
GlnCys: 0.288 ± 0.023
GlnAsp: 1.294 ± 0.053
GlnGlu: 1.779 ± 0.056
GlnPhe: 1.061 ± 0.049
GlnGly: 2.039 ± 0.055
GlnHis: 0.603 ± 0.034
GlnIle: 2.016 ± 0.062
GlnLys: 1.718 ± 0.054
GlnLeu: 2.933 ± 0.077
GlnMet: 0.799 ± 0.039
GlnAsn: 1.22 ± 0.043
GlnPro: 1.569 ± 0.057
GlnGln: 1.034 ± 0.049
GlnArg: 1.734 ± 0.061
GlnSer: 1.929 ± 0.057
GlnThr: 1.683 ± 0.049
GlnVal: 2.348 ± 0.065
GlnTrp: 0.341 ± 0.026
GlnTyr: 0.794 ± 0.038
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.407 ± 0.088
ArgCys: 0.746 ± 0.039
ArgAsp: 2.908 ± 0.071
ArgGlu: 3.687 ± 0.085
ArgPhe: 2.406 ± 0.061
ArgGly: 3.803 ± 0.081
ArgHis: 1.327 ± 0.05
ArgIle: 3.806 ± 0.087
ArgLys: 2.902 ± 0.079
ArgLeu: 6.482 ± 0.139
ArgMet: 1.444 ± 0.046
ArgAsn: 1.79 ± 0.057
ArgPro: 2.375 ± 0.061
ArgGln: 2.243 ± 0.058
ArgArg: 4.042 ± 0.1
ArgSer: 3.183 ± 0.088
ArgThr: 2.625 ± 0.056
ArgVal: 3.972 ± 0.093
ArgTrp: 0.693 ± 0.032
ArgTyr: 1.859 ± 0.056
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.374 ± 0.11
SerCys: 0.725 ± 0.04
SerAsp: 2.877 ± 0.064
SerGlu: 3.506 ± 0.077
SerPhe: 2.661 ± 0.067
SerGly: 6.039 ± 0.123
SerHis: 1.29 ± 0.049
SerIle: 4.032 ± 0.09
SerLys: 2.793 ± 0.075
SerLeu: 6.484 ± 0.116
SerMet: 1.443 ± 0.049
SerAsn: 1.828 ± 0.053
SerPro: 3.065 ± 0.074
SerGln: 2.054 ± 0.07
SerArg: 3.796 ± 0.086
SerSer: 3.671 ± 0.084
SerThr: 3.152 ± 0.078
SerVal: 4.425 ± 0.088
SerTrp: 0.634 ± 0.033
SerTyr: 1.601 ± 0.058
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.399 ± 0.111
ThrCys: 0.581 ± 0.034
ThrAsp: 2.925 ± 0.072
ThrGlu: 3.254 ± 0.085
ThrPhe: 2.179 ± 0.067
ThrGly: 5.563 ± 0.096
ThrHis: 0.963 ± 0.044
ThrIle: 3.559 ± 0.077
ThrLys: 2.215 ± 0.071
ThrLeu: 5.351 ± 0.102
ThrMet: 1.222 ± 0.042
ThrAsn: 1.858 ± 0.068
ThrPro: 3.137 ± 0.073
ThrGln: 1.523 ± 0.045
ThrArg: 2.875 ± 0.072
ThrSer: 3.378 ± 0.083
ThrThr: 3.2 ± 0.128
ThrVal: 4.504 ± 0.124
ThrTrp: 0.614 ± 0.029
ThrTyr: 1.512 ± 0.052
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.292 ± 0.122
ValCys: 0.833 ± 0.036
ValAsp: 3.689 ± 0.079
ValGlu: 4.359 ± 0.083
ValPhe: 3.261 ± 0.078
ValGly: 4.949 ± 0.104
ValHis: 1.229 ± 0.05
ValIle: 5.499 ± 0.104
ValLys: 3.962 ± 0.087
ValLeu: 6.963 ± 0.123
ValMet: 1.907 ± 0.054
ValAsn: 2.541 ± 0.068
ValPro: 3.449 ± 0.085
ValGln: 1.662 ± 0.062
ValArg: 3.692 ± 0.073
ValSer: 4.919 ± 0.091
ValThr: 4.735 ± 0.106
ValVal: 5.651 ± 0.102
ValTrp: 0.805 ± 0.039
ValTyr: 1.877 ± 0.056
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.861 ± 0.044
TrpCys: 0.132 ± 0.014
TrpAsp: 0.619 ± 0.03
TrpGlu: 0.595 ± 0.035
TrpPhe: 0.489 ± 0.031
TrpGly: 0.912 ± 0.036
TrpHis: 0.26 ± 0.023
TrpIle: 0.693 ± 0.039
TrpLys: 0.524 ± 0.028
TrpLeu: 1.365 ± 0.056
TrpMet: 0.308 ± 0.026
TrpAsn: 0.487 ± 0.03
TrpPro: 0.504 ± 0.026
TrpGln: 0.659 ± 0.036
TrpArg: 0.771 ± 0.035
TrpSer: 0.532 ± 0.03
TrpThr: 0.522 ± 0.043
TrpVal: 0.833 ± 0.035
TrpTrp: 0.272 ± 0.022
TrpTyr: 0.343 ± 0.027
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.179 ± 0.059
TyrCys: 0.473 ± 0.026
TyrAsp: 1.518 ± 0.049
TyrGlu: 1.553 ± 0.051
TyrPhe: 1.349 ± 0.046
TyrGly: 2.148 ± 0.056
TyrHis: 0.641 ± 0.033
TyrIle: 1.54 ± 0.057
TyrLys: 1.138 ± 0.045
TyrLeu: 3.106 ± 0.07
TyrMet: 0.545 ± 0.03
TyrAsn: 0.991 ± 0.043
TyrPro: 1.449 ± 0.059
TyrGln: 1.09 ± 0.048
TyrArg: 1.843 ± 0.057
TyrSer: 1.731 ± 0.056
TyrThr: 1.518 ± 0.053
TyrVal: 1.706 ± 0.052
TyrTrp: 0.367 ± 0.025
TyrTyr: 0.879 ± 0.047
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2117 proteins (607201 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
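A protein-level bootstrap of this kind can be sketched in Python as follows. This is an assumption-laden illustration rather than the database's actual code: it assumes whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is reported as the error. The helper names `pair_per_mille` and `bootstrap_error`, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (not single residues).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code (not real data).
toy_proteome = ["MAALAG", "ALAAL", "GGAV", "MKLAA"]
value = pair_per_mille(toy_proteome, "AL")
error = bootstrap_error(toy_proteome, "AL")
print(f"AL: {value:.3f} ± {error:.3f}")
```

Resampling at the protein level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.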

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski