Amino acid dipeptide frequency for Anaeromyxobacter sp. (strain Fw109-5)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
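As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_frequencies`, the toy sequences and the use of one-letter codes (the table uses three-letter codes) are assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for an iterable of sequences.

    Hypothetical helper: pairs are counted as overlapping windows of
    length 2 and never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made up for illustration, not taken from the proteome).
toy = ["MAAL", "AALG"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Because the pairs are ordered, AlaCys and CysAla are counted separately, which matches the asymmetric values in the table.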

There are 400 possible dipeptides (441 if we treat Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
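The uniform expectation and the red/blue labelling can be sketched as follows. The example values are copied from the table; the `classify` helper and the plain greater-than/less-than comparison against the uniform value are assumptions, since the page does not state the exact rule used for colouring.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 1 / 400 of all dipeptides, expressed in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"   # shown in red on the page
    if observed_permille < expected:
        return "under"  # shown in blue on the page
    return "as expected"


# Observed values taken from the table, in per mille.
examples = {"AlaAla": 24.406, "LeuLeu": 11.116, "CysCys": 0.113, "TrpTrp": 0.205}
for name, value in examples.items():
    fold = value / expected_permille
    print(f"{name}: {value} per mille, {classify(value)}, {fold:.2f}x expected")
```

A uniform baseline ignores the fact that single amino acids are themselves unevenly used, so a high value such as AlaAla partly reflects how common Ala is in this proteome.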

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
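For readers who want fractions or percentages instead, the conversion is a single multiplication. The helper names below are made up for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


# AlaAla from the table: 24.406 per mille.
print(permille_to_fraction(24.406))
print(permille_to_percent(24.406))
```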

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency (‰) ± error, grouped by first residue (order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa)

Ala
AlaAla: 24.406 ± 0.298
AlaCys: 1.575 ± 0.05
AlaAsp: 6.381 ± 0.076
AlaGlu: 9.574 ± 0.122
AlaPhe: 4.442 ± 0.061
AlaGly: 13.453 ± 0.125
AlaHis: 2.653 ± 0.049
AlaIle: 5.119 ± 0.06
AlaLys: 3.232 ± 0.066
AlaLeu: 18.112 ± 0.2
AlaMet: 2.498 ± 0.047
AlaAsn: 1.911 ± 0.041
AlaPro: 9.06 ± 0.121
AlaGln: 3.615 ± 0.05
AlaArg: 15.486 ± 0.198
AlaSer: 6.97 ± 0.088
AlaThr: 6.479 ± 0.091
AlaVal: 11.282 ± 0.104
AlaTrp: 1.878 ± 0.039
AlaTyr: 2.385 ± 0.045
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.346 ± 0.043
CysCys: 0.113 ± 0.016
CysAsp: 0.543 ± 0.024
CysGlu: 0.544 ± 0.022
CysPhe: 0.231 ± 0.012
CysGly: 1.098 ± 0.036
CysHis: 0.536 ± 0.076
CysIle: 0.229 ± 0.013
CysLys: 0.187 ± 0.014
CysLeu: 0.652 ± 0.018
CysMet: 0.107 ± 0.009
CysAsn: 0.178 ± 0.013
CysPro: 0.629 ± 0.029
CysGln: 0.199 ± 0.015
CysArg: 0.704 ± 0.026
CysSer: 0.517 ± 0.035
CysThr: 0.494 ± 0.029
CysVal: 0.652 ± 0.024
CysTrp: 0.095 ± 0.009
CysTyr: 0.16 ± 0.01
CysXaa: 0.0 ± 0.0

Asp
AspAla: 8.056 ± 0.092
AspCys: 0.458 ± 0.035
AspAsp: 2.7 ± 0.075
AspGlu: 3.643 ± 0.052
AspPhe: 1.438 ± 0.03
AspGly: 5.394 ± 0.111
AspHis: 1.0 ± 0.027
AspIle: 1.174 ± 0.03
AspLys: 0.85 ± 0.032
AspLeu: 5.89 ± 0.065
AspMet: 0.565 ± 0.021
AspAsn: 0.556 ± 0.023
AspPro: 4.09 ± 0.068
AspGln: 0.981 ± 0.025
AspArg: 4.598 ± 0.067
AspSer: 1.491 ± 0.031
AspThr: 1.737 ± 0.04
AspVal: 5.01 ± 0.076
AspTrp: 0.619 ± 0.022
AspTyr: 0.95 ± 0.031
AspXaa: 0.0 ± 0.0

Glu
GluAla: 9.531 ± 0.142
GluCys: 0.371 ± 0.016
GluAsp: 2.898 ± 0.049
GluGlu: 4.253 ± 0.069
GluPhe: 1.363 ± 0.034
GluGly: 5.318 ± 0.061
GluHis: 1.378 ± 0.033
GluIle: 2.596 ± 0.047
GluLys: 2.02 ± 0.051
GluLeu: 7.789 ± 0.095
GluMet: 0.979 ± 0.026
GluAsn: 0.967 ± 0.029
GluPro: 3.523 ± 0.05
GluGln: 1.884 ± 0.037
GluArg: 7.772 ± 0.103
GluSer: 2.322 ± 0.041
GluThr: 2.614 ± 0.052
GluVal: 5.453 ± 0.065
GluTrp: 0.744 ± 0.02
GluTyr: 1.024 ± 0.029
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.09 ± 0.059
PheCys: 0.322 ± 0.017
PheAsp: 1.932 ± 0.036
PheGlu: 2.12 ± 0.039
PhePhe: 1.118 ± 0.029
PheGly: 2.922 ± 0.052
PheHis: 0.712 ± 0.023
PheIle: 0.779 ± 0.025
PheLys: 0.644 ± 0.026
PheLeu: 3.126 ± 0.05
PheMet: 0.485 ± 0.017
PheAsn: 0.524 ± 0.021
PhePro: 1.616 ± 0.036
PheGln: 0.798 ± 0.02
PheArg: 2.333 ± 0.041
PheSer: 1.507 ± 0.036
PheThr: 1.68 ± 0.048
PheVal: 2.742 ± 0.047
PheTrp: 0.416 ± 0.018
PheTyr: 0.68 ± 0.023
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 13.325 ± 0.121
GlyCys: 0.985 ± 0.032
GlyAsp: 4.801 ± 0.054
GlyGlu: 6.083 ± 0.072
GlyPhe: 3.031 ± 0.041
GlyGly: 8.889 ± 0.111
GlyHis: 1.86 ± 0.04
GlyIle: 3.209 ± 0.058
GlyLys: 2.757 ± 0.052
GlyLeu: 8.718 ± 0.096
GlyMet: 1.741 ± 0.043
GlyAsn: 1.476 ± 0.039
GlyPro: 4.526 ± 0.062
GlyGln: 2.154 ± 0.036
GlyArg: 8.31 ± 0.098
GlySer: 4.304 ± 0.075
GlyThr: 4.8 ± 0.1
GlyVal: 7.223 ± 0.084
GlyTrp: 1.406 ± 0.034
GlyTyr: 1.991 ± 0.043
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.93 ± 0.053
HisCys: 0.187 ± 0.011
HisAsp: 1.129 ± 0.027
HisGlu: 1.228 ± 0.033
HisPhe: 0.632 ± 0.024
HisGly: 2.184 ± 0.058
HisHis: 0.522 ± 0.024
HisIle: 0.406 ± 0.019
HisLys: 0.333 ± 0.015
HisLeu: 2.338 ± 0.056
HisMet: 0.299 ± 0.014
HisAsn: 0.291 ± 0.013
HisPro: 1.528 ± 0.036
HisGln: 0.435 ± 0.02
HisArg: 1.756 ± 0.043
HisSer: 0.713 ± 0.025
HisThr: 0.701 ± 0.022
HisVal: 1.894 ± 0.045
HisTrp: 0.244 ± 0.011
HisTyr: 0.398 ± 0.018
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.146 ± 0.063
IleCys: 0.275 ± 0.013
IleAsp: 1.977 ± 0.036
IleGlu: 2.308 ± 0.043
IlePhe: 0.921 ± 0.03
IleGly: 2.973 ± 0.049
IleHis: 0.687 ± 0.021
IleIle: 0.767 ± 0.032
IleLys: 0.502 ± 0.023
IleLeu: 2.881 ± 0.05
IleMet: 0.355 ± 0.017
IleAsn: 0.432 ± 0.019
IlePro: 1.873 ± 0.037
IleGln: 0.84 ± 0.026
IleArg: 2.436 ± 0.042
IleSer: 1.475 ± 0.032
IleThr: 1.447 ± 0.042
IleVal: 3.151 ± 0.055
IleTrp: 0.316 ± 0.013
IleTyr: 0.553 ± 0.02
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.992 ± 0.062
LysCys: 0.165 ± 0.012
LysAsp: 1.333 ± 0.04
LysGlu: 1.506 ± 0.042
LysPhe: 0.532 ± 0.022
LysGly: 2.129 ± 0.05
LysHis: 0.473 ± 0.019
LysIle: 0.96 ± 0.029
LysLys: 1.096 ± 0.041
LysLeu: 2.699 ± 0.06
LysMet: 0.472 ± 0.021
LysAsn: 0.501 ± 0.023
LysPro: 1.479 ± 0.032
LysGln: 0.635 ± 0.024
LysArg: 1.995 ± 0.037
LysSer: 0.971 ± 0.031
LysThr: 1.181 ± 0.029
LysVal: 2.061 ± 0.045
LysTrp: 0.199 ± 0.012
LysTyr: 0.464 ± 0.021
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 18.912 ± 0.176
LeuCys: 0.885 ± 0.023
LeuAsp: 6.104 ± 0.069
LeuGlu: 7.233 ± 0.091
LeuPhe: 3.188 ± 0.054
LeuGly: 9.779 ± 0.093
LeuHis: 2.113 ± 0.035
LeuIle: 1.993 ± 0.046
LeuLys: 2.206 ± 0.048
LeuLeu: 11.116 ± 0.124
LeuMet: 1.264 ± 0.033
LeuAsn: 1.435 ± 0.04
LeuPro: 6.154 ± 0.076
LeuGln: 2.418 ± 0.045
LeuArg: 9.733 ± 0.111
LeuSer: 5.571 ± 0.063
LeuThr: 4.442 ± 0.062
LeuVal: 10.055 ± 0.114
LeuTrp: 1.177 ± 0.036
LeuTyr: 1.818 ± 0.042
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.993 ± 0.038
MetCys: 0.116 ± 0.008
MetAsp: 0.731 ± 0.021
MetGlu: 0.795 ± 0.025
MetPhe: 0.385 ± 0.018
MetGly: 1.355 ± 0.033
MetHis: 0.32 ± 0.015
MetIle: 0.636 ± 0.019
MetLys: 0.65 ± 0.022
MetLeu: 1.454 ± 0.033
MetMet: 0.302 ± 0.015
MetAsn: 0.467 ± 0.018
MetPro: 1.121 ± 0.023
MetGln: 0.454 ± 0.018
MetArg: 1.561 ± 0.034
MetSer: 1.036 ± 0.027
MetThr: 1.085 ± 0.029
MetVal: 1.047 ± 0.028
MetTrp: 0.133 ± 0.009
MetTyr: 0.232 ± 0.014
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.2 ± 0.039
AsnCys: 0.171 ± 0.012
AsnAsp: 0.8 ± 0.027
AsnGlu: 0.813 ± 0.027
AsnPhe: 0.492 ± 0.021
AsnGly: 1.492 ± 0.044
AsnHis: 0.313 ± 0.014
AsnIle: 0.467 ± 0.021
AsnLys: 0.332 ± 0.017
AsnLeu: 1.763 ± 0.038
AsnMet: 0.245 ± 0.012
AsnAsn: 0.282 ± 0.016
AsnPro: 1.281 ± 0.029
AsnGln: 0.399 ± 0.016
AsnArg: 1.162 ± 0.03
AsnSer: 0.532 ± 0.019
AsnThr: 0.653 ± 0.027
AsnVal: 1.694 ± 0.044
AsnTrp: 0.198 ± 0.02
AsnTyr: 0.322 ± 0.015
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 9.316 ± 0.114
ProCys: 0.404 ± 0.019
ProAsp: 3.546 ± 0.049
ProGlu: 4.567 ± 0.061
ProPhe: 1.931 ± 0.041
ProGly: 6.228 ± 0.076
ProHis: 1.057 ± 0.03
ProIle: 1.85 ± 0.038
ProLys: 1.236 ± 0.033
ProLeu: 5.651 ± 0.067
ProMet: 1.021 ± 0.029
ProAsn: 0.786 ± 0.024
ProPro: 4.662 ± 0.089
ProGln: 1.321 ± 0.032
ProArg: 5.353 ± 0.077
ProSer: 3.295 ± 0.051
ProThr: 2.66 ± 0.068
ProVal: 4.61 ± 0.063
ProTrp: 0.785 ± 0.02
ProTyr: 1.071 ± 0.032
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.43 ± 0.052
GlnCys: 0.192 ± 0.013
GlnAsp: 1.066 ± 0.029
GlnGlu: 1.407 ± 0.033
GlnPhe: 0.616 ± 0.02
GlnGly: 2.04 ± 0.038
GlnHis: 0.482 ± 0.018
GlnIle: 0.908 ± 0.026
GlnLys: 0.727 ± 0.028
GlnLeu: 2.861 ± 0.044
GlnMet: 0.382 ± 0.019
GlnAsn: 0.453 ± 0.021
GlnPro: 1.357 ± 0.034
GlnGln: 0.793 ± 0.027
GlnArg: 2.462 ± 0.048
GlnSer: 0.954 ± 0.026
GlnThr: 1.006 ± 0.03
GlnVal: 2.134 ± 0.039
GlnTrp: 0.281 ± 0.013
GlnTyr: 0.458 ± 0.017
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 14.329 ± 0.191
ArgCys: 0.881 ± 0.03
ArgAsp: 4.626 ± 0.066
ArgGlu: 6.474 ± 0.1
ArgPhe: 3.298 ± 0.052
ArgGly: 7.64 ± 0.112
ArgHis: 1.946 ± 0.04
ArgIle: 3.487 ± 0.045
ArgLys: 2.087 ± 0.043
ArgLeu: 10.068 ± 0.109
ArgMet: 1.869 ± 0.037
ArgAsn: 1.35 ± 0.032
ArgPro: 5.252 ± 0.077
ArgGln: 2.053 ± 0.04
ArgArg: 9.93 ± 0.151
ArgSer: 4.237 ± 0.057
ArgThr: 4.015 ± 0.052
ArgVal: 7.261 ± 0.074
ArgTrp: 1.382 ± 0.031
ArgTyr: 1.938 ± 0.034
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.501 ± 0.082
SerCys: 0.565 ± 0.034
SerAsp: 2.257 ± 0.052
SerGlu: 2.457 ± 0.042
SerPhe: 1.621 ± 0.035
SerGly: 4.981 ± 0.074
SerHis: 0.892 ± 0.026
SerIle: 1.597 ± 0.037
SerLys: 0.939 ± 0.027
SerLeu: 4.7 ± 0.059
SerMet: 0.831 ± 0.027
SerAsn: 0.801 ± 0.027
SerPro: 3.211 ± 0.054
SerGln: 1.029 ± 0.031
SerArg: 4.062 ± 0.054
SerSer: 2.676 ± 0.057
SerThr: 2.445 ± 0.056
SerVal: 3.549 ± 0.067
SerTrp: 0.763 ± 0.023
SerTyr: 0.98 ± 0.03
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.845 ± 0.086
ThrCys: 0.529 ± 0.035
ThrAsp: 2.039 ± 0.052
ThrGlu: 2.182 ± 0.044
ThrPhe: 1.725 ± 0.039
ThrGly: 4.529 ± 0.075
ThrHis: 0.867 ± 0.027
ThrIle: 1.731 ± 0.038
ThrLys: 0.991 ± 0.029
ThrLeu: 5.099 ± 0.073
ThrMet: 0.688 ± 0.021
ThrAsn: 0.835 ± 0.03
ThrPro: 3.377 ± 0.065
ThrGln: 1.007 ± 0.032
ThrArg: 3.706 ± 0.057
ThrSer: 2.521 ± 0.082
ThrThr: 2.358 ± 0.058
ThrVal: 4.1 ± 0.096
ThrTrp: 0.662 ± 0.023
ThrTyr: 0.979 ± 0.034
ThrXaa: 0.0 ± 0.0

Val
ValAla: 12.335 ± 0.132
ValCys: 0.673 ± 0.026
ValAsp: 4.515 ± 0.077
ValGlu: 5.839 ± 0.069
ValPhe: 2.467 ± 0.043
ValGly: 6.422 ± 0.08
ValHis: 1.713 ± 0.04
ValIle: 2.551 ± 0.049
ValLys: 2.298 ± 0.05
ValLeu: 9.022 ± 0.091
ValMet: 1.271 ± 0.031
ValAsn: 1.724 ± 0.042
ValPro: 4.977 ± 0.06
ValGln: 2.083 ± 0.039
ValArg: 7.614 ± 0.088
ValSer: 4.187 ± 0.073
ValThr: 4.417 ± 0.102
ValVal: 7.954 ± 0.101
ValTrp: 0.844 ± 0.026
ValTyr: 1.48 ± 0.035
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.4 ± 0.035
TrpCys: 0.113 ± 0.01
TrpAsp: 0.649 ± 0.023
TrpGlu: 0.623 ± 0.021
TrpPhe: 0.432 ± 0.016
TrpGly: 0.884 ± 0.027
TrpHis: 0.237 ± 0.012
TrpIle: 0.534 ± 0.019
TrpLys: 0.399 ± 0.016
TrpLeu: 1.547 ± 0.033
TrpMet: 0.252 ± 0.012
TrpAsn: 0.352 ± 0.02
TrpPro: 0.648 ± 0.023
TrpGln: 0.385 ± 0.016
TrpArg: 1.336 ± 0.037
TrpSer: 0.788 ± 0.023
TrpThr: 0.753 ± 0.028
TrpVal: 0.834 ± 0.025
TrpTrp: 0.205 ± 0.014
TrpTyr: 0.258 ± 0.015
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.43 ± 0.046
TyrCys: 0.231 ± 0.016
TyrAsp: 1.174 ± 0.035
TyrGlu: 1.151 ± 0.03
TyrPhe: 0.682 ± 0.024
TyrGly: 1.878 ± 0.038
TyrHis: 0.385 ± 0.016
TyrIle: 0.325 ± 0.018
TyrLys: 0.355 ± 0.019
TyrLeu: 2.11 ± 0.042
TyrMet: 0.246 ± 0.013
TyrAsn: 0.289 ± 0.014
TyrPro: 0.984 ± 0.025
TyrGln: 0.528 ± 0.021
TyrArg: 1.814 ± 0.04
TyrSer: 0.792 ± 0.026
TyrThr: 0.801 ± 0.028
TyrVal: 1.682 ± 0.036
TyrTrp: 0.281 ± 0.015
TyrTyr: 0.502 ± 0.02
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4461 proteins (1575732 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
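A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed and the choice of the standard deviation as the error measure are all assumptions; the page does not say which statistic is reported.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Hypothetical helper: each round draws len(proteins) proteins with
    replacement, so the protein, not the residue, is the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences (made up for illustration, not taken from the proteome).
toy = ["MAAL", "AALG", "GGAA", "LLKA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins keeps the dependence between neighbouring residues inside each sequence intact, which resampling individual dipeptides would not.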

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski