Amino acid dipepetide frequency for Desulfurococcaceae archaeon AG1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.614AlaAla: 5.614 ± 0.124
0.51AlaCys: 0.51 ± 0.032
2.66AlaAsp: 2.66 ± 0.071
4.557AlaGlu: 4.557 ± 0.085
2.466AlaPhe: 2.466 ± 0.072
5.43AlaGly: 5.43 ± 0.112
0.958AlaHis: 0.958 ± 0.04
7.404AlaIle: 7.404 ± 0.147
3.726AlaLys: 3.726 ± 0.095
8.526AlaLeu: 8.526 ± 0.138
2.215AlaMet: 2.215 ± 0.066
1.605AlaAsn: 1.605 ± 0.052
2.752AlaPro: 2.752 ± 0.075
1.408AlaGln: 1.408 ± 0.056
6.109AlaArg: 6.109 ± 0.107
6.381AlaSer: 6.381 ± 0.122
3.123AlaThr: 3.123 ± 0.085
6.012AlaVal: 6.012 ± 0.113
0.854AlaTrp: 0.854 ± 0.041
2.822AlaTyr: 2.822 ± 0.073
0.0AlaXaa: 0.0 ± 0.0
Cys
0.321CysAla: 0.321 ± 0.022
0.072CysCys: 0.072 ± 0.014
0.315CysAsp: 0.315 ± 0.023
0.373CysGlu: 0.373 ± 0.026
0.236CysPhe: 0.236 ± 0.021
0.868CysGly: 0.868 ± 0.041
0.117CysHis: 0.117 ± 0.014
0.53CysIle: 0.53 ± 0.031
0.267CysLys: 0.267 ± 0.025
0.549CysLeu: 0.549 ± 0.033
0.133CysMet: 0.133 ± 0.016
0.178CysAsn: 0.178 ± 0.017
0.528CysPro: 0.528 ± 0.034
0.119CysGln: 0.119 ± 0.014
0.549CysArg: 0.549 ± 0.033
0.6CysSer: 0.6 ± 0.037
0.247CysThr: 0.247 ± 0.026
0.524CysVal: 0.524 ± 0.031
0.088CysTrp: 0.088 ± 0.014
0.249CysTyr: 0.249 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
3.173AspAla: 3.173 ± 0.083
0.198AspCys: 0.198 ± 0.019
1.765AspAsp: 1.765 ± 0.06
3.208AspGlu: 3.208 ± 0.075
1.286AspPhe: 1.286 ± 0.049
2.799AspGly: 2.799 ± 0.074
0.913AspHis: 0.913 ± 0.042
4.667AspIle: 4.667 ± 0.095
2.489AspLys: 2.489 ± 0.073
7.275AspLeu: 7.275 ± 0.122
1.18AspMet: 1.18 ± 0.045
1.259AspAsn: 1.259 ± 0.045
3.984AspPro: 3.984 ± 0.089
0.962AspGln: 0.962 ± 0.048
3.244AspArg: 3.244 ± 0.071
2.475AspSer: 2.475 ± 0.067
1.952AspThr: 1.952 ± 0.054
3.528AspVal: 3.528 ± 0.087
0.447AspTrp: 0.447 ± 0.029
1.778AspTyr: 1.778 ± 0.06
0.0AspXaa: 0.0 ± 0.0
Glu
6.369GluAla: 6.369 ± 0.106
0.421GluCys: 0.421 ± 0.027
3.98GluAsp: 3.98 ± 0.092
6.25GluGlu: 6.25 ± 0.14
1.718GluPhe: 1.718 ± 0.058
4.674GluGly: 4.674 ± 0.091
1.095GluHis: 1.095 ± 0.049
7.465GluIle: 7.465 ± 0.143
5.169GluLys: 5.169 ± 0.109
6.376GluLeu: 6.376 ± 0.123
1.459GluMet: 1.459 ± 0.059
2.19GluAsn: 2.19 ± 0.06
2.698GluPro: 2.698 ± 0.068
1.104GluGln: 1.104 ± 0.044
5.129GluArg: 5.129 ± 0.112
3.202GluSer: 3.202 ± 0.073
2.802GluThr: 2.802 ± 0.072
5.176GluVal: 5.176 ± 0.085
0.783GluTrp: 0.783 ± 0.039
2.419GluTyr: 2.419 ± 0.068
0.0GluXaa: 0.0 ± 0.0
Phe
1.929PheAla: 1.929 ± 0.063
0.196PheCys: 0.196 ± 0.018
1.558PheAsp: 1.558 ± 0.054
2.037PheGlu: 2.037 ± 0.069
1.174PhePhe: 1.174 ± 0.054
2.304PheGly: 2.304 ± 0.062
0.551PheHis: 0.551 ± 0.033
2.909PheIle: 2.909 ± 0.082
1.797PheLys: 1.797 ± 0.053
3.146PheLeu: 3.146 ± 0.088
0.74PheMet: 0.74 ± 0.04
0.967PheAsn: 0.967 ± 0.043
1.297PhePro: 1.297 ± 0.05
0.634PheGln: 0.634 ± 0.036
2.064PheArg: 2.064 ± 0.062
2.5PheSer: 2.5 ± 0.075
1.673PheThr: 1.673 ± 0.057
2.239PheVal: 2.239 ± 0.059
0.358PheTrp: 0.358 ± 0.026
1.3PheTyr: 1.3 ± 0.051
0.0PheXaa: 0.0 ± 0.0
Gly
5.187GlyAla: 5.187 ± 0.114
0.607GlyCys: 0.607 ± 0.038
3.977GlyAsp: 3.977 ± 0.086
5.236GlyGlu: 5.236 ± 0.103
3.534GlyPhe: 3.534 ± 0.087
5.969GlyGly: 5.969 ± 0.115
1.174GlyHis: 1.174 ± 0.045
6.927GlyIle: 6.927 ± 0.115
4.024GlyLys: 4.024 ± 0.088
7.723GlyLeu: 7.723 ± 0.128
1.92GlyMet: 1.92 ± 0.067
1.922GlyAsn: 1.922 ± 0.069
2.327GlyPro: 2.327 ± 0.072
1.012GlyGln: 1.012 ± 0.044
4.951GlyArg: 4.951 ± 0.1
6.507GlySer: 6.507 ± 0.119
2.696GlyThr: 2.696 ± 0.074
7.3GlyVal: 7.3 ± 0.126
0.965GlyTrp: 0.965 ± 0.047
4.06GlyTyr: 4.06 ± 0.094
0.0GlyXaa: 0.0 ± 0.0
His
1.136HisAla: 1.136 ± 0.048
0.101HisCys: 0.101 ± 0.014
0.677HisAsp: 0.677 ± 0.032
0.904HisGlu: 0.904 ± 0.039
0.402HisPhe: 0.402 ± 0.026
1.403HisGly: 1.403 ± 0.048
0.384HisHis: 0.384 ± 0.024
1.401HisIle: 1.401 ± 0.055
0.504HisLys: 0.504 ± 0.03
1.322HisLeu: 1.322 ± 0.047
0.456HisMet: 0.456 ± 0.03
0.429HisAsn: 0.429 ± 0.026
0.892HisPro: 0.892 ± 0.041
0.279HisGln: 0.279 ± 0.024
1.144HisArg: 1.144 ± 0.048
0.881HisSer: 0.881 ± 0.042
0.654HisThr: 0.654 ± 0.034
1.234HisVal: 1.234 ± 0.047
0.158HisTrp: 0.158 ± 0.017
0.594HisTyr: 0.594 ± 0.033
0.0HisXaa: 0.0 ± 0.0
Ile
9.425IleAla: 9.425 ± 0.141
0.607IleCys: 0.607 ± 0.031
5.891IleAsp: 5.891 ± 0.115
7.001IleGlu: 7.001 ± 0.128
2.907IlePhe: 2.907 ± 0.077
6.306IleGly: 6.306 ± 0.104
1.562IleHis: 1.562 ± 0.053
6.522IleIle: 6.522 ± 0.141
4.348IleLys: 4.348 ± 0.092
9.276IleLeu: 9.276 ± 0.158
2.046IleMet: 2.046 ± 0.058
2.858IleAsn: 2.858 ± 0.074
4.665IlePro: 4.665 ± 0.087
1.38IleGln: 1.38 ± 0.047
4.623IleArg: 4.623 ± 0.096
6.909IleSer: 6.909 ± 0.131
4.186IleThr: 4.186 ± 0.09
7.341IleVal: 7.341 ± 0.119
0.832IleTrp: 0.832 ± 0.046
4.541IleTyr: 4.541 ± 0.086
0.0IleXaa: 0.0 ± 0.0
Lys
4.564LysAla: 4.564 ± 0.097
0.466LysCys: 0.466 ± 0.029
2.509LysAsp: 2.509 ± 0.064
3.743LysGlu: 3.743 ± 0.097
0.863LysPhe: 0.863 ± 0.044
4.35LysGly: 4.35 ± 0.089
0.904LysHis: 0.904 ± 0.036
5.778LysIle: 5.778 ± 0.113
3.296LysLys: 3.296 ± 0.104
4.715LysLeu: 4.715 ± 0.107
1.091LysMet: 1.091 ± 0.047
1.659LysAsn: 1.659 ± 0.053
3.028LysPro: 3.028 ± 0.073
0.935LysGln: 0.935 ± 0.042
3.995LysArg: 3.995 ± 0.089
2.864LysSer: 2.864 ± 0.079
2.552LysThr: 2.552 ± 0.068
3.22LysVal: 3.22 ± 0.08
0.533LysTrp: 0.533 ± 0.029
1.913LysTyr: 1.913 ± 0.066
0.0LysXaa: 0.0 ± 0.0
Leu
8.427LeuAla: 8.427 ± 0.149
0.65LeuCys: 0.65 ± 0.039
4.942LeuAsp: 4.942 ± 0.08
7.691LeuGlu: 7.691 ± 0.142
2.943LeuPhe: 2.943 ± 0.082
9.236LeuGly: 9.236 ± 0.151
1.214LeuHis: 1.214 ± 0.047
7.514LeuIle: 7.514 ± 0.132
5.52LeuLys: 5.52 ± 0.117
9.472LeuLeu: 9.472 ± 0.169
2.125LeuMet: 2.125 ± 0.073
2.467LeuAsn: 2.467 ± 0.066
3.811LeuPro: 3.811 ± 0.079
1.522LeuGln: 1.522 ± 0.059
7.642LeuArg: 7.642 ± 0.129
8.162LeuSer: 8.162 ± 0.121
3.968LeuThr: 3.968 ± 0.099
8.006LeuVal: 8.006 ± 0.151
1.07LeuTrp: 1.07 ± 0.048
4.247LeuTyr: 4.247 ± 0.096
0.0LeuXaa: 0.0 ± 0.0
Met
1.859MetAla: 1.859 ± 0.049
0.155MetCys: 0.155 ± 0.016
1.243MetAsp: 1.243 ± 0.044
1.374MetGlu: 1.374 ± 0.057
0.706MetPhe: 0.706 ± 0.038
2.313MetGly: 2.313 ± 0.068
0.331MetHis: 0.331 ± 0.025
2.482MetIle: 2.482 ± 0.065
1.201MetLys: 1.201 ± 0.05
2.73MetLeu: 2.73 ± 0.068
0.526MetMet: 0.526 ± 0.04
0.663MetAsn: 0.663 ± 0.033
1.16MetPro: 1.16 ± 0.045
0.286MetGln: 0.286 ± 0.023
1.787MetArg: 1.787 ± 0.055
1.554MetSer: 1.554 ± 0.049
0.785MetThr: 0.785 ± 0.034
2.1MetVal: 2.1 ± 0.056
0.306MetTrp: 0.306 ± 0.026
0.639MetTyr: 0.639 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
2.059AsnAla: 2.059 ± 0.063
0.158AsnCys: 0.158 ± 0.018
1.025AsnAsp: 1.025 ± 0.044
1.5AsnGlu: 1.5 ± 0.051
0.614AsnPhe: 0.614 ± 0.038
1.679AsnGly: 1.679 ± 0.057
0.402AsnHis: 0.402 ± 0.029
3.366AsnIle: 3.366 ± 0.078
1.399AsnLys: 1.399 ± 0.051
2.754AsnLeu: 2.754 ± 0.059
0.946AsnMet: 0.946 ± 0.039
1.178AsnAsn: 1.178 ± 0.053
2.087AsnPro: 2.087 ± 0.06
0.515AsnGln: 0.515 ± 0.033
1.58AsnArg: 1.58 ± 0.058
1.461AsnSer: 1.461 ± 0.053
1.635AsnThr: 1.635 ± 0.052
2.048AsnVal: 2.048 ± 0.062
0.295AsnTrp: 0.295 ± 0.024
1.106AsnTyr: 1.106 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
2.588ProAla: 2.588 ± 0.075
0.272ProCys: 0.272 ± 0.024
2.078ProAsp: 2.078 ± 0.065
3.705ProGlu: 3.705 ± 0.083
1.443ProPhe: 1.443 ± 0.05
4.058ProGly: 4.058 ± 0.094
0.857ProHis: 0.857 ± 0.043
3.136ProIle: 3.136 ± 0.078
2.102ProLys: 2.102 ± 0.058
4.537ProLeu: 4.537 ± 0.111
0.985ProMet: 0.985 ± 0.045
1.234ProAsn: 1.234 ± 0.048
2.493ProPro: 2.493 ± 0.086
1.165ProGln: 1.165 ± 0.046
3.206ProArg: 3.206 ± 0.088
3.345ProSer: 3.345 ± 0.071
1.94ProThr: 1.94 ± 0.06
3.563ProVal: 3.563 ± 0.084
0.735ProTrp: 0.735 ± 0.036
1.927ProTyr: 1.927 ± 0.065
0.0ProXaa: 0.0 ± 0.0
Gln
1.504GlnAla: 1.504 ± 0.052
0.126GlnCys: 0.126 ± 0.017
0.881GlnAsp: 0.881 ± 0.045
1.133GlnGlu: 1.133 ± 0.052
0.416GlnPhe: 0.416 ± 0.026
1.461GlnGly: 1.461 ± 0.058
0.268GlnHis: 0.268 ± 0.022
1.434GlnIle: 1.434 ± 0.049
0.828GlnLys: 0.828 ± 0.04
1.635GlnLeu: 1.635 ± 0.058
0.403GlnMet: 0.403 ± 0.026
0.497GlnAsn: 0.497 ± 0.027
0.767GlnPro: 0.767 ± 0.038
0.548GlnGln: 0.548 ± 0.047
1.275GlnArg: 1.275 ± 0.043
0.924GlnSer: 0.924 ± 0.045
0.816GlnThr: 0.816 ± 0.042
1.275GlnVal: 1.275 ± 0.056
0.184GlnTrp: 0.184 ± 0.018
0.587GlnTyr: 0.587 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
4.971ArgAla: 4.971 ± 0.088
0.627ArgCys: 0.627 ± 0.031
4.042ArgAsp: 4.042 ± 0.087
5.987ArgGlu: 5.987 ± 0.112
2.413ArgPhe: 2.413 ± 0.069
5.89ArgGly: 5.89 ± 0.119
0.845ArgHis: 0.845 ± 0.042
7.309ArgIle: 7.309 ± 0.129
3.615ArgLys: 3.615 ± 0.098
6.478ArgLeu: 6.478 ± 0.122
1.439ArgMet: 1.439 ± 0.05
1.942ArgAsn: 1.942 ± 0.061
2.084ArgPro: 2.084 ± 0.071
0.837ArgGln: 0.837 ± 0.041
4.937ArgArg: 4.937 ± 0.112
5.151ArgSer: 5.151 ± 0.092
2.37ArgThr: 2.37 ± 0.062
5.969ArgVal: 5.969 ± 0.112
0.848ArgTrp: 0.848 ± 0.033
2.833ArgTyr: 2.833 ± 0.083
0.0ArgXaa: 0.0 ± 0.0
Ser
3.667SerAla: 3.667 ± 0.087
0.499SerCys: 0.499 ± 0.03
2.992SerAsp: 2.992 ± 0.073
4.215SerGlu: 4.215 ± 0.091
2.406SerPhe: 2.406 ± 0.06
5.621SerGly: 5.621 ± 0.101
1.054SerHis: 1.054 ± 0.045
7.546SerIle: 7.546 ± 0.129
3.952SerLys: 3.952 ± 0.09
7.635SerLeu: 7.635 ± 0.152
2.257SerMet: 2.257 ± 0.073
1.805SerAsn: 1.805 ± 0.059
3.337SerPro: 3.337 ± 0.071
1.432SerGln: 1.432 ± 0.054
5.319SerArg: 5.319 ± 0.117
5.4SerSer: 5.4 ± 0.132
3.008SerThr: 3.008 ± 0.078
4.712SerVal: 4.712 ± 0.095
0.922SerTrp: 0.922 ± 0.039
2.966SerTyr: 2.966 ± 0.076
0.0SerXaa: 0.0 ± 0.0
Thr
3.28ThrAla: 3.28 ± 0.08
0.319ThrCys: 0.319 ± 0.022
1.616ThrAsp: 1.616 ± 0.05
1.985ThrGlu: 1.985 ± 0.069
1.318ThrPhe: 1.318 ± 0.054
3.842ThrGly: 3.842 ± 0.087
0.812ThrHis: 0.812 ± 0.041
4.092ThrIle: 4.092 ± 0.098
1.774ThrLys: 1.774 ± 0.058
4.595ThrLeu: 4.595 ± 0.091
1.099ThrMet: 1.099 ± 0.048
1.192ThrAsn: 1.192 ± 0.045
2.649ThrPro: 2.649 ± 0.072
0.895ThrGln: 0.895 ± 0.045
2.664ThrArg: 2.664 ± 0.059
2.858ThrSer: 2.858 ± 0.078
2.244ThrThr: 2.244 ± 0.15
2.992ThrVal: 2.992 ± 0.085
0.501ThrTrp: 0.501 ± 0.036
1.668ThrTyr: 1.668 ± 0.061
0.0ThrXaa: 0.0 ± 0.0
Val
5.711ValAla: 5.711 ± 0.118
0.504ValCys: 0.504 ± 0.03
4.099ValAsp: 4.099 ± 0.087
6.379ValGlu: 6.379 ± 0.121
3.013ValPhe: 3.013 ± 0.085
5.576ValGly: 5.576 ± 0.118
0.78ValHis: 0.78 ± 0.038
7.392ValIle: 7.392 ± 0.122
4.6ValLys: 4.6 ± 0.081
6.855ValLeu: 6.855 ± 0.114
1.716ValMet: 1.716 ± 0.06
2.179ValAsn: 2.179 ± 0.058
2.669ValPro: 2.669 ± 0.068
0.98ValGln: 0.98 ± 0.041
5.727ValArg: 5.727 ± 0.108
5.994ValSer: 5.994 ± 0.107
3.4ValThr: 3.4 ± 0.092
6.624ValVal: 6.624 ± 0.12
0.787ValTrp: 0.787 ± 0.044
2.826ValTyr: 2.826 ± 0.067
0.0ValXaa: 0.0 ± 0.0
Trp
0.758TrpAla: 0.758 ± 0.042
0.076TrpCys: 0.076 ± 0.012
0.645TrpAsp: 0.645 ± 0.036
0.683TrpGlu: 0.683 ± 0.039
0.468TrpPhe: 0.468 ± 0.031
1.059TrpGly: 1.059 ± 0.038
0.167TrpHis: 0.167 ± 0.018
1.118TrpIle: 1.118 ± 0.055
0.512TrpLys: 0.512 ± 0.029
1.102TrpLeu: 1.102 ± 0.049
0.252TrpMet: 0.252 ± 0.019
0.366TrpAsn: 0.366 ± 0.025
0.339TrpPro: 0.339 ± 0.026
0.186TrpGln: 0.186 ± 0.016
1.039TrpArg: 1.039 ± 0.042
0.74TrpSer: 0.74 ± 0.042
0.283TrpThr: 0.283 ± 0.024
0.911TrpVal: 0.911 ± 0.045
0.196TrpTrp: 0.196 ± 0.02
0.429TrpTyr: 0.429 ± 0.028
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.592TyrAla: 2.592 ± 0.075
0.297TyrCys: 0.297 ± 0.022
1.686TyrAsp: 1.686 ± 0.059
2.523TyrGlu: 2.523 ± 0.065
1.048TyrPhe: 1.048 ± 0.047
3.182TyrGly: 3.182 ± 0.086
0.566TyrHis: 0.566 ± 0.036
4.233TyrIle: 4.233 ± 0.097
1.853TyrLys: 1.853 ± 0.054
3.928TyrLeu: 3.928 ± 0.089
1.21TyrMet: 1.21 ± 0.05
1.218TyrAsn: 1.218 ± 0.052
1.801TyrPro: 1.801 ± 0.053
0.764TyrGln: 0.764 ± 0.043
3.602TyrArg: 3.602 ± 0.076
2.837TyrSer: 2.837 ± 0.072
2.059TyrThr: 2.059 ± 0.067
3.04TyrVal: 3.04 ± 0.081
0.465TyrTrp: 0.465 ± 0.034
1.662TyrTyr: 1.662 ± 0.062
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1903 proteins (555226 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski