Amino acid dipeptide frequency for Paulinella micropora

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
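As an illustration of the idea, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the code used to build this page: the one-letter input alphabet, the mapping of every non-standard letter to X (Xaa), the pooling of counts over all proteins, and the function names are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input format).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_counts(sequence):
    """Count overlapping dipeptides; non-standard residues are mapped to 'X'."""
    residues = [aa if aa in STANDARD else "X" for aa in sequence.upper()]
    return Counter(a + b for a, b in zip(residues, residues[1:]))

def dipeptide_frequencies(sequences):
    """Pool counts over all proteins and return frequencies in per mille."""
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}

# Toy sequences for illustration only, not taken from the proteome.
proteins = ["MALLA", "ALLK"]
freqs = dipeptide_frequencies(proteins)
print(round(freqs["LL"], 1))  # 2 of 7 dipeptides are LL -> 285.7
```

Pairs are counted within each protein only, so the last residue of one sequence is never paired with the first residue of the next.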

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
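The arithmetic behind the expected value can be checked with a few lines. The sketch below assumes the red/blue marking is a simple comparison against the uniform expectation of 1/400; the exact rule used on the page is not stated, so `classify` is a hypothetical helper. The example values are taken from the table.

```python
N_STANDARD = 20

n_pairs = N_STANDARD ** 2                  # 400 ordered pairs of standard amino acids
n_pairs_with_xaa = (N_STANDARD + 1) ** 2   # 441 when Xaa is included
expected_permille = 1000.0 / n_pairs       # 2.5 per mille, i.e. 0.25%

def classify(freq_permille, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

# A few values from the table, in per mille.
examples = {"LeuLeu": 13.637, "AlaAla": 6.513, "MetCys": 0.114, "TrpTrp": 0.268}
for name, value in examples.items():
    print(name, classify(value))
```

Under this rule LeuLeu and AlaAla come out as overrepresented, while MetCys and TrpTrp come out as underrepresented.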

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions (for example, 6.513‰ corresponds to 0.006513).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.513 ± 0.195
AlaCys: 1.136 ± 0.065
AlaAsp: 3.504 ± 0.109
AlaGlu: 4.156 ± 0.131
AlaPhe: 2.745 ± 0.118
AlaGly: 5.314 ± 0.156
AlaHis: 1.422 ± 0.081
AlaIle: 6.036 ± 0.154
AlaLys: 3.101 ± 0.103
AlaLeu: 8.939 ± 0.209
AlaMet: 1.847 ± 0.098
AlaAsn: 2.661 ± 0.107
AlaPro: 2.595 ± 0.104
AlaGln: 2.855 ± 0.107
AlaArg: 4.38 ± 0.123
AlaSer: 4.992 ± 0.131
AlaThr: 3.885 ± 0.119
AlaVal: 4.856 ± 0.134
AlaTrp: 0.876 ± 0.055
AlaTyr: 1.895 ± 0.081
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.66 ± 0.051
CysCys: 0.257 ± 0.043
CysAsp: 0.649 ± 0.054
CysGlu: 0.59 ± 0.042
CysPhe: 0.689 ± 0.056
CysGly: 1.165 ± 0.069
CysHis: 0.308 ± 0.034
CysIle: 0.942 ± 0.06
CysLys: 0.52 ± 0.045
CysLeu: 1.503 ± 0.075
CysMet: 0.213 ± 0.029
CysAsn: 0.671 ± 0.05
CysPro: 0.601 ± 0.044
CysGln: 0.509 ± 0.042
CysArg: 0.788 ± 0.058
CysSer: 0.927 ± 0.055
CysThr: 0.652 ± 0.052
CysVal: 0.711 ± 0.044
CysTrp: 0.227 ± 0.03
CysTyr: 0.414 ± 0.038
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.134 ± 0.107
AspCys: 0.605 ± 0.046
AspAsp: 1.942 ± 0.093
AspGlu: 2.8 ± 0.103
AspPhe: 2.049 ± 0.091
AspGly: 3.152 ± 0.107
AspHis: 1.012 ± 0.056
AspIle: 3.54 ± 0.107
AspLys: 2.148 ± 0.095
AspLeu: 6.267 ± 0.172
AspMet: 0.898 ± 0.05
AspAsn: 1.807 ± 0.089
AspPro: 2.697 ± 0.109
AspGln: 2.093 ± 0.09
AspArg: 2.807 ± 0.098
AspSer: 2.98 ± 0.112
AspThr: 2.379 ± 0.082
AspVal: 2.584 ± 0.099
AspTrp: 0.777 ± 0.05
AspTyr: 1.517 ± 0.074
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.087 ± 0.157
GluCys: 0.55 ± 0.045
GluAsp: 2.419 ± 0.107
GluGlu: 3.746 ± 0.152
GluPhe: 1.807 ± 0.082
GluGly: 3.826 ± 0.118
GluHis: 1.001 ± 0.055
GluIle: 4.64 ± 0.154
GluLys: 2.921 ± 0.113
GluLeu: 7.158 ± 0.161
GluMet: 1.367 ± 0.073
GluAsn: 2.302 ± 0.083
GluPro: 2.118 ± 0.093
GluGln: 2.499 ± 0.097
GluArg: 3.962 ± 0.133
GluSer: 3.482 ± 0.105
GluThr: 3.064 ± 0.13
GluVal: 3.94 ± 0.143
GluTrp: 0.737 ± 0.05
GluTyr: 1.272 ± 0.069
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.562 ± 0.096
PheCys: 0.641 ± 0.046
PheAsp: 2.206 ± 0.088
PheGlu: 2.096 ± 0.108
PhePhe: 2.51 ± 0.309
PheGly: 2.914 ± 0.117
PheHis: 0.902 ± 0.058
PheIle: 2.727 ± 0.128
PheLys: 1.627 ± 0.121
PheLeu: 4.563 ± 0.212
PheMet: 0.722 ± 0.055
PheAsn: 1.81 ± 0.095
PhePro: 1.708 ± 0.072
PheGln: 1.36 ± 0.079
PheArg: 2.008 ± 0.085
PheSer: 3.086 ± 0.141
PheThr: 2.181 ± 0.088
PheVal: 2.115 ± 0.111
PheTrp: 0.597 ± 0.05
PheTyr: 1.334 ± 0.087
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.922 ± 0.156
GlyCys: 1.081 ± 0.065
GlyAsp: 3.134 ± 0.113
GlyGlu: 3.566 ± 0.12
GlyPhe: 3.21 ± 0.1
GlyGly: 5.252 ± 0.181
GlyHis: 1.488 ± 0.078
GlyIle: 6.117 ± 0.15
GlyLys: 3.427 ± 0.125
GlyLeu: 8.349 ± 0.183
GlyMet: 1.616 ± 0.079
GlyAsn: 2.492 ± 0.102
GlyPro: 2.507 ± 0.101
GlyGln: 2.815 ± 0.11
GlyArg: 4.156 ± 0.138
GlySer: 4.871 ± 0.149
GlyThr: 4.028 ± 0.127
GlyVal: 4.603 ± 0.144
GlyTrp: 1.268 ± 0.069
GlyTyr: 2.236 ± 0.089
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.272 ± 0.067
HisCys: 0.326 ± 0.041
HisAsp: 0.733 ± 0.049
HisGlu: 0.953 ± 0.059
HisPhe: 0.909 ± 0.06
HisGly: 1.58 ± 0.074
HisHis: 0.674 ± 0.059
HisIle: 1.459 ± 0.075
HisLys: 0.865 ± 0.053
HisLeu: 2.598 ± 0.11
HisMet: 0.414 ± 0.042
HisAsn: 0.894 ± 0.057
HisPro: 1.246 ± 0.076
HisGln: 1.001 ± 0.058
HisArg: 1.448 ± 0.074
HisSer: 1.396 ± 0.076
HisThr: 0.968 ± 0.059
HisVal: 1.081 ± 0.062
HisTrp: 0.41 ± 0.041
HisTyr: 0.634 ± 0.048
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.003 ± 0.167
IleCys: 1.023 ± 0.06
IleAsp: 4.299 ± 0.129
IleGlu: 4.695 ± 0.143
IlePhe: 2.727 ± 0.11
IleGly: 5.776 ± 0.122
IleHis: 1.554 ± 0.085
IleIle: 4.981 ± 0.151
IleLys: 3.218 ± 0.112
IleLeu: 8.393 ± 0.196
IleMet: 1.162 ± 0.061
IleAsn: 3.636 ± 0.143
IlePro: 3.746 ± 0.109
IleGln: 2.76 ± 0.091
IleArg: 4.116 ± 0.129
IleSer: 5.604 ± 0.142
IleThr: 4.343 ± 0.138
IleVal: 4.911 ± 0.147
IleTrp: 0.902 ± 0.065
IleTyr: 1.77 ± 0.08
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.709 ± 0.119
LysCys: 0.436 ± 0.042
LysAsp: 2.129 ± 0.087
LysGlu: 2.914 ± 0.101
LysPhe: 1.345 ± 0.109
LysGly: 3.13 ± 0.111
LysHis: 0.814 ± 0.054
LysIle: 3.397 ± 0.117
LysLys: 2.565 ± 0.162
LysLeu: 4.878 ± 0.126
LysMet: 0.858 ± 0.053
LysAsn: 2.078 ± 0.092
LysPro: 2.038 ± 0.1
LysGln: 2.129 ± 0.097
LysArg: 2.796 ± 0.117
LysSer: 3.189 ± 0.114
LysThr: 2.335 ± 0.093
LysVal: 3.218 ± 0.104
LysTrp: 0.363 ± 0.034
LysTyr: 1.235 ± 0.069
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.613 ± 0.218
LeuCys: 1.4 ± 0.076
LeuAsp: 5.695 ± 0.151
LeuGlu: 7.502 ± 0.189
LeuPhe: 4.075 ± 0.159
LeuGly: 8.034 ± 0.183
LeuHis: 2.558 ± 0.091
LeuIle: 8.572 ± 0.199
LeuLys: 5.549 ± 0.136
LeuLeu: 13.637 ± 0.307
LeuMet: 2.485 ± 0.106
LeuAsn: 5.116 ± 0.15
LeuPro: 5.838 ± 0.162
LeuGln: 4.933 ± 0.147
LeuArg: 6.773 ± 0.17
LeuSer: 8.638 ± 0.175
LeuThr: 6.348 ± 0.157
LeuVal: 7.784 ± 0.189
LeuTrp: 1.374 ± 0.075
LeuTyr: 2.602 ± 0.102
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.104 ± 0.087
MetCys: 0.114 ± 0.021
MetAsp: 0.759 ± 0.056
MetGlu: 1.198 ± 0.073
MetPhe: 0.612 ± 0.057
MetGly: 1.657 ± 0.085
MetHis: 0.348 ± 0.031
MetIle: 1.598 ± 0.077
MetLys: 1.045 ± 0.069
MetLeu: 2.06 ± 0.087
MetMet: 0.458 ± 0.04
MetAsn: 1.008 ± 0.062
MetPro: 0.997 ± 0.065
MetGln: 0.715 ± 0.059
MetArg: 1.129 ± 0.068
MetSer: 1.525 ± 0.061
MetThr: 1.209 ± 0.057
MetVal: 1.58 ± 0.082
MetTrp: 0.125 ± 0.02
MetTyr: 0.363 ± 0.035
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.532 ± 0.115
AsnCys: 0.652 ± 0.047
AsnAsp: 1.997 ± 0.073
AsnGlu: 2.25 ± 0.083
AsnPhe: 1.909 ± 0.122
AsnGly: 2.558 ± 0.086
AsnHis: 1.015 ± 0.068
AsnIle: 3.002 ± 0.108
AsnLys: 2.162 ± 0.096
AsnLeu: 5.226 ± 0.147
AsnMet: 0.773 ± 0.061
AsnAsn: 2.041 ± 0.106
AsnPro: 2.408 ± 0.093
AsnGln: 1.997 ± 0.078
AsnArg: 2.694 ± 0.088
AsnSer: 3.295 ± 0.096
AsnThr: 2.166 ± 0.092
AsnVal: 2.217 ± 0.093
AsnTrp: 0.751 ± 0.05
AsnTyr: 1.393 ± 0.06
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.679 ± 0.1
ProCys: 0.557 ± 0.048
ProAsp: 2.261 ± 0.097
ProGlu: 2.998 ± 0.083
ProPhe: 1.928 ± 0.076
ProGly: 3.189 ± 0.111
ProHis: 0.883 ± 0.062
ProIle: 3.672 ± 0.119
ProLys: 1.975 ± 0.111
ProLeu: 5.329 ± 0.159
ProMet: 0.825 ± 0.052
ProAsn: 1.95 ± 0.09
ProPro: 1.748 ± 0.094
ProGln: 1.686 ± 0.079
ProArg: 2.107 ± 0.082
ProSer: 3.156 ± 0.109
ProThr: 2.598 ± 0.098
ProVal: 3.071 ± 0.109
ProTrp: 0.696 ± 0.057
ProTyr: 1.305 ± 0.068
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.232 ± 0.123
GlnCys: 0.436 ± 0.04
GlnAsp: 1.745 ± 0.068
GlnGlu: 2.642 ± 0.104
GlnPhe: 1.29 ± 0.073
GlnGly: 2.664 ± 0.102
GlnHis: 0.847 ± 0.053
GlnIle: 2.994 ± 0.103
GlnLys: 1.961 ± 0.076
GlnLeu: 5.49 ± 0.161
GlnMet: 0.861 ± 0.052
GlnAsn: 1.488 ± 0.076
GlnPro: 1.675 ± 0.07
GlnGln: 2.085 ± 0.093
GlnArg: 2.664 ± 0.106
GlnSer: 2.76 ± 0.102
GlnThr: 1.81 ± 0.086
GlnVal: 3.156 ± 0.107
GlnTrp: 0.715 ± 0.052
GlnTyr: 0.883 ± 0.06
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.533 ± 0.13
ArgCys: 0.759 ± 0.051
ArgAsp: 2.672 ± 0.103
ArgGlu: 3.265 ± 0.114
ArgPhe: 2.496 ± 0.095
ArgGly: 3.749 ± 0.118
ArgHis: 1.312 ± 0.086
ArgIle: 4.64 ± 0.134
ArgLys: 2.675 ± 0.109
ArgLeu: 7.319 ± 0.159
ArgMet: 1.257 ± 0.074
ArgAsn: 2.547 ± 0.092
ArgPro: 2.434 ± 0.107
ArgGln: 3.009 ± 0.106
ArgArg: 4.248 ± 0.15
ArgSer: 3.815 ± 0.125
ArgThr: 2.617 ± 0.102
ArgVal: 3.54 ± 0.107
ArgTrp: 1.114 ± 0.066
ArgTyr: 1.851 ± 0.07
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.468 ± 0.137
SerCys: 0.869 ± 0.071
SerAsp: 3.265 ± 0.117
SerGlu: 3.529 ± 0.119
SerPhe: 2.976 ± 0.139
SerGly: 5.124 ± 0.144
SerHis: 1.572 ± 0.072
SerIle: 5.662 ± 0.147
SerLys: 3.141 ± 0.118
SerLeu: 8.594 ± 0.202
SerMet: 1.708 ± 0.068
SerAsn: 3.551 ± 0.12
SerPro: 2.91 ± 0.101
SerGln: 2.807 ± 0.091
SerArg: 3.991 ± 0.114
SerSer: 5.556 ± 0.173
SerThr: 4.028 ± 0.137
SerVal: 4.072 ± 0.119
SerTrp: 1.099 ± 0.064
SerTyr: 1.92 ± 0.08
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.929 ± 0.124
ThrCys: 0.696 ± 0.049
ThrAsp: 2.496 ± 0.092
ThrGlu: 2.95 ± 0.108
ThrPhe: 2.045 ± 0.088
ThrGly: 4.358 ± 0.165
ThrHis: 1.063 ± 0.065
ThrIle: 4.167 ± 0.122
ThrLys: 2.232 ± 0.102
ThrLeu: 6.219 ± 0.15
ThrMet: 0.938 ± 0.062
ThrAsn: 2.089 ± 0.09
ThrPro: 2.705 ± 0.1
ThrGln: 1.935 ± 0.086
ThrArg: 2.65 ± 0.099
ThrSer: 4.134 ± 0.138
ThrThr: 2.998 ± 0.112
ThrVal: 3.478 ± 0.104
ThrTrp: 0.553 ± 0.049
ThrTyr: 1.514 ± 0.068
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.259 ± 0.146
ValCys: 0.744 ± 0.051
ValAsp: 3.331 ± 0.11
ValGlu: 3.925 ± 0.127
ValPhe: 2.562 ± 0.11
ValGly: 4.636 ± 0.152
ValHis: 1.14 ± 0.065
ValIle: 4.753 ± 0.128
ValLys: 2.635 ± 0.106
ValLeu: 6.703 ± 0.139
ValMet: 1.367 ± 0.064
ValAsn: 3.123 ± 0.103
ValPro: 2.793 ± 0.097
ValGln: 2.258 ± 0.098
ValArg: 3.478 ± 0.129
ValSer: 4.391 ± 0.119
ValThr: 3.617 ± 0.137
ValVal: 4.325 ± 0.154
ValTrp: 0.814 ± 0.063
ValTyr: 1.506 ± 0.072
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.839 ± 0.066
TrpCys: 0.209 ± 0.032
TrpAsp: 0.627 ± 0.049
TrpGlu: 0.682 ± 0.046
TrpPhe: 0.634 ± 0.05
TrpGly: 0.905 ± 0.061
TrpHis: 0.315 ± 0.039
TrpIle: 1.056 ± 0.063
TrpLys: 0.608 ± 0.047
TrpLeu: 2.115 ± 0.106
TrpMet: 0.323 ± 0.032
TrpAsn: 0.656 ± 0.045
TrpPro: 0.616 ± 0.051
TrpGln: 0.711 ± 0.055
TrpArg: 0.839 ± 0.056
TrpSer: 1.008 ± 0.063
TrpThr: 0.557 ± 0.048
TrpVal: 0.777 ± 0.057
TrpTrp: 0.268 ± 0.032
TrpTyr: 0.432 ± 0.036
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.657 ± 0.073
TyrCys: 0.506 ± 0.047
TyrAsp: 1.323 ± 0.063
TyrGlu: 1.437 ± 0.073
TyrPhe: 1.154 ± 0.083
TyrGly: 2.155 ± 0.085
TyrHis: 0.634 ± 0.053
TyrIle: 1.66 ± 0.065
TyrLys: 1.18 ± 0.076
TyrLeu: 3.189 ± 0.112
TyrMet: 0.506 ± 0.045
TyrAsn: 1.121 ± 0.066
TyrPro: 1.235 ± 0.067
TyrGln: 1.242 ± 0.071
TyrArg: 1.818 ± 0.084
TyrSer: 2.041 ± 0.087
TyrThr: 1.367 ± 0.081
TyrVal: 1.341 ± 0.073
TyrTrp: 0.509 ± 0.044
TyrTyr: 0.74 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 897 proteins (272856 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
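One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the frequency is recomputed on each resample. The sketch below follows that reading. It is not the procedure used by the database: taking the ± value as the standard deviation across replicates, the default of 100 rounds, the fixed seed, and the helper names are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_sd(sequences, pair, n_rounds=100, seed=0):
    """Resample proteins with replacement; return mean and SD in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy sequences for illustration only, not taken from the proteome.
mean_ll, sd_ll = bootstrap_sd(["MALLA", "ALLK", "GGLLA"], "LL")
print(f"LL: {mean_ll:.1f} ± {sd_ll:.1f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps each protein's dipeptides together, so the spread reflects variation between proteins.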

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski