Amino acid dipeptide frequency for candidate division SR1 bacterium RAAC1_SR1_1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If all 400 dipeptides occurred in proteins at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
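The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the use of overlapping residue pairs, the mapping of every non-standard letter to `X` (Xaa), and the toy sequences are all hypothetical choices made for this example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: dipeptides are counted as overlapping adjacent pairs within
    each protein, and any non-standard letter is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from the proteome)
toy_proteins = ["MKKLLA", "MKXA"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation for the 400 standard dipeptides, in per mille
uniform_expectation = 1000.0 / 400

# Dipeptides above the uniform expectation in the toy data
over_represented = sorted(d for d, f in freqs.items() if f > uniform_expectation)
```

Multiplying any value in `freqs` by 10^-3 gives the corresponding fraction; on a real proteome the input would be the full list of protein sequences rather than two toy strings.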

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency in ‰ ± estimated error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.705 ± 0.107 | 0.553 ± 0.044 | 2.548 ± 0.095 | 3.02 ± 0.123 | 2.191 ± 0.077 | 3.804 ± 0.134 | 0.798 ± 0.051 | 4.708 ± 0.127 | 4.694 ± 0.158 | 4.843 ± 0.141 | 1.284 ± 0.068 | 2.916 ± 0.119 | 1.562 ± 0.084 | 2.065 ± 0.071 | 1.7 ± 0.091 | 3.034 ± 0.108 | 3.245 ± 0.147 | 2.866 ± 0.099 | 0.362 ± 0.036 | 2.02 ± 0.076 | 0.0 ± 0.0 |
| Cys | 0.416 ± 0.04 | 0.124 ± 0.019 | 0.548 ± 0.053 | 0.61 ± 0.043 | 0.424 ± 0.038 | 0.792 ± 0.055 | 0.177 ± 0.024 | 0.806 ± 0.052 | 0.801 ± 0.054 | 0.694 ± 0.046 | 0.16 ± 0.021 | 0.542 ± 0.049 | 0.444 ± 0.046 | 0.303 ± 0.034 | 0.219 ± 0.026 | 0.742 ± 0.061 | 0.669 ± 0.077 | 0.461 ± 0.041 | 0.062 ± 0.014 | 0.362 ± 0.035 | 0.0 ± 0.0 |
| Asp | 2.818 ± 0.091 | 0.525 ± 0.043 | 2.809 ± 0.134 | 3.627 ± 0.148 | 3.25 ± 0.11 | 3.793 ± 0.17 | 0.815 ± 0.055 | 5.81 ± 0.134 | 4.405 ± 0.133 | 4.869 ± 0.15 | 1.495 ± 0.073 | 3.082 ± 0.117 | 1.91 ± 0.08 | 2.079 ± 0.085 | 1.7 ± 0.086 | 2.896 ± 0.094 | 3.259 ± 0.113 | 3.045 ± 0.099 | 0.419 ± 0.037 | 2.2 ± 0.084 | 0.0 ± 0.0 |
| Glu | 2.998 ± 0.12 | 0.497 ± 0.042 | 3.391 ± 0.115 | 5.338 ± 0.177 | 2.548 ± 0.087 | 3.447 ± 0.122 | 1.16 ± 0.06 | 6.467 ± 0.178 | 7.709 ± 0.274 | 5.717 ± 0.163 | 1.441 ± 0.074 | 4.672 ± 0.151 | 1.309 ± 0.07 | 2.894 ± 0.112 | 2.144 ± 0.105 | 3.627 ± 0.107 | 4.02 ± 0.126 | 3.298 ± 0.128 | 0.388 ± 0.037 | 2.843 ± 0.096 | 0.0 ± 0.0 |
| Phe | 2.686 ± 0.101 | 0.537 ± 0.043 | 3.38 ± 0.099 | 2.644 ± 0.11 | 3.264 ± 0.149 | 3.658 ± 0.11 | 0.683 ± 0.054 | 3.947 ± 0.115 | 3.104 ± 0.112 | 5.467 ± 0.159 | 1.042 ± 0.053 | 2.247 ± 0.098 | 1.666 ± 0.075 | 1.554 ± 0.064 | 1.329 ± 0.063 | 4.2 ± 0.153 | 2.916 ± 0.09 | 3.568 ± 0.112 | 0.52 ± 0.044 | 1.733 ± 0.081 | 0.0 ± 0.0 |
| Gly | 3.374 ± 0.098 | 0.761 ± 0.057 | 3.447 ± 0.106 | 4.09 ± 0.139 | 3.843 ± 0.123 | 4.804 ± 0.186 | 0.888 ± 0.055 | 6.596 ± 0.185 | 5.65 ± 0.167 | 6.001 ± 0.162 | 1.497 ± 0.077 | 4.113 ± 0.179 | 1.292 ± 0.068 | 2.208 ± 0.089 | 2.079 ± 0.087 | 4.531 ± 0.182 | 4.86 ± 0.301 | 3.793 ± 0.106 | 0.596 ± 0.059 | 3.225 ± 0.107 | 0.0 ± 0.0 |
| His | 0.767 ± 0.046 | 0.216 ± 0.023 | 0.728 ± 0.047 | 0.86 ± 0.05 | 0.899 ± 0.053 | 1.034 ± 0.058 | 0.478 ± 0.041 | 1.68 ± 0.079 | 1.281 ± 0.066 | 1.514 ± 0.075 | 0.351 ± 0.034 | 0.862 ± 0.05 | 0.694 ± 0.047 | 0.638 ± 0.045 | 0.587 ± 0.039 | 0.778 ± 0.053 | 0.921 ± 0.059 | 0.843 ± 0.054 | 0.09 ± 0.016 | 0.728 ± 0.051 | 0.0 ± 0.0 |
| Ile | 5.166 ± 0.129 | 0.801 ± 0.051 | 5.607 ± 0.14 | 5.88 ± 0.154 | 4.534 ± 0.123 | 6.032 ± 0.166 | 1.348 ± 0.063 | 8.265 ± 0.206 | 8.136 ± 0.221 | 8.442 ± 0.217 | 1.638 ± 0.076 | 5.304 ± 0.183 | 3.318 ± 0.105 | 3.874 ± 0.124 | 2.773 ± 0.097 | 6.518 ± 0.182 | 5.599 ± 0.185 | 5.582 ± 0.133 | 0.556 ± 0.044 | 3.517 ± 0.138 | 0.0 ± 0.0 |
| Lys | 4.175 ± 0.123 | 0.483 ± 0.036 | 5.422 ± 0.171 | 7.462 ± 0.26 | 2.933 ± 0.111 | 4.748 ± 0.159 | 1.43 ± 0.067 | 8.585 ± 0.22 | 9.799 ± 0.284 | 6.846 ± 0.183 | 2.332 ± 0.101 | 5.911 ± 0.162 | 2.27 ± 0.099 | 3.832 ± 0.115 | 3.062 ± 0.106 | 4.352 ± 0.16 | 5.532 ± 0.124 | 4.079 ± 0.134 | 0.413 ± 0.033 | 3.596 ± 0.124 | 0.0 ± 0.0 |
| Leu | 5.034 ± 0.15 | 0.837 ± 0.057 | 4.978 ± 0.134 | 6.189 ± 0.2 | 5.046 ± 0.154 | 6.723 ± 0.169 | 1.537 ± 0.081 | 7.212 ± 0.178 | 6.919 ± 0.222 | 9.035 ± 0.232 | 1.924 ± 0.09 | 4.439 ± 0.136 | 3.203 ± 0.103 | 3.998 ± 0.109 | 3.076 ± 0.102 | 6.532 ± 0.187 | 4.826 ± 0.144 | 4.981 ± 0.152 | 0.708 ± 0.056 | 3.529 ± 0.117 | 0.0 ± 0.0 |
| Met | 1.127 ± 0.066 | 0.149 ± 0.02 | 1.11 ± 0.071 | 1.475 ± 0.081 | 1.025 ± 0.053 | 1.466 ± 0.067 | 0.337 ± 0.029 | 2.028 ± 0.088 | 2.318 ± 0.095 | 1.958 ± 0.075 | 0.646 ± 0.052 | 1.242 ± 0.062 | 0.747 ± 0.042 | 0.843 ± 0.057 | 0.826 ± 0.05 | 1.214 ± 0.06 | 1.219 ± 0.07 | 1.163 ± 0.064 | 0.087 ± 0.014 | 0.868 ± 0.053 | 0.0 ± 0.0 |
| Asn | 2.568 ± 0.093 | 0.483 ± 0.041 | 2.818 ± 0.111 | 3.321 ± 0.113 | 2.714 ± 0.104 | 3.809 ± 0.219 | 0.98 ± 0.06 | 6.484 ± 0.227 | 5.144 ± 0.15 | 4.815 ± 0.119 | 1.292 ± 0.058 | 4.397 ± 0.308 | 2.329 ± 0.083 | 2.579 ± 0.1 | 1.736 ± 0.074 | 3.413 ± 0.197 | 4.045 ± 0.218 | 2.981 ± 0.104 | 0.405 ± 0.038 | 2.374 ± 0.103 | 0.0 ± 0.0 |
| Pro | 1.835 ± 0.096 | 0.258 ± 0.028 | 1.747 ± 0.075 | 2.652 ± 0.12 | 1.506 ± 0.067 | 1.913 ± 0.088 | 0.469 ± 0.04 | 2.944 ± 0.087 | 2.38 ± 0.095 | 2.719 ± 0.111 | 0.612 ± 0.041 | 1.753 ± 0.07 | 0.674 ± 0.054 | 1.202 ± 0.071 | 0.978 ± 0.062 | 2.107 ± 0.083 | 2.2 ± 0.127 | 1.961 ± 0.102 | 0.261 ± 0.029 | 1.388 ± 0.06 | 0.0 ± 0.0 |
| Gln | 2.079 ± 0.085 | 0.281 ± 0.032 | 2.014 ± 0.079 | 3.647 ± 0.13 | 1.461 ± 0.066 | 2.292 ± 0.075 | 0.685 ± 0.046 | 3.391 ± 0.099 | 4.369 ± 0.153 | 2.992 ± 0.096 | 0.848 ± 0.051 | 2.483 ± 0.097 | 1.113 ± 0.064 | 2.233 ± 0.116 | 1.455 ± 0.078 | 2.259 ± 0.089 | 2.587 ± 0.103 | 1.812 ± 0.064 | 0.214 ± 0.025 | 1.68 ± 0.07 | 0.0 ± 0.0 |
| Arg | 1.565 ± 0.062 | 0.317 ± 0.029 | 1.832 ± 0.078 | 2.191 ± 0.092 | 1.632 ± 0.071 | 2.051 ± 0.09 | 0.59 ± 0.045 | 2.852 ± 0.112 | 2.899 ± 0.111 | 2.719 ± 0.115 | 0.742 ± 0.048 | 2.02 ± 0.078 | 0.98 ± 0.063 | 1.129 ± 0.058 | 1.225 ± 0.073 | 1.84 ± 0.081 | 1.77 ± 0.086 | 1.843 ± 0.084 | 0.239 ± 0.024 | 1.5 ± 0.069 | 0.0 ± 0.0 |
| Ser | 2.891 ± 0.093 | 0.823 ± 0.089 | 3.391 ± 0.109 | 3.461 ± 0.091 | 4.06 ± 0.128 | 5.099 ± 0.247 | 0.955 ± 0.053 | 5.56 ± 0.146 | 4.739 ± 0.144 | 6.622 ± 0.188 | 1.211 ± 0.06 | 3.276 ± 0.182 | 2.07 ± 0.081 | 2.481 ± 0.085 | 1.767 ± 0.081 | 4.857 ± 0.274 | 3.883 ± 0.219 | 3.382 ± 0.106 | 0.582 ± 0.039 | 2.823 ± 0.135 | 0.0 ± 0.0 |
| Thr | 3.023 ± 0.134 | 0.663 ± 0.084 | 3.13 ± 0.131 | 3.357 ± 0.113 | 3.071 ± 0.111 | 4.888 ± 0.297 | 0.964 ± 0.054 | 6.27 ± 0.224 | 5.018 ± 0.132 | 5.661 ± 0.141 | 1.169 ± 0.058 | 3.863 ± 0.205 | 2.568 ± 0.1 | 2.188 ± 0.094 | 1.82 ± 0.078 | 4.04 ± 0.184 | 4.95 ± 0.301 | 3.323 ± 0.142 | 0.382 ± 0.034 | 2.593 ± 0.126 | 0.0 ± 0.0 |
| Val | 3.394 ± 0.132 | 0.528 ± 0.043 | 3.253 ± 0.114 | 3.374 ± 0.119 | 2.967 ± 0.096 | 4.265 ± 0.116 | 0.82 ± 0.054 | 4.776 ± 0.128 | 4.009 ± 0.134 | 5.183 ± 0.148 | 1.082 ± 0.06 | 2.689 ± 0.103 | 1.851 ± 0.071 | 1.801 ± 0.078 | 1.717 ± 0.075 | 3.857 ± 0.129 | 3.239 ± 0.144 | 3.902 ± 0.129 | 0.435 ± 0.031 | 2.211 ± 0.089 | 0.0 ± 0.0 |
| Trp | 0.281 ± 0.031 | 0.042 ± 0.01 | 0.348 ± 0.032 | 0.405 ± 0.034 | 0.365 ± 0.039 | 0.357 ± 0.036 | 0.098 ± 0.014 | 0.773 ± 0.05 | 0.739 ± 0.052 | 0.635 ± 0.04 | 0.275 ± 0.034 | 0.478 ± 0.043 | 0.143 ± 0.021 | 0.242 ± 0.023 | 0.236 ± 0.027 | 0.455 ± 0.037 | 0.438 ± 0.042 | 0.326 ± 0.036 | 0.076 ± 0.014 | 0.334 ± 0.034 | 0.0 ± 0.0 |
| Tyr | 2.02 ± 0.081 | 0.475 ± 0.037 | 2.304 ± 0.086 | 2.309 ± 0.097 | 2.41 ± 0.085 | 2.675 ± 0.114 | 0.736 ± 0.042 | 3.736 ± 0.107 | 3.149 ± 0.121 | 3.992 ± 0.127 | 0.803 ± 0.046 | 2.399 ± 0.141 | 1.416 ± 0.069 | 1.753 ± 0.071 | 1.509 ± 0.07 | 2.68 ± 0.109 | 2.762 ± 0.104 | 2.144 ± 0.086 | 0.253 ± 0.026 | 1.666 ± 0.082 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 1059 proteins (355954 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
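A protein-level bootstrap of this kind can be sketched as follows. The source does not say which statistic is reported as the error, so this example assumes it is the standard deviation of the per-replicate frequencies; the function names `pair_counts` and `bootstrap_error`, the fixed seed, and the toy sequences are likewise hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the resampling unit is
    the protein rather than the individual residue pair.
    Assumption: the reported error is the standard deviation over replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome)
mean_kk, err_kk = bootstrap_error(["MKKLLA", "MKKA", "AAKK"], "KK")
```

Resampling whole proteins keeps the correlation between pairs from the same sequence intact, which is why a single unusual protein can widen the error for the dipeptides it is rich in.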

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski