Amino acid dipepetide frequency for Xanthomonas phage Xoo-sp13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.719AlaAla: 5.719 ± 0.332
0.685AlaCys: 0.685 ± 0.093
4.277AlaAsp: 4.277 ± 0.237
4.23AlaGlu: 4.23 ± 0.252
3.025AlaPhe: 3.025 ± 0.169
4.277AlaGly: 4.277 ± 0.293
0.957AlaHis: 0.957 ± 0.111
4.183AlaIle: 4.183 ± 0.232
4.159AlaLys: 4.159 ± 0.297
5.494AlaLeu: 5.494 ± 0.268
1.902AlaMet: 1.902 ± 0.122
3.202AlaAsn: 3.202 ± 0.19
2.552AlaPro: 2.552 ± 0.189
2.209AlaGln: 2.209 ± 0.183
3.084AlaArg: 3.084 ± 0.176
4.844AlaSer: 4.844 ± 0.34
5.116AlaThr: 5.116 ± 0.477
4.691AlaVal: 4.691 ± 0.29
0.803AlaTrp: 0.803 ± 0.112
2.647AlaTyr: 2.647 ± 0.19
0.0AlaXaa: 0.0 ± 0.0
Cys
0.579CysAla: 0.579 ± 0.084
0.154CysCys: 0.154 ± 0.046
0.555CysAsp: 0.555 ± 0.069
0.591CysGlu: 0.591 ± 0.104
0.284CysPhe: 0.284 ± 0.052
0.721CysGly: 0.721 ± 0.107
0.248CysHis: 0.248 ± 0.06
0.579CysIle: 0.579 ± 0.076
0.709CysLys: 0.709 ± 0.102
0.555CysLeu: 0.555 ± 0.098
0.213CysMet: 0.213 ± 0.057
0.579CysAsn: 0.579 ± 0.081
0.449CysPro: 0.449 ± 0.075
0.248CysGln: 0.248 ± 0.057
0.437CysArg: 0.437 ± 0.073
0.662CysSer: 0.662 ± 0.094
0.52CysThr: 0.52 ± 0.084
0.591CysVal: 0.591 ± 0.085
0.154CysTrp: 0.154 ± 0.045
0.331CysTyr: 0.331 ± 0.065
0.0CysXaa: 0.0 ± 0.0
Asp
4.643AspAla: 4.643 ± 0.293
0.508AspCys: 0.508 ± 0.08
4.277AspAsp: 4.277 ± 0.332
3.875AspGlu: 3.875 ± 0.246
2.741AspPhe: 2.741 ± 0.188
4.844AspGly: 4.844 ± 0.274
1.028AspHis: 1.028 ± 0.121
4.655AspIle: 4.655 ± 0.268
3.675AspLys: 3.675 ± 0.271
4.525AspLeu: 4.525 ± 0.252
1.831AspMet: 1.831 ± 0.144
3.344AspAsn: 3.344 ± 0.195
2.399AspPro: 2.399 ± 0.194
1.371AspGln: 1.371 ± 0.123
2.611AspArg: 2.611 ± 0.187
4.561AspSer: 4.561 ± 0.233
4.289AspThr: 4.289 ± 0.224
4.313AspVal: 4.313 ± 0.23
0.91AspTrp: 0.91 ± 0.122
2.895AspTyr: 2.895 ± 0.195
0.0AspXaa: 0.0 ± 0.0
Glu
4.324GluAla: 4.324 ± 0.245
0.756GluCys: 0.756 ± 0.102
3.769GluAsp: 3.769 ± 0.213
3.71GluGlu: 3.71 ± 0.299
3.486GluPhe: 3.486 ± 0.231
2.812GluGly: 2.812 ± 0.177
1.241GluHis: 1.241 ± 0.132
3.958GluIle: 3.958 ± 0.261
3.214GluLys: 3.214 ± 0.254
6.144GluLeu: 6.144 ± 0.354
1.56GluMet: 1.56 ± 0.155
2.611GluAsn: 2.611 ± 0.155
1.501GluPro: 1.501 ± 0.127
2.292GluGln: 2.292 ± 0.188
2.658GluArg: 2.658 ± 0.192
3.604GluSer: 3.604 ± 0.233
2.907GluThr: 2.907 ± 0.212
3.663GluVal: 3.663 ± 0.242
1.182GluTrp: 1.182 ± 0.126
3.06GluTyr: 3.06 ± 0.192
0.0GluXaa: 0.0 ± 0.0
Phe
2.469PheAla: 2.469 ± 0.189
0.449PheCys: 0.449 ± 0.072
3.119PheAsp: 3.119 ± 0.229
2.422PheGlu: 2.422 ± 0.199
1.607PhePhe: 1.607 ± 0.155
2.8PheGly: 2.8 ± 0.177
0.945PheHis: 0.945 ± 0.117
2.954PheIle: 2.954 ± 0.198
2.753PheLys: 2.753 ± 0.209
2.966PheLeu: 2.966 ± 0.185
0.945PheMet: 0.945 ± 0.117
2.824PheAsn: 2.824 ± 0.176
1.512PhePro: 1.512 ± 0.144
1.241PheGln: 1.241 ± 0.096
1.654PheArg: 1.654 ± 0.174
3.025PheSer: 3.025 ± 0.193
3.332PheThr: 3.332 ± 0.217
2.871PheVal: 2.871 ± 0.183
0.508PheTrp: 0.508 ± 0.07
1.359PheTyr: 1.359 ± 0.151
0.0PheXaa: 0.0 ± 0.0
Gly
3.403GlyAla: 3.403 ± 0.264
0.567GlyCys: 0.567 ± 0.088
3.875GlyAsp: 3.875 ± 0.199
3.143GlyGlu: 3.143 ± 0.216
2.729GlyPhe: 2.729 ± 0.168
4.478GlyGly: 4.478 ± 0.432
1.122GlyHis: 1.122 ± 0.129
3.923GlyIle: 3.923 ± 0.212
4.029GlyLys: 4.029 ± 0.253
4.431GlyLeu: 4.431 ± 0.233
1.512GlyMet: 1.512 ± 0.12
4.041GlyAsn: 4.041 ± 0.341
1.69GlyPro: 1.69 ± 0.16
2.091GlyGln: 2.091 ± 0.167
2.942GlyArg: 2.942 ± 0.166
4.714GlySer: 4.714 ± 0.334
5.092GlyThr: 5.092 ± 0.335
4.561GlyVal: 4.561 ± 0.301
0.898GlyTrp: 0.898 ± 0.111
2.8GlyTyr: 2.8 ± 0.203
0.0GlyXaa: 0.0 ± 0.0
His
1.217HisAla: 1.217 ± 0.123
0.189HisCys: 0.189 ± 0.047
1.276HisAsp: 1.276 ± 0.127
1.075HisGlu: 1.075 ± 0.121
0.851HisPhe: 0.851 ± 0.095
1.536HisGly: 1.536 ± 0.142
0.437HisHis: 0.437 ± 0.088
1.217HisIle: 1.217 ± 0.121
1.028HisLys: 1.028 ± 0.141
1.477HisLeu: 1.477 ± 0.142
0.366HisMet: 0.366 ± 0.064
0.992HisAsn: 0.992 ± 0.115
0.803HisPro: 0.803 ± 0.11
0.567HisGln: 0.567 ± 0.084
0.981HisArg: 0.981 ± 0.125
1.276HisSer: 1.276 ± 0.141
1.241HisThr: 1.241 ± 0.136
1.241HisVal: 1.241 ± 0.141
0.236HisTrp: 0.236 ± 0.046
0.851HisTyr: 0.851 ± 0.1
0.0HisXaa: 0.0 ± 0.0
Ile
4.525IleAla: 4.525 ± 0.2
0.52IleCys: 0.52 ± 0.075
4.49IleAsp: 4.49 ± 0.294
3.521IleGlu: 3.521 ± 0.222
2.162IlePhe: 2.162 ± 0.15
3.816IleGly: 3.816 ± 0.174
1.311IleHis: 1.311 ± 0.151
3.864IleIle: 3.864 ± 0.224
4.088IleLys: 4.088 ± 0.237
4.254IleLeu: 4.254 ± 0.251
1.501IleMet: 1.501 ± 0.165
4.041IleAsn: 4.041 ± 0.216
3.497IlePro: 3.497 ± 0.18
2.068IleGln: 2.068 ± 0.158
3.356IleArg: 3.356 ± 0.208
4.608IleSer: 4.608 ± 0.332
5.246IleThr: 5.246 ± 0.418
4.336IleVal: 4.336 ± 0.256
0.638IleTrp: 0.638 ± 0.09
1.76IleTyr: 1.76 ± 0.132
0.0IleXaa: 0.0 ± 0.0
Lys
3.934LysAla: 3.934 ± 0.287
0.614LysCys: 0.614 ± 0.09
3.45LysAsp: 3.45 ± 0.285
3.923LysGlu: 3.923 ± 0.271
3.119LysPhe: 3.119 ± 0.222
3.226LysGly: 3.226 ± 0.255
1.406LysHis: 1.406 ± 0.17
3.958LysIle: 3.958 ± 0.248
4.041LysLys: 4.041 ± 0.36
5.057LysLeu: 5.057 ± 0.302
2.056LysMet: 2.056 ± 0.2
3.048LysAsn: 3.048 ± 0.193
2.139LysPro: 2.139 ± 0.188
1.973LysGln: 1.973 ± 0.161
2.564LysArg: 2.564 ± 0.216
4.561LysSer: 4.561 ± 0.265
3.58LysThr: 3.58 ± 0.218
4.041LysVal: 4.041 ± 0.222
1.122LysTrp: 1.122 ± 0.127
2.599LysTyr: 2.599 ± 0.227
0.0LysXaa: 0.0 ± 0.0
Leu
5.518LeuAla: 5.518 ± 0.287
0.614LeuCys: 0.614 ± 0.088
5.293LeuAsp: 5.293 ± 0.317
4.75LeuGlu: 4.75 ± 0.268
2.918LeuPhe: 2.918 ± 0.172
3.946LeuGly: 3.946 ± 0.248
1.619LeuHis: 1.619 ± 0.133
4.124LeuIle: 4.124 ± 0.255
5.258LeuLys: 5.258 ± 0.306
5.73LeuLeu: 5.73 ± 0.329
1.69LeuMet: 1.69 ± 0.149
5.27LeuAsn: 5.27 ± 0.266
3.521LeuPro: 3.521 ± 0.221
2.765LeuGln: 2.765 ± 0.185
3.639LeuArg: 3.639 ± 0.193
5.624LeuSer: 5.624 ± 0.298
5.506LeuThr: 5.506 ± 0.421
4.951LeuVal: 4.951 ± 0.255
0.933LeuTrp: 0.933 ± 0.116
2.67LeuTyr: 2.67 ± 0.231
0.0LeuXaa: 0.0 ± 0.0
Met
1.82MetAla: 1.82 ± 0.164
0.213MetCys: 0.213 ± 0.045
1.619MetAsp: 1.619 ± 0.152
1.347MetGlu: 1.347 ± 0.13
1.158MetPhe: 1.158 ± 0.146
1.288MetGly: 1.288 ± 0.143
0.508MetHis: 0.508 ± 0.078
1.193MetIle: 1.193 ± 0.108
1.406MetLys: 1.406 ± 0.128
1.938MetLeu: 1.938 ± 0.173
0.473MetMet: 0.473 ± 0.075
1.501MetAsn: 1.501 ± 0.133
0.863MetPro: 0.863 ± 0.104
0.91MetGln: 0.91 ± 0.099
1.182MetArg: 1.182 ± 0.122
2.257MetSer: 2.257 ± 0.18
1.914MetThr: 1.914 ± 0.15
1.56MetVal: 1.56 ± 0.136
0.224MetTrp: 0.224 ± 0.052
0.969MetTyr: 0.969 ± 0.116
0.0MetXaa: 0.0 ± 0.0
Asn
3.828AsnAla: 3.828 ± 0.26
0.473AsnCys: 0.473 ± 0.078
3.698AsnAsp: 3.698 ± 0.22
3.072AsnGlu: 3.072 ± 0.186
1.997AsnPhe: 1.997 ± 0.162
4.667AsnGly: 4.667 ± 0.246
1.087AsnHis: 1.087 ± 0.105
3.946AsnIle: 3.946 ± 0.262
3.19AsnLys: 3.19 ± 0.19
4.112AsnLeu: 4.112 ± 0.247
1.217AsnMet: 1.217 ± 0.137
3.296AsnAsn: 3.296 ± 0.232
2.93AsnPro: 2.93 ± 0.228
1.772AsnGln: 1.772 ± 0.156
2.706AsnArg: 2.706 ± 0.179
4.1AsnSer: 4.1 ± 0.271
3.686AsnThr: 3.686 ± 0.303
4.336AsnVal: 4.336 ± 0.219
1.016AsnTrp: 1.016 ± 0.118
2.41AsnTyr: 2.41 ± 0.164
0.0AsnXaa: 0.0 ± 0.0
Pro
2.753ProAla: 2.753 ± 0.253
0.284ProCys: 0.284 ± 0.059
2.375ProAsp: 2.375 ± 0.163
2.942ProGlu: 2.942 ± 0.186
1.654ProPhe: 1.654 ± 0.137
2.127ProGly: 2.127 ± 0.158
0.78ProHis: 0.78 ± 0.092
2.351ProIle: 2.351 ± 0.148
2.753ProLys: 2.753 ± 0.2
2.847ProLeu: 2.847 ± 0.177
0.874ProMet: 0.874 ± 0.103
2.209ProAsn: 2.209 ± 0.22
1.512ProPro: 1.512 ± 0.181
1.619ProGln: 1.619 ± 0.141
1.465ProArg: 1.465 ± 0.14
3.97ProSer: 3.97 ± 0.432
6.912ProThr: 6.912 ± 1.438
3.107ProVal: 3.107 ± 0.24
0.52ProTrp: 0.52 ± 0.098
1.69ProTyr: 1.69 ± 0.156
0.0ProXaa: 0.0 ± 0.0
Gln
2.458GlnAla: 2.458 ± 0.181
0.248GlnCys: 0.248 ± 0.054
1.631GlnAsp: 1.631 ± 0.126
2.139GlnGlu: 2.139 ± 0.16
1.607GlnPhe: 1.607 ± 0.138
1.642GlnGly: 1.642 ± 0.155
0.591GlnHis: 0.591 ± 0.088
2.233GlnIle: 2.233 ± 0.165
1.796GlnLys: 1.796 ± 0.153
3.178GlnLeu: 3.178 ± 0.233
1.004GlnMet: 1.004 ± 0.113
1.796GlnAsn: 1.796 ± 0.157
1.087GlnPro: 1.087 ± 0.126
1.749GlnGln: 1.749 ± 0.163
1.311GlnArg: 1.311 ± 0.131
2.269GlnSer: 2.269 ± 0.147
2.079GlnThr: 2.079 ± 0.179
2.233GlnVal: 2.233 ± 0.143
0.555GlnTrp: 0.555 ± 0.096
1.737GlnTyr: 1.737 ± 0.17
0.0GlnXaa: 0.0 ± 0.0
Arg
2.777ArgAla: 2.777 ± 0.183
0.414ArgCys: 0.414 ± 0.072
2.599ArgAsp: 2.599 ± 0.171
2.647ArgGlu: 2.647 ± 0.176
1.89ArgPhe: 1.89 ± 0.167
2.576ArgGly: 2.576 ± 0.17
0.863ArgHis: 0.863 ± 0.105
3.001ArgIle: 3.001 ± 0.182
3.545ArgLys: 3.545 ± 0.284
3.686ArgLeu: 3.686 ± 0.216
1.063ArgMet: 1.063 ± 0.14
2.647ArgAsn: 2.647 ± 0.159
1.642ArgPro: 1.642 ± 0.128
1.772ArgGln: 1.772 ± 0.149
2.091ArgArg: 2.091 ± 0.202
2.765ArgSer: 2.765 ± 0.176
3.308ArgThr: 3.308 ± 0.263
2.942ArgVal: 2.942 ± 0.214
0.638ArgTrp: 0.638 ± 0.072
1.867ArgTyr: 1.867 ± 0.155
0.0ArgXaa: 0.0 ± 0.0
Ser
4.903SerAla: 4.903 ± 0.335
0.626SerCys: 0.626 ± 0.091
4.49SerAsp: 4.49 ± 0.244
3.899SerGlu: 3.899 ± 0.273
3.155SerPhe: 3.155 ± 0.203
4.927SerGly: 4.927 ± 0.281
1.17SerHis: 1.17 ± 0.124
5.128SerIle: 5.128 ± 0.262
3.852SerLys: 3.852 ± 0.259
5.719SerLeu: 5.719 ± 0.272
1.571SerMet: 1.571 ± 0.142
4.443SerAsn: 4.443 ± 0.283
3.474SerPro: 3.474 ± 0.356
2.198SerGln: 2.198 ± 0.166
3.143SerArg: 3.143 ± 0.174
5.423SerSer: 5.423 ± 0.353
5.447SerThr: 5.447 ± 0.395
5.837SerVal: 5.837 ± 0.401
0.744SerTrp: 0.744 ± 0.095
2.694SerTyr: 2.694 ± 0.182
0.0SerXaa: 0.0 ± 0.0
Thr
4.951ThrAla: 4.951 ± 0.468
0.567ThrCys: 0.567 ± 0.089
4.029ThrAsp: 4.029 ± 0.215
3.875ThrGlu: 3.875 ± 0.219
2.576ThrPhe: 2.576 ± 0.198
4.513ThrGly: 4.513 ± 0.274
1.099ThrHis: 1.099 ± 0.123
5.033ThrIle: 5.033 ± 0.365
3.627ThrLys: 3.627 ± 0.215
5.459ThrLeu: 5.459 ± 0.317
1.583ThrMet: 1.583 ± 0.157
3.793ThrAsn: 3.793 ± 0.242
8.448ThrPro: 8.448 ± 1.831
2.611ThrGln: 2.611 ± 0.16
3.155ThrArg: 3.155 ± 0.26
4.998ThrSer: 4.998 ± 0.449
5.022ThrThr: 5.022 ± 0.453
6.416ThrVal: 6.416 ± 0.718
0.957ThrTrp: 0.957 ± 0.117
2.611ThrTyr: 2.611 ± 0.19
0.0ThrXaa: 0.0 ± 0.0
Val
4.832ValAla: 4.832 ± 0.283
0.532ValCys: 0.532 ± 0.089
4.407ValAsp: 4.407 ± 0.215
4.395ValGlu: 4.395 ± 0.262
2.493ValPhe: 2.493 ± 0.164
4.372ValGly: 4.372 ± 0.265
1.276ValHis: 1.276 ± 0.138
3.958ValIle: 3.958 ± 0.24
4.324ValLys: 4.324 ± 0.256
4.998ValLeu: 4.998 ± 0.305
1.666ValMet: 1.666 ± 0.137
3.982ValAsn: 3.982 ± 0.269
2.918ValPro: 2.918 ± 0.192
2.009ValGln: 2.009 ± 0.177
2.966ValArg: 2.966 ± 0.185
5.6ValSer: 5.6 ± 0.375
6.877ValThr: 6.877 ± 1.061
5.435ValVal: 5.435 ± 0.287
0.874ValTrp: 0.874 ± 0.1
2.942ValTyr: 2.942 ± 0.196
0.0ValXaa: 0.0 ± 0.0
Trp
0.863TrpAla: 0.863 ± 0.094
0.236TrpCys: 0.236 ± 0.054
1.028TrpAsp: 1.028 ± 0.123
0.803TrpGlu: 0.803 ± 0.096
0.614TrpPhe: 0.614 ± 0.085
0.792TrpGly: 0.792 ± 0.116
0.366TrpHis: 0.366 ± 0.068
0.768TrpIle: 0.768 ± 0.101
0.768TrpLys: 0.768 ± 0.114
0.922TrpLeu: 0.922 ± 0.114
0.319TrpMet: 0.319 ± 0.062
1.04TrpAsn: 1.04 ± 0.121
0.26TrpPro: 0.26 ± 0.056
0.532TrpGln: 0.532 ± 0.078
0.591TrpArg: 0.591 ± 0.093
1.099TrpSer: 1.099 ± 0.166
0.815TrpThr: 0.815 ± 0.118
0.922TrpVal: 0.922 ± 0.114
0.213TrpTrp: 0.213 ± 0.05
0.78TrpTyr: 0.78 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.54TyrAla: 2.54 ± 0.187
0.508TyrCys: 0.508 ± 0.078
3.037TyrAsp: 3.037 ± 0.227
2.091TyrGlu: 2.091 ± 0.186
1.69TyrPhe: 1.69 ± 0.162
2.682TyrGly: 2.682 ± 0.179
0.803TyrHis: 0.803 ± 0.104
2.576TyrIle: 2.576 ± 0.153
2.209TyrLys: 2.209 ± 0.175
2.847TyrLeu: 2.847 ± 0.214
0.981TyrMet: 0.981 ± 0.111
2.883TyrAsn: 2.883 ± 0.214
1.548TyrPro: 1.548 ± 0.155
1.347TyrGln: 1.347 ± 0.15
2.198TyrArg: 2.198 ± 0.141
2.883TyrSer: 2.883 ± 0.225
2.434TyrThr: 2.434 ± 0.167
2.812TyrVal: 2.812 ± 0.214
0.603TyrTrp: 0.603 ± 0.093
1.914TyrTyr: 1.914 ± 0.167
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 401 proteins (84637 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski