Amino acid dipepetide frequency for Klebsiella phage KMI7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.194AlaAla: 4.194 ± 0.334
0.561AlaCys: 0.561 ± 0.094
3.999AlaAsp: 3.999 ± 0.348
4.998AlaGlu: 4.998 ± 0.403
2.731AlaPhe: 2.731 ± 0.254
4.145AlaGly: 4.145 ± 0.429
1.048AlaHis: 1.048 ± 0.159
5.047AlaIle: 5.047 ± 0.406
5.364AlaLys: 5.364 ± 0.433
5.486AlaLeu: 5.486 ± 0.396
2.365AlaMet: 2.365 ± 0.289
3.682AlaAsn: 3.682 ± 0.274
2.17AlaPro: 2.17 ± 0.233
2.17AlaGln: 2.17 ± 0.238
3.292AlaArg: 3.292 ± 0.335
3.73AlaSer: 3.73 ± 0.355
4.584AlaThr: 4.584 ± 0.601
4.828AlaVal: 4.828 ± 0.345
0.902AlaTrp: 0.902 ± 0.159
3.097AlaTyr: 3.097 ± 0.276
0.0AlaXaa: 0.0 ± 0.0
Cys
0.78CysAla: 0.78 ± 0.144
0.073CysCys: 0.073 ± 0.042
0.756CysAsp: 0.756 ± 0.142
0.878CysGlu: 0.878 ± 0.162
0.561CysPhe: 0.561 ± 0.107
0.951CysGly: 0.951 ± 0.171
0.171CysHis: 0.171 ± 0.058
0.561CysIle: 0.561 ± 0.118
1.0CysLys: 1.0 ± 0.169
0.731CysLeu: 0.731 ± 0.118
0.366CysMet: 0.366 ± 0.115
0.658CysAsn: 0.658 ± 0.133
0.683CysPro: 0.683 ± 0.149
0.244CysGln: 0.244 ± 0.082
0.512CysArg: 0.512 ± 0.102
0.707CysSer: 0.707 ± 0.121
0.683CysThr: 0.683 ± 0.112
0.585CysVal: 0.585 ± 0.126
0.098CysTrp: 0.098 ± 0.048
0.366CysTyr: 0.366 ± 0.112
0.0CysXaa: 0.0 ± 0.0
Asp
4.486AspAla: 4.486 ± 0.384
0.707AspCys: 0.707 ± 0.131
3.73AspAsp: 3.73 ± 0.335
5.047AspGlu: 5.047 ± 0.352
3.121AspPhe: 3.121 ± 0.304
4.803AspGly: 4.803 ± 0.354
0.902AspHis: 0.902 ± 0.132
4.462AspIle: 4.462 ± 0.292
4.096AspLys: 4.096 ± 0.325
5.071AspLeu: 5.071 ± 0.356
1.999AspMet: 1.999 ± 0.217
3.267AspAsn: 3.267 ± 0.264
3.023AspPro: 3.023 ± 0.267
1.56AspGln: 1.56 ± 0.193
2.95AspArg: 2.95 ± 0.245
4.047AspSer: 4.047 ± 0.312
3.974AspThr: 3.974 ± 0.329
3.901AspVal: 3.901 ± 0.357
1.341AspTrp: 1.341 ± 0.187
3.487AspTyr: 3.487 ± 0.285
0.0AspXaa: 0.0 ± 0.0
Glu
4.754GluAla: 4.754 ± 0.451
0.902GluCys: 0.902 ± 0.168
3.877GluAsp: 3.877 ± 0.358
4.95GluGlu: 4.95 ± 0.425
2.804GluPhe: 2.804 ± 0.278
3.633GluGly: 3.633 ± 0.325
1.707GluHis: 1.707 ± 0.201
5.462GluIle: 5.462 ± 0.34
5.291GluLys: 5.291 ± 0.37
6.315GluLeu: 6.315 ± 0.417
2.097GluMet: 2.097 ± 0.245
3.365GluAsn: 3.365 ± 0.281
1.268GluPro: 1.268 ± 0.161
2.121GluGln: 2.121 ± 0.244
3.462GluArg: 3.462 ± 0.319
4.145GluSer: 4.145 ± 0.281
4.364GluThr: 4.364 ± 0.333
4.023GluVal: 4.023 ± 0.296
1.024GluTrp: 1.024 ± 0.144
2.706GluTyr: 2.706 ± 0.291
0.0GluXaa: 0.0 ± 0.0
Phe
2.389PheAla: 2.389 ± 0.267
0.561PheCys: 0.561 ± 0.121
3.633PheAsp: 3.633 ± 0.313
3.097PheGlu: 3.097 ± 0.333
1.243PhePhe: 1.243 ± 0.165
3.145PheGly: 3.145 ± 0.277
0.561PheHis: 0.561 ± 0.139
2.658PheIle: 2.658 ± 0.267
3.048PheLys: 3.048 ± 0.304
2.511PheLeu: 2.511 ± 0.269
1.146PheMet: 1.146 ± 0.17
2.365PheAsn: 2.365 ± 0.238
1.39PhePro: 1.39 ± 0.187
1.146PheGln: 1.146 ± 0.159
1.755PheArg: 1.755 ± 0.177
2.097PheSer: 2.097 ± 0.226
2.877PheThr: 2.877 ± 0.252
3.048PheVal: 3.048 ± 0.286
0.61PheTrp: 0.61 ± 0.112
1.731PheTyr: 1.731 ± 0.207
0.0PheXaa: 0.0 ± 0.0
Gly
4.145GlyAla: 4.145 ± 0.424
1.024GlyCys: 1.024 ± 0.164
4.559GlyAsp: 4.559 ± 0.406
3.95GlyGlu: 3.95 ± 0.316
3.097GlyPhe: 3.097 ± 0.301
3.901GlyGly: 3.901 ± 0.542
0.951GlyHis: 0.951 ± 0.146
4.169GlyIle: 4.169 ± 0.247
4.779GlyLys: 4.779 ± 0.345
4.242GlyLeu: 4.242 ± 0.285
1.829GlyMet: 1.829 ± 0.219
4.121GlyAsn: 4.121 ± 0.421
0.902GlyPro: 0.902 ± 0.141
1.731GlyGln: 1.731 ± 0.238
3.34GlyArg: 3.34 ± 0.338
4.803GlySer: 4.803 ± 0.409
4.072GlyThr: 4.072 ± 0.556
5.291GlyVal: 5.291 ± 0.326
0.902GlyTrp: 0.902 ± 0.153
3.365GlyTyr: 3.365 ± 0.247
0.0GlyXaa: 0.0 ± 0.0
His
1.243HisAla: 1.243 ± 0.178
0.244HisCys: 0.244 ± 0.078
1.17HisAsp: 1.17 ± 0.2
1.146HisGlu: 1.146 ± 0.169
0.829HisPhe: 0.829 ± 0.136
0.902HisGly: 0.902 ± 0.169
0.341HisHis: 0.341 ± 0.114
1.219HisIle: 1.219 ± 0.158
0.927HisLys: 0.927 ± 0.172
1.365HisLeu: 1.365 ± 0.219
0.366HisMet: 0.366 ± 0.083
0.829HisAsn: 0.829 ± 0.144
1.097HisPro: 1.097 ± 0.134
0.512HisGln: 0.512 ± 0.117
0.78HisArg: 0.78 ± 0.161
1.097HisSer: 1.097 ± 0.154
0.731HisThr: 0.731 ± 0.132
1.17HisVal: 1.17 ± 0.165
0.268HisTrp: 0.268 ± 0.078
0.78HisTyr: 0.78 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
5.071IleAla: 5.071 ± 0.426
0.658IleCys: 0.658 ± 0.131
5.291IleAsp: 5.291 ± 0.406
4.559IleGlu: 4.559 ± 0.389
2.121IlePhe: 2.121 ± 0.254
3.852IleGly: 3.852 ± 0.328
1.024IleHis: 1.024 ± 0.18
3.779IleIle: 3.779 ± 0.321
4.852IleLys: 4.852 ± 0.365
3.877IleLeu: 3.877 ± 0.26
1.951IleMet: 1.951 ± 0.261
3.804IleAsn: 3.804 ± 0.254
2.877IlePro: 2.877 ± 0.237
2.389IleGln: 2.389 ± 0.229
3.925IleArg: 3.925 ± 0.274
3.804IleSer: 3.804 ± 0.304
4.681IleThr: 4.681 ± 0.309
4.974IleVal: 4.974 ± 0.323
0.707IleTrp: 0.707 ± 0.135
2.78IleTyr: 2.78 ± 0.24
0.0IleXaa: 0.0 ± 0.0
Lys
6.022LysAla: 6.022 ± 0.397
0.731LysCys: 0.731 ± 0.15
4.194LysAsp: 4.194 ± 0.335
5.047LysGlu: 5.047 ± 0.459
3.17LysPhe: 3.17 ± 0.281
4.194LysGly: 4.194 ± 0.35
1.439LysHis: 1.439 ± 0.196
4.633LysIle: 4.633 ± 0.353
5.096LysLys: 5.096 ± 0.404
5.657LysLeu: 5.657 ± 0.447
2.341LysMet: 2.341 ± 0.259
4.364LysAsn: 4.364 ± 0.304
2.633LysPro: 2.633 ± 0.248
2.633LysGln: 2.633 ± 0.229
3.852LysArg: 3.852 ± 0.381
3.755LysSer: 3.755 ± 0.306
4.633LysThr: 4.633 ± 0.348
4.34LysVal: 4.34 ± 0.299
1.073LysTrp: 1.073 ± 0.154
3.097LysTyr: 3.097 ± 0.274
0.0LysXaa: 0.0 ± 0.0
Leu
5.12LeuAla: 5.12 ± 0.41
0.878LeuCys: 0.878 ± 0.177
5.071LeuAsp: 5.071 ± 0.36
4.852LeuGlu: 4.852 ± 0.422
2.95LeuPhe: 2.95 ± 0.296
4.072LeuGly: 4.072 ± 0.263
1.341LeuHis: 1.341 ± 0.151
4.267LeuIle: 4.267 ± 0.322
5.291LeuLys: 5.291 ± 0.366
4.096LeuLeu: 4.096 ± 0.304
2.78LeuMet: 2.78 ± 0.269
4.316LeuAsn: 4.316 ± 0.344
3.097LeuPro: 3.097 ± 0.235
2.048LeuGln: 2.048 ± 0.202
3.95LeuArg: 3.95 ± 0.246
5.145LeuSer: 5.145 ± 0.356
4.511LeuThr: 4.511 ± 0.361
4.584LeuVal: 4.584 ± 0.352
0.61LeuTrp: 0.61 ± 0.124
3.365LeuTyr: 3.365 ± 0.294
0.0LeuXaa: 0.0 ± 0.0
Met
1.853MetAla: 1.853 ± 0.187
0.268MetCys: 0.268 ± 0.084
2.17MetAsp: 2.17 ± 0.216
1.731MetGlu: 1.731 ± 0.209
1.146MetPhe: 1.146 ± 0.179
1.609MetGly: 1.609 ± 0.182
0.61MetHis: 0.61 ± 0.13
2.121MetIle: 2.121 ± 0.227
2.877MetLys: 2.877 ± 0.343
2.219MetLeu: 2.219 ± 0.234
0.975MetMet: 0.975 ± 0.149
2.316MetAsn: 2.316 ± 0.229
0.829MetPro: 0.829 ± 0.145
1.097MetGln: 1.097 ± 0.197
1.195MetArg: 1.195 ± 0.161
1.999MetSer: 1.999 ± 0.181
1.634MetThr: 1.634 ± 0.191
1.56MetVal: 1.56 ± 0.176
0.293MetTrp: 0.293 ± 0.084
1.292MetTyr: 1.292 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
3.73AsnAla: 3.73 ± 0.357
0.512AsnCys: 0.512 ± 0.115
3.267AsnAsp: 3.267 ± 0.36
4.047AsnGlu: 4.047 ± 0.262
2.194AsnPhe: 2.194 ± 0.272
5.071AsnGly: 5.071 ± 0.378
1.219AsnHis: 1.219 ± 0.201
3.56AsnIle: 3.56 ± 0.308
3.877AsnLys: 3.877 ± 0.33
4.462AsnLeu: 4.462 ± 0.29
1.487AsnMet: 1.487 ± 0.167
2.926AsnAsn: 2.926 ± 0.336
2.609AsnPro: 2.609 ± 0.271
1.877AsnGln: 1.877 ± 0.214
2.682AsnArg: 2.682 ± 0.296
3.682AsnSer: 3.682 ± 0.255
2.901AsnThr: 2.901 ± 0.256
3.999AsnVal: 3.999 ± 0.321
0.317AsnTrp: 0.317 ± 0.091
1.877AsnTyr: 1.877 ± 0.234
0.0AsnXaa: 0.0 ± 0.0
Pro
2.219ProAla: 2.219 ± 0.265
0.414ProCys: 0.414 ± 0.101
3.34ProAsp: 3.34 ± 0.295
2.584ProGlu: 2.584 ± 0.244
1.487ProPhe: 1.487 ± 0.177
2.365ProGly: 2.365 ± 0.229
0.488ProHis: 0.488 ± 0.111
1.926ProIle: 1.926 ± 0.218
2.755ProLys: 2.755 ± 0.256
2.316ProLeu: 2.316 ± 0.254
0.61ProMet: 0.61 ± 0.118
1.731ProAsn: 1.731 ± 0.219
1.048ProPro: 1.048 ± 0.166
0.902ProGln: 0.902 ± 0.127
1.487ProArg: 1.487 ± 0.186
2.292ProSer: 2.292 ± 0.214
2.194ProThr: 2.194 ± 0.292
2.95ProVal: 2.95 ± 0.236
0.488ProTrp: 0.488 ± 0.102
1.755ProTyr: 1.755 ± 0.182
0.0ProXaa: 0.0 ± 0.0
Gln
2.389GlnAla: 2.389 ± 0.278
0.268GlnCys: 0.268 ± 0.072
1.609GlnAsp: 1.609 ± 0.172
2.048GlnGlu: 2.048 ± 0.198
1.195GlnPhe: 1.195 ± 0.148
1.609GlnGly: 1.609 ± 0.175
0.439GlnHis: 0.439 ± 0.102
2.511GlnIle: 2.511 ± 0.296
2.487GlnLys: 2.487 ± 0.274
2.658GlnLeu: 2.658 ± 0.258
1.122GlnMet: 1.122 ± 0.162
1.707GlnAsn: 1.707 ± 0.204
0.805GlnPro: 0.805 ± 0.111
1.512GlnGln: 1.512 ± 0.192
1.56GlnArg: 1.56 ± 0.209
1.804GlnSer: 1.804 ± 0.267
1.853GlnThr: 1.853 ± 0.197
2.268GlnVal: 2.268 ± 0.221
0.414GlnTrp: 0.414 ± 0.094
1.414GlnTyr: 1.414 ± 0.159
0.0GlnXaa: 0.0 ± 0.0
Arg
2.901ArgAla: 2.901 ± 0.256
0.512ArgCys: 0.512 ± 0.103
2.682ArgAsp: 2.682 ± 0.247
3.267ArgGlu: 3.267 ± 0.33
1.853ArgPhe: 1.853 ± 0.229
2.999ArgGly: 2.999 ± 0.256
0.829ArgHis: 0.829 ± 0.154
3.609ArgIle: 3.609 ± 0.275
3.365ArgLys: 3.365 ± 0.332
3.779ArgLeu: 3.779 ± 0.274
1.536ArgMet: 1.536 ± 0.183
3.145ArgAsn: 3.145 ± 0.256
1.365ArgPro: 1.365 ± 0.18
1.536ArgGln: 1.536 ± 0.181
2.268ArgArg: 2.268 ± 0.248
2.926ArgSer: 2.926 ± 0.234
2.584ArgThr: 2.584 ± 0.242
3.609ArgVal: 3.609 ± 0.35
1.122ArgTrp: 1.122 ± 0.163
2.146ArgTyr: 2.146 ± 0.219
0.0ArgXaa: 0.0 ± 0.0
Ser
3.925SerAla: 3.925 ± 0.343
0.585SerCys: 0.585 ± 0.106
4.267SerAsp: 4.267 ± 0.256
3.584SerGlu: 3.584 ± 0.342
2.731SerPhe: 2.731 ± 0.215
5.315SerGly: 5.315 ± 0.536
1.048SerHis: 1.048 ± 0.156
4.413SerIle: 4.413 ± 0.343
4.072SerLys: 4.072 ± 0.285
4.169SerLeu: 4.169 ± 0.397
1.877SerMet: 1.877 ± 0.259
3.048SerAsn: 3.048 ± 0.263
2.536SerPro: 2.536 ± 0.25
1.951SerGln: 1.951 ± 0.209
2.78SerArg: 2.78 ± 0.28
3.609SerSer: 3.609 ± 0.398
3.438SerThr: 3.438 ± 0.371
4.267SerVal: 4.267 ± 0.345
0.829SerTrp: 0.829 ± 0.142
2.511SerTyr: 2.511 ± 0.248
0.0SerXaa: 0.0 ± 0.0
Thr
4.438ThrAla: 4.438 ± 0.436
0.488ThrCys: 0.488 ± 0.119
3.438ThrAsp: 3.438 ± 0.334
3.901ThrGlu: 3.901 ± 0.296
2.463ThrPhe: 2.463 ± 0.232
4.95ThrGly: 4.95 ± 0.496
1.097ThrHis: 1.097 ± 0.161
4.291ThrIle: 4.291 ± 0.283
3.925ThrLys: 3.925 ± 0.275
5.169ThrLeu: 5.169 ± 0.439
1.317ThrMet: 1.317 ± 0.164
3.218ThrAsn: 3.218 ± 0.382
2.682ThrPro: 2.682 ± 0.269
2.121ThrGln: 2.121 ± 0.262
2.901ThrArg: 2.901 ± 0.262
3.609ThrSer: 3.609 ± 0.353
3.462ThrThr: 3.462 ± 0.379
4.925ThrVal: 4.925 ± 0.534
0.683ThrTrp: 0.683 ± 0.123
2.755ThrTyr: 2.755 ± 0.248
0.0ThrXaa: 0.0 ± 0.0
Val
4.803ValAla: 4.803 ± 0.382
1.146ValCys: 1.146 ± 0.178
4.852ValAsp: 4.852 ± 0.323
4.998ValGlu: 4.998 ± 0.356
2.95ValPhe: 2.95 ± 0.265
4.218ValGly: 4.218 ± 0.335
0.78ValHis: 0.78 ± 0.142
4.559ValIle: 4.559 ± 0.263
5.486ValLys: 5.486 ± 0.398
4.535ValLeu: 4.535 ± 0.295
1.634ValMet: 1.634 ± 0.201
3.804ValAsn: 3.804 ± 0.348
2.292ValPro: 2.292 ± 0.23
2.072ValGln: 2.072 ± 0.196
2.804ValArg: 2.804 ± 0.274
4.121ValSer: 4.121 ± 0.323
4.657ValThr: 4.657 ± 0.512
4.389ValVal: 4.389 ± 0.375
1.17ValTrp: 1.17 ± 0.182
3.389ValTyr: 3.389 ± 0.282
0.0ValXaa: 0.0 ± 0.0
Trp
0.902TrpAla: 0.902 ± 0.116
0.195TrpCys: 0.195 ± 0.07
0.951TrpAsp: 0.951 ± 0.152
0.634TrpGlu: 0.634 ± 0.133
0.756TrpPhe: 0.756 ± 0.148
0.634TrpGly: 0.634 ± 0.122
0.244TrpHis: 0.244 ± 0.081
0.756TrpIle: 0.756 ± 0.13
1.17TrpLys: 1.17 ± 0.188
0.951TrpLeu: 0.951 ± 0.151
0.634TrpMet: 0.634 ± 0.14
0.683TrpAsn: 0.683 ± 0.115
0.244TrpPro: 0.244 ± 0.067
0.585TrpGln: 0.585 ± 0.107
0.61TrpArg: 0.61 ± 0.141
0.683TrpSer: 0.683 ± 0.134
0.829TrpThr: 0.829 ± 0.133
1.073TrpVal: 1.073 ± 0.154
0.219TrpTrp: 0.219 ± 0.074
0.829TrpTyr: 0.829 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.023TyrAla: 3.023 ± 0.261
0.634TyrCys: 0.634 ± 0.13
3.121TyrAsp: 3.121 ± 0.274
2.95TyrGlu: 2.95 ± 0.278
1.609TyrPhe: 1.609 ± 0.201
2.804TyrGly: 2.804 ± 0.269
0.78TyrHis: 0.78 ± 0.167
2.975TyrIle: 2.975 ± 0.231
3.145TyrLys: 3.145 ± 0.307
2.731TyrLeu: 2.731 ± 0.278
1.341TyrMet: 1.341 ± 0.181
2.975TyrAsn: 2.975 ± 0.245
1.804TyrPro: 1.804 ± 0.209
1.463TyrGln: 1.463 ± 0.206
1.926TyrArg: 1.926 ± 0.22
2.901TyrSer: 2.901 ± 0.276
3.17TyrThr: 3.17 ± 0.319
2.877TyrVal: 2.877 ± 0.252
0.512TyrTrp: 0.512 ± 0.098
1.658TyrTyr: 1.658 ± 0.236
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 187 proteins (41015 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski