Amino acid dipepetide frequency for Pseudomonas phage Skulduggery

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.542AlaAla: 16.542 ± 1.905
0.691AlaCys: 0.691 ± 0.237
7.506AlaAsp: 7.506 ± 0.694
9.53AlaGlu: 9.53 ± 0.893
3.605AlaPhe: 3.605 ± 0.454
10.666AlaGly: 10.666 ± 1.282
1.432AlaHis: 1.432 ± 0.272
4.839AlaIle: 4.839 ± 0.519
6.222AlaLys: 6.222 ± 0.694
9.234AlaLeu: 9.234 ± 0.869
3.654AlaMet: 3.654 ± 0.548
3.457AlaAsn: 3.457 ± 0.445
4.741AlaPro: 4.741 ± 0.621
5.481AlaGln: 5.481 ± 0.529
4.938AlaArg: 4.938 ± 0.52
5.333AlaSer: 5.333 ± 0.475
5.284AlaThr: 5.284 ± 0.601
5.58AlaVal: 5.58 ± 0.739
1.284AlaTrp: 1.284 ± 0.338
3.802AlaTyr: 3.802 ± 0.431
0.0AlaXaa: 0.0 ± 0.0
Cys
0.79CysAla: 0.79 ± 0.207
0.198CysCys: 0.198 ± 0.093
0.691CysAsp: 0.691 ± 0.236
0.444CysGlu: 0.444 ± 0.16
0.198CysPhe: 0.198 ± 0.115
0.593CysGly: 0.593 ± 0.22
0.198CysHis: 0.198 ± 0.111
0.198CysIle: 0.198 ± 0.113
0.593CysLys: 0.593 ± 0.232
0.593CysLeu: 0.593 ± 0.184
0.148CysMet: 0.148 ± 0.094
0.296CysAsn: 0.296 ± 0.122
0.642CysPro: 0.642 ± 0.242
0.444CysGln: 0.444 ± 0.172
0.691CysArg: 0.691 ± 0.211
0.346CysSer: 0.346 ± 0.147
0.444CysThr: 0.444 ± 0.207
0.593CysVal: 0.593 ± 0.215
0.148CysTrp: 0.148 ± 0.108
0.543CysTyr: 0.543 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
7.358AspAla: 7.358 ± 0.752
0.642AspCys: 0.642 ± 0.229
4.247AspAsp: 4.247 ± 0.545
4.197AspGlu: 4.197 ± 0.535
2.321AspPhe: 2.321 ± 0.343
6.963AspGly: 6.963 ± 0.688
0.839AspHis: 0.839 ± 0.259
2.321AspIle: 2.321 ± 0.371
3.259AspLys: 3.259 ± 0.45
4.741AspLeu: 4.741 ± 0.398
1.679AspMet: 1.679 ± 0.301
1.975AspAsn: 1.975 ± 0.287
3.605AspPro: 3.605 ± 0.509
2.271AspGln: 2.271 ± 0.305
2.963AspArg: 2.963 ± 0.472
2.913AspSer: 2.913 ± 0.372
3.308AspThr: 3.308 ± 0.373
3.555AspVal: 3.555 ± 0.411
1.136AspTrp: 1.136 ± 0.202
1.679AspTyr: 1.679 ± 0.252
0.0AspXaa: 0.0 ± 0.0
Glu
7.555GluAla: 7.555 ± 0.891
0.691GluCys: 0.691 ± 0.292
3.308GluAsp: 3.308 ± 0.409
4.148GluGlu: 4.148 ± 0.674
1.975GluPhe: 1.975 ± 0.317
5.086GluGly: 5.086 ± 0.668
0.741GluHis: 0.741 ± 0.168
1.679GluIle: 1.679 ± 0.265
3.111GluLys: 3.111 ± 0.362
7.506GluLeu: 7.506 ± 0.776
1.679GluMet: 1.679 ± 0.283
2.469GluAsn: 2.469 ± 0.428
3.111GluPro: 3.111 ± 0.392
5.185GluGln: 5.185 ± 0.587
5.037GluArg: 5.037 ± 0.538
3.95GluSer: 3.95 ± 0.536
3.704GluThr: 3.704 ± 0.417
3.555GluVal: 3.555 ± 0.421
0.889GluTrp: 0.889 ± 0.234
1.926GluTyr: 1.926 ± 0.354
0.0GluXaa: 0.0 ± 0.0
Phe
3.21PheAla: 3.21 ± 0.361
0.691PheCys: 0.691 ± 0.205
2.617PheAsp: 2.617 ± 0.329
2.123PheGlu: 2.123 ± 0.353
0.889PhePhe: 0.889 ± 0.227
3.111PheGly: 3.111 ± 0.483
0.642PheHis: 0.642 ± 0.226
1.432PheIle: 1.432 ± 0.302
2.123PheLys: 2.123 ± 0.305
1.876PheLeu: 1.876 ± 0.275
1.136PheMet: 1.136 ± 0.265
1.876PheAsn: 1.876 ± 0.243
1.383PhePro: 1.383 ± 0.355
1.235PheGln: 1.235 ± 0.247
1.876PheArg: 1.876 ± 0.384
1.531PheSer: 1.531 ± 0.319
2.617PheThr: 2.617 ± 0.396
2.123PheVal: 2.123 ± 0.332
0.395PheTrp: 0.395 ± 0.167
0.839PheTyr: 0.839 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
8.444GlyAla: 8.444 ± 1.198
0.691GlyCys: 0.691 ± 0.265
5.679GlyAsp: 5.679 ± 0.639
5.284GlyGlu: 5.284 ± 0.532
3.555GlyPhe: 3.555 ± 0.514
7.95GlyGly: 7.95 ± 1.284
1.136GlyHis: 1.136 ± 0.202
4.543GlyIle: 4.543 ± 0.425
6.024GlyLys: 6.024 ± 0.557
5.876GlyLeu: 5.876 ± 0.574
3.16GlyMet: 3.16 ± 0.447
3.21GlyAsn: 3.21 ± 0.478
2.815GlyPro: 2.815 ± 0.493
3.704GlyGln: 3.704 ± 0.399
4.79GlyArg: 4.79 ± 0.61
4.099GlySer: 4.099 ± 0.373
6.765GlyThr: 6.765 ± 0.51
5.037GlyVal: 5.037 ± 0.526
1.481GlyTrp: 1.481 ± 0.332
2.963GlyTyr: 2.963 ± 0.378
0.0GlyXaa: 0.0 ± 0.0
His
1.975HisAla: 1.975 ± 0.351
0.247HisCys: 0.247 ± 0.135
0.988HisAsp: 0.988 ± 0.278
1.037HisGlu: 1.037 ± 0.253
0.741HisPhe: 0.741 ± 0.206
1.728HisGly: 1.728 ± 0.248
0.444HisHis: 0.444 ± 0.172
0.889HisIle: 0.889 ± 0.218
1.136HisLys: 1.136 ± 0.281
1.333HisLeu: 1.333 ± 0.284
0.346HisMet: 0.346 ± 0.184
0.444HisAsn: 0.444 ± 0.176
0.593HisPro: 0.593 ± 0.163
1.136HisGln: 1.136 ± 0.229
1.136HisArg: 1.136 ± 0.224
0.593HisSer: 0.593 ± 0.18
0.691HisThr: 0.691 ± 0.182
1.086HisVal: 1.086 ± 0.247
0.346HisTrp: 0.346 ± 0.134
0.889HisTyr: 0.889 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
5.086IleAla: 5.086 ± 0.526
0.296IleCys: 0.296 ± 0.13
2.617IleAsp: 2.617 ± 0.244
4.148IleGlu: 4.148 ± 0.571
1.136IlePhe: 1.136 ± 0.255
3.111IleGly: 3.111 ± 0.474
0.642IleHis: 0.642 ± 0.165
2.518IleIle: 2.518 ± 0.346
2.37IleLys: 2.37 ± 0.321
2.568IleLeu: 2.568 ± 0.303
0.741IleMet: 0.741 ± 0.153
2.222IleAsn: 2.222 ± 0.31
2.37IlePro: 2.37 ± 0.265
1.876IleGln: 1.876 ± 0.223
2.42IleArg: 2.42 ± 0.354
2.271IleSer: 2.271 ± 0.352
3.802IleThr: 3.802 ± 0.471
3.111IleVal: 3.111 ± 0.354
0.494IleTrp: 0.494 ± 0.178
1.185IleTyr: 1.185 ± 0.289
0.0IleXaa: 0.0 ± 0.0
Lys
6.321LysAla: 6.321 ± 0.708
0.642LysCys: 0.642 ± 0.219
3.259LysAsp: 3.259 ± 0.461
3.802LysGlu: 3.802 ± 0.679
2.025LysPhe: 2.025 ± 0.309
4.938LysGly: 4.938 ± 0.483
1.284LysHis: 1.284 ± 0.255
1.975LysIle: 1.975 ± 0.323
2.469LysLys: 2.469 ± 0.381
4.049LysLeu: 4.049 ± 0.489
1.136LysMet: 1.136 ± 0.278
1.975LysAsn: 1.975 ± 0.292
4.296LysPro: 4.296 ± 0.531
2.765LysGln: 2.765 ± 0.373
3.259LysArg: 3.259 ± 0.391
2.716LysSer: 2.716 ± 0.391
3.16LysThr: 3.16 ± 0.407
3.555LysVal: 3.555 ± 0.464
0.691LysTrp: 0.691 ± 0.171
1.235LysTyr: 1.235 ± 0.238
0.0LysXaa: 0.0 ± 0.0
Leu
9.679LeuAla: 9.679 ± 0.733
0.691LeuCys: 0.691 ± 0.264
5.234LeuAsp: 5.234 ± 0.529
4.938LeuGlu: 4.938 ± 0.54
1.926LeuPhe: 1.926 ± 0.339
6.222LeuGly: 6.222 ± 0.503
1.728LeuHis: 1.728 ± 0.321
2.469LeuIle: 2.469 ± 0.434
5.629LeuLys: 5.629 ± 0.555
4.642LeuLeu: 4.642 ± 0.712
1.827LeuMet: 1.827 ± 0.319
2.815LeuAsn: 2.815 ± 0.384
3.654LeuPro: 3.654 ± 0.357
2.765LeuGln: 2.765 ± 0.408
5.136LeuArg: 5.136 ± 0.593
5.185LeuSer: 5.185 ± 0.47
5.432LeuThr: 5.432 ± 0.676
4.543LeuVal: 4.543 ± 0.701
0.988LeuTrp: 0.988 ± 0.205
1.728LeuTyr: 1.728 ± 0.36
0.0LeuXaa: 0.0 ± 0.0
Met
2.765MetAla: 2.765 ± 0.326
0.198MetCys: 0.198 ± 0.092
1.531MetAsp: 1.531 ± 0.2
1.481MetGlu: 1.481 ± 0.309
1.037MetPhe: 1.037 ± 0.25
2.518MetGly: 2.518 ± 0.453
0.691MetHis: 0.691 ± 0.157
1.284MetIle: 1.284 ± 0.357
1.58MetLys: 1.58 ± 0.3
2.074MetLeu: 2.074 ± 0.314
0.889MetMet: 0.889 ± 0.199
1.136MetAsn: 1.136 ± 0.269
1.63MetPro: 1.63 ± 0.267
0.889MetGln: 0.889 ± 0.215
2.123MetArg: 2.123 ± 0.317
2.568MetSer: 2.568 ± 0.406
1.827MetThr: 1.827 ± 0.314
0.691MetVal: 0.691 ± 0.171
0.296MetTrp: 0.296 ± 0.126
0.691MetTyr: 0.691 ± 0.174
0.0MetXaa: 0.0 ± 0.0
Asn
3.506AsnAla: 3.506 ± 0.465
0.494AsnCys: 0.494 ± 0.167
1.827AsnAsp: 1.827 ± 0.299
2.123AsnGlu: 2.123 ± 0.39
1.136AsnPhe: 1.136 ± 0.283
3.704AsnGly: 3.704 ± 0.421
0.642AsnHis: 0.642 ± 0.195
1.975AsnIle: 1.975 ± 0.292
1.926AsnLys: 1.926 ± 0.264
3.111AsnLeu: 3.111 ± 0.389
1.185AsnMet: 1.185 ± 0.296
1.531AsnAsn: 1.531 ± 0.32
2.815AsnPro: 2.815 ± 0.362
1.333AsnGln: 1.333 ± 0.342
2.222AsnArg: 2.222 ± 0.323
1.58AsnSer: 1.58 ± 0.21
2.667AsnThr: 2.667 ± 0.432
2.321AsnVal: 2.321 ± 0.341
0.395AsnTrp: 0.395 ± 0.142
0.839AsnTyr: 0.839 ± 0.184
0.0AsnXaa: 0.0 ± 0.0
Pro
5.58ProAla: 5.58 ± 0.574
0.247ProCys: 0.247 ± 0.119
3.16ProAsp: 3.16 ± 0.311
4.0ProGlu: 4.0 ± 0.453
2.222ProPhe: 2.222 ± 0.337
4.543ProGly: 4.543 ± 0.56
0.839ProHis: 0.839 ± 0.17
1.926ProIle: 1.926 ± 0.34
2.815ProLys: 2.815 ± 0.506
2.864ProLeu: 2.864 ± 0.334
1.136ProMet: 1.136 ± 0.203
2.271ProAsn: 2.271 ± 0.391
2.222ProPro: 2.222 ± 0.408
1.58ProGln: 1.58 ± 0.368
1.778ProArg: 1.778 ± 0.241
2.617ProSer: 2.617 ± 0.281
2.913ProThr: 2.913 ± 0.385
3.704ProVal: 3.704 ± 0.425
0.395ProTrp: 0.395 ± 0.138
1.185ProTyr: 1.185 ± 0.235
0.0ProXaa: 0.0 ± 0.0
Gln
6.469GlnAla: 6.469 ± 0.572
0.198GlnCys: 0.198 ± 0.103
2.123GlnAsp: 2.123 ± 0.399
2.765GlnGlu: 2.765 ± 0.531
2.123GlnPhe: 2.123 ± 0.346
3.012GlnGly: 3.012 ± 0.307
1.086GlnHis: 1.086 ± 0.233
2.667GlnIle: 2.667 ± 0.352
2.321GlnLys: 2.321 ± 0.359
4.345GlnLeu: 4.345 ± 0.532
1.481GlnMet: 1.481 ± 0.387
1.284GlnAsn: 1.284 ± 0.249
1.926GlnPro: 1.926 ± 0.355
4.099GlnGln: 4.099 ± 0.664
3.802GlnArg: 3.802 ± 0.57
2.123GlnSer: 2.123 ± 0.335
2.271GlnThr: 2.271 ± 0.314
2.271GlnVal: 2.271 ± 0.366
0.839GlnTrp: 0.839 ± 0.197
1.086GlnTyr: 1.086 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
5.876ArgAla: 5.876 ± 0.534
0.494ArgCys: 0.494 ± 0.199
3.95ArgAsp: 3.95 ± 0.511
4.494ArgGlu: 4.494 ± 0.552
2.123ArgPhe: 2.123 ± 0.366
3.901ArgGly: 3.901 ± 0.575
1.086ArgHis: 1.086 ± 0.209
2.617ArgIle: 2.617 ± 0.323
3.062ArgLys: 3.062 ± 0.461
5.58ArgLeu: 5.58 ± 0.418
1.432ArgMet: 1.432 ± 0.275
2.42ArgAsn: 2.42 ± 0.321
1.827ArgPro: 1.827 ± 0.288
3.457ArgGln: 3.457 ± 0.421
3.605ArgArg: 3.605 ± 0.492
2.667ArgSer: 2.667 ± 0.441
3.358ArgThr: 3.358 ± 0.418
3.704ArgVal: 3.704 ± 0.466
0.593ArgTrp: 0.593 ± 0.185
1.679ArgTyr: 1.679 ± 0.277
0.0ArgXaa: 0.0 ± 0.0
Ser
5.531SerAla: 5.531 ± 0.539
0.247SerCys: 0.247 ± 0.129
3.358SerAsp: 3.358 ± 0.417
2.716SerGlu: 2.716 ± 0.301
1.58SerPhe: 1.58 ± 0.322
5.432SerGly: 5.432 ± 0.545
0.889SerHis: 0.889 ± 0.175
2.913SerIle: 2.913 ± 0.469
3.062SerLys: 3.062 ± 0.569
3.457SerLeu: 3.457 ± 0.322
1.679SerMet: 1.679 ± 0.277
1.975SerAsn: 1.975 ± 0.298
2.37SerPro: 2.37 ± 0.42
2.469SerGln: 2.469 ± 0.358
3.012SerArg: 3.012 ± 0.336
2.765SerSer: 2.765 ± 0.389
2.864SerThr: 2.864 ± 0.362
3.21SerVal: 3.21 ± 0.478
0.79SerTrp: 0.79 ± 0.184
1.284SerTyr: 1.284 ± 0.26
0.0SerXaa: 0.0 ± 0.0
Thr
7.901ThrAla: 7.901 ± 0.762
0.247ThrCys: 0.247 ± 0.111
3.753ThrAsp: 3.753 ± 0.499
3.852ThrGlu: 3.852 ± 0.479
2.271ThrPhe: 2.271 ± 0.389
6.123ThrGly: 6.123 ± 0.623
1.58ThrHis: 1.58 ± 0.351
3.654ThrIle: 3.654 ± 0.539
2.864ThrLys: 2.864 ± 0.387
5.333ThrLeu: 5.333 ± 0.597
1.63ThrMet: 1.63 ± 0.29
2.173ThrAsn: 2.173 ± 0.308
3.062ThrPro: 3.062 ± 0.407
2.074ThrGln: 2.074 ± 0.29
2.37ThrArg: 2.37 ± 0.363
2.864ThrSer: 2.864 ± 0.347
2.716ThrThr: 2.716 ± 0.348
3.704ThrVal: 3.704 ± 0.479
0.938ThrTrp: 0.938 ± 0.287
1.481ThrTyr: 1.481 ± 0.253
0.0ThrXaa: 0.0 ± 0.0
Val
5.333ValAla: 5.333 ± 0.474
0.543ValCys: 0.543 ± 0.191
4.0ValAsp: 4.0 ± 0.431
4.345ValGlu: 4.345 ± 0.504
1.432ValPhe: 1.432 ± 0.341
4.494ValGly: 4.494 ± 0.501
1.037ValHis: 1.037 ± 0.203
3.21ValIle: 3.21 ± 0.478
3.062ValLys: 3.062 ± 0.387
4.247ValLeu: 4.247 ± 0.44
1.876ValMet: 1.876 ± 0.319
1.876ValAsn: 1.876 ± 0.325
3.605ValPro: 3.605 ± 0.402
2.716ValGln: 2.716 ± 0.35
3.506ValArg: 3.506 ± 0.533
3.358ValSer: 3.358 ± 0.467
4.444ValThr: 4.444 ± 0.614
4.247ValVal: 4.247 ± 0.627
0.543ValTrp: 0.543 ± 0.148
1.284ValTyr: 1.284 ± 0.285
0.0ValXaa: 0.0 ± 0.0
Trp
1.284TrpAla: 1.284 ± 0.251
0.247TrpCys: 0.247 ± 0.135
0.494TrpAsp: 0.494 ± 0.188
0.346TrpGlu: 0.346 ± 0.145
0.593TrpPhe: 0.593 ± 0.186
0.938TrpGly: 0.938 ± 0.263
0.346TrpHis: 0.346 ± 0.168
0.741TrpIle: 0.741 ± 0.208
0.593TrpLys: 0.593 ± 0.204
1.679TrpLeu: 1.679 ± 0.282
0.395TrpMet: 0.395 ± 0.119
0.593TrpAsn: 0.593 ± 0.149
0.395TrpPro: 0.395 ± 0.174
0.839TrpGln: 0.839 ± 0.178
1.136TrpArg: 1.136 ± 0.354
0.79TrpSer: 0.79 ± 0.232
0.543TrpThr: 0.543 ± 0.191
0.79TrpVal: 0.79 ± 0.208
0.346TrpTrp: 0.346 ± 0.17
0.494TrpTyr: 0.494 ± 0.14
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.617TyrAla: 2.617 ± 0.36
0.395TyrCys: 0.395 ± 0.175
1.63TyrAsp: 1.63 ± 0.245
1.531TyrGlu: 1.531 ± 0.268
0.839TyrPhe: 0.839 ± 0.156
2.173TyrGly: 2.173 ± 0.408
0.494TyrHis: 0.494 ± 0.172
1.136TyrIle: 1.136 ± 0.293
1.333TyrLys: 1.333 ± 0.289
2.173TyrLeu: 2.173 ± 0.33
0.642TyrMet: 0.642 ± 0.175
1.333TyrAsn: 1.333 ± 0.224
0.938TyrPro: 0.938 ± 0.247
1.975TyrGln: 1.975 ± 0.337
2.123TyrArg: 2.123 ± 0.263
1.333TyrSer: 1.333 ± 0.241
1.778TyrThr: 1.778 ± 0.289
1.778TyrVal: 1.778 ± 0.276
0.593TyrTrp: 0.593 ± 0.17
0.444TyrTyr: 0.444 ± 0.13
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (20252 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski