Amino acid dipeptide frequency for Pseudomonas phage VSW-3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
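The comparison against the uniform expectation (1/400 = 0.25% = 2.5‰) can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and the counting convention (overlapping adjacent pairs within each protein, pooled over the proteome) is an assumption, since the page does not state how pairs are counted.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
EXPECTED_PER_MILLE = 1000 / 400     # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and any letter outside the 20 standard ones is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"    # would be marked red in the table
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"   # would be marked blue in the table
    return "expected"


# Toy input, not real phage sequences.
toy_freqs = dipeptide_per_mille(["MAAAK", "MAK"])
```

With the values from the table, `classify(16.092)` (AlaAla) returns `"over"` and `classify(0.881)` (AlaCys) returns `"under"`.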

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information see sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is frequency ± error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 16.092 ± 2.061 | 0.881 ± 0.235 | 6.004 ± 0.612 | 7.605 ± 1.004 | 3.442 ± 0.631 | 8.246 ± 0.781 | 2.001 ± 0.389 | 4.403 ± 0.55 | 6.004 ± 0.774 | 8.966 ± 1.026 | 3.202 ± 0.52 | 4.163 ± 0.515 | 3.442 ± 0.775 | 5.844 ± 0.778 | 6.405 ± 0.851 | 5.524 ± 0.856 | 5.924 ± 0.688 | 7.285 ± 0.76 | 1.041 ± 0.37 | 2.322 ± 0.352 | 0.0 ± 0.0
Cys | 0.881 ± 0.308 | 0.16 ± 0.101 | 0.801 ± 0.281 | 0.4 ± 0.208 | 0.24 ± 0.123 | 0.801 ± 0.277 | 0.56 ± 0.247 | 0.48 ± 0.161 | 0.801 ± 0.226 | 0.721 ± 0.29 | 0.48 ± 0.199 | 0.64 ± 0.251 | 0.56 ± 0.198 | 0.56 ± 0.196 | 0.32 ± 0.161 | 0.16 ± 0.117 | 0.4 ± 0.159 | 0.56 ± 0.204 | 0.16 ± 0.123 | 0.24 ± 0.138 | 0.0 ± 0.0
Asp | 6.405 ± 0.736 | 0.56 ± 0.2 | 3.202 ± 0.653 | 2.802 ± 0.495 | 2.322 ± 0.39 | 5.844 ± 0.928 | 1.361 ± 0.348 | 3.683 ± 0.589 | 3.282 ± 0.389 | 5.204 ± 0.72 | 1.921 ± 0.473 | 2.242 ± 0.473 | 3.042 ± 0.405 | 1.761 ± 0.474 | 3.603 ± 0.801 | 4.083 ± 0.587 | 2.722 ± 0.485 | 4.884 ± 0.568 | 0.801 ± 0.306 | 1.841 ± 0.307 | 0.0 ± 0.0
Glu | 6.725 ± 0.844 | 0.64 ± 0.265 | 2.962 ± 0.551 | 3.923 ± 0.566 | 2.322 ± 0.517 | 4.323 ± 0.605 | 0.64 ± 0.243 | 2.962 ± 0.561 | 2.562 ± 0.439 | 5.284 ± 0.668 | 1.841 ± 0.416 | 2.162 ± 0.39 | 2.001 ± 0.323 | 3.523 ± 0.44 | 3.523 ± 0.71 | 3.202 ± 0.626 | 2.322 ± 0.338 | 4.243 ± 0.591 | 0.881 ± 0.276 | 2.242 ± 0.356 | 0.0 ± 0.0
Phe | 3.362 ± 0.353 | 0.24 ± 0.179 | 2.962 ± 0.395 | 1.681 ± 0.293 | 0.881 ± 0.315 | 2.962 ± 0.619 | 0.4 ± 0.18 | 1.841 ± 0.397 | 1.361 ± 0.451 | 1.921 ± 0.329 | 0.961 ± 0.283 | 1.281 ± 0.28 | 1.441 ± 0.339 | 1.121 ± 0.258 | 1.681 ± 0.26 | 1.841 ± 0.325 | 2.001 ± 0.321 | 2.162 ± 0.55 | 0.32 ± 0.173 | 1.041 ± 0.335 | 0.0 ± 0.0
Gly | 7.045 ± 0.582 | 0.881 ± 0.32 | 6.004 ± 0.654 | 4.163 ± 0.48 | 2.562 ± 0.438 | 7.846 ± 0.966 | 1.521 ± 0.427 | 3.282 ± 0.503 | 5.924 ± 0.848 | 5.684 ± 0.712 | 3.202 ± 0.549 | 3.362 ± 0.624 | 2.402 ± 0.514 | 3.362 ± 0.624 | 3.763 ± 0.588 | 4.643 ± 0.626 | 5.524 ± 0.609 | 5.524 ± 0.655 | 1.841 ± 0.323 | 2.162 ± 0.362 | 0.0 ± 0.0
His | 2.081 ± 0.539 | 0.56 ± 0.265 | 1.521 ± 0.392 | 0.4 ± 0.176 | 0.32 ± 0.136 | 1.841 ± 0.364 | 0.801 ± 0.283 | 0.64 ± 0.2 | 1.281 ± 0.326 | 1.761 ± 0.386 | 0.56 ± 0.236 | 0.881 ± 0.19 | 0.801 ± 0.296 | 0.64 ± 0.205 | 1.441 ± 0.363 | 1.121 ± 0.241 | 1.441 ± 0.289 | 1.601 ± 0.339 | 0.24 ± 0.125 | 0.64 ± 0.198 | 0.0 ± 0.0
Ile | 4.563 ± 0.528 | 0.64 ± 0.208 | 3.442 ± 0.432 | 3.282 ± 0.519 | 1.041 ± 0.246 | 3.523 ± 0.328 | 1.201 ± 0.299 | 3.202 ± 0.697 | 2.642 ± 0.502 | 3.683 ± 0.718 | 0.64 ± 0.269 | 2.722 ± 0.36 | 2.322 ± 0.301 | 2.162 ± 0.512 | 2.962 ± 0.533 | 2.642 ± 0.434 | 3.362 ± 0.652 | 3.282 ± 0.612 | 0.4 ± 0.19 | 1.201 ± 0.374 | 0.0 ± 0.0
Lys | 6.004 ± 0.886 | 0.32 ± 0.193 | 3.763 ± 0.595 | 3.523 ± 0.55 | 2.162 ± 0.45 | 3.763 ± 0.728 | 1.281 ± 0.436 | 2.642 ± 0.48 | 3.442 ± 0.826 | 5.764 ± 0.615 | 1.841 ± 0.475 | 1.601 ± 0.336 | 2.722 ± 0.486 | 2.962 ± 0.59 | 2.962 ± 0.554 | 2.722 ± 0.436 | 3.282 ± 0.429 | 3.362 ± 0.765 | 0.721 ± 0.191 | 1.681 ± 0.38 | 0.0 ± 0.0
Leu | 9.127 ± 1.041 | 1.521 ± 0.478 | 5.124 ± 0.532 | 4.323 ± 0.586 | 1.761 ± 0.39 | 5.924 ± 0.638 | 1.601 ± 0.302 | 4.083 ± 0.717 | 3.923 ± 0.541 | 6.405 ± 0.707 | 2.722 ± 0.461 | 3.442 ± 0.549 | 3.282 ± 0.479 | 2.962 ± 0.57 | 4.483 ± 0.591 | 6.004 ± 0.602 | 5.684 ± 1.193 | 5.444 ± 0.728 | 0.961 ± 0.335 | 2.482 ± 0.454 | 0.0 ± 0.0
Met | 3.362 ± 0.645 | 0.16 ± 0.104 | 1.921 ± 0.313 | 0.961 ± 0.261 | 0.801 ± 0.221 | 2.962 ± 0.5 | 0.56 ± 0.19 | 1.281 ± 0.255 | 1.841 ± 0.361 | 1.921 ± 0.399 | 0.881 ± 0.261 | 0.721 ± 0.242 | 1.521 ± 0.277 | 1.921 ± 0.414 | 3.122 ± 0.467 | 2.482 ± 0.332 | 1.681 ± 0.435 | 1.521 ± 0.38 | 0.4 ± 0.217 | 0.801 ± 0.275 | 0.0 ± 0.0
Asn | 3.603 ± 0.535 | 0.16 ± 0.099 | 1.921 ± 0.424 | 2.322 ± 0.38 | 0.881 ± 0.214 | 4.083 ± 0.607 | 0.801 ± 0.298 | 2.642 ± 0.64 | 2.642 ± 0.5 | 3.202 ± 0.605 | 0.801 ± 0.203 | 1.761 ± 0.495 | 1.921 ± 0.274 | 1.761 ± 0.367 | 2.162 ± 0.34 | 2.402 ± 0.461 | 2.402 ± 0.323 | 3.362 ± 0.495 | 0.48 ± 0.195 | 0.881 ± 0.285 | 0.0 ± 0.0
Pro | 4.483 ± 0.764 | 0.48 ± 0.24 | 3.202 ± 0.492 | 3.202 ± 0.461 | 1.281 ± 0.238 | 2.962 ± 0.463 | 1.121 ± 0.334 | 1.681 ± 0.376 | 2.802 ± 0.569 | 3.122 ± 0.384 | 1.441 ± 0.369 | 1.761 ± 0.233 | 1.041 ± 0.313 | 1.441 ± 0.361 | 1.281 ± 0.302 | 2.482 ± 0.323 | 2.722 ± 0.461 | 3.042 ± 0.48 | 0.881 ± 0.253 | 1.441 ± 0.338 | 0.0 ± 0.0
Gln | 5.844 ± 0.816 | 0.4 ± 0.189 | 1.921 ± 0.407 | 2.322 ± 0.402 | 1.601 ± 0.421 | 2.962 ± 0.382 | 1.201 ± 0.326 | 2.322 ± 0.421 | 1.281 ± 0.338 | 4.403 ± 0.636 | 2.001 ± 0.435 | 1.201 ± 0.332 | 1.761 ± 0.36 | 2.162 ± 0.657 | 2.642 ± 0.495 | 2.162 ± 0.481 | 2.642 ± 0.399 | 2.722 ± 0.458 | 0.64 ± 0.236 | 1.441 ± 0.273 | 0.0 ± 0.0
Arg | 5.844 ± 0.739 | 0.4 ± 0.205 | 3.683 ± 0.53 | 3.843 ± 0.592 | 1.841 ± 0.357 | 3.683 ± 0.638 | 1.601 ± 0.293 | 3.122 ± 0.467 | 3.523 ± 0.631 | 5.604 ± 0.765 | 2.162 ± 0.566 | 2.402 ± 0.503 | 2.402 ± 0.433 | 1.921 ± 0.398 | 3.843 ± 0.643 | 3.202 ± 0.48 | 2.802 ± 0.478 | 3.523 ± 0.551 | 0.801 ± 0.264 | 1.841 ± 0.363 | 0.0 ± 0.0
Ser | 5.364 ± 0.666 | 0.32 ± 0.186 | 3.763 ± 0.391 | 3.202 ± 0.686 | 2.001 ± 0.355 | 4.323 ± 0.802 | 0.961 ± 0.303 | 2.722 ± 0.512 | 3.603 ± 0.685 | 3.923 ± 0.617 | 1.841 ± 0.382 | 2.322 ± 0.423 | 2.642 ± 0.378 | 2.562 ± 0.458 | 3.282 ± 0.423 | 2.482 ± 0.678 | 3.843 ± 0.59 | 4.243 ± 0.424 | 0.881 ± 0.384 | 2.001 ± 0.434 | 0.0 ± 0.0
Thr | 6.725 ± 0.831 | 0.16 ± 0.1 | 3.683 ± 0.477 | 3.362 ± 0.547 | 1.761 ± 0.415 | 5.524 ± 0.762 | 0.56 ± 0.206 | 3.282 ± 0.559 | 2.882 ± 0.548 | 4.323 ± 0.683 | 0.961 ± 0.262 | 2.562 ± 0.424 | 4.003 ± 0.656 | 2.322 ± 0.468 | 3.202 ± 0.571 | 3.763 ± 0.706 | 3.523 ± 0.478 | 4.884 ± 0.73 | 0.24 ± 0.164 | 1.601 ± 0.498 | 0.0 ± 0.0
Val | 7.686 ± 1.116 | 0.721 ± 0.219 | 2.962 ± 0.462 | 4.483 ± 0.57 | 2.482 ± 0.404 | 5.604 ± 0.786 | 1.441 ± 0.4 | 2.722 ± 0.506 | 4.483 ± 0.765 | 5.604 ± 0.609 | 2.001 ± 0.267 | 3.442 ± 0.504 | 3.442 ± 0.733 | 2.562 ± 0.303 | 4.323 ± 0.365 | 3.202 ± 0.549 | 4.483 ± 0.894 | 4.643 ± 0.658 | 0.721 ± 0.234 | 2.322 ± 0.367 | 0.0 ± 0.0
Trp | 1.121 ± 0.263 | 0.32 ± 0.204 | 0.56 ± 0.197 | 1.041 ± 0.32 | 0.961 ± 0.279 | 0.64 ± 0.231 | 0.24 ± 0.152 | 0.721 ± 0.329 | 0.56 ± 0.239 | 1.361 ± 0.326 | 0.56 ± 0.194 | 0.32 ± 0.161 | 0.4 ± 0.172 | 0.56 ± 0.233 | 1.041 ± 0.304 | 0.881 ± 0.277 | 0.961 ± 0.269 | 0.56 ± 0.212 | 0.16 ± 0.107 | 0.16 ± 0.099 | 0.0 ± 0.0
Tyr | 2.482 ± 0.345 | 0.4 ± 0.19 | 2.162 ± 0.437 | 1.441 ± 0.286 | 0.801 ± 0.263 | 2.882 ± 0.369 | 0.721 ± 0.267 | 1.201 ± 0.291 | 1.761 ± 0.351 | 2.402 ± 0.472 | 0.721 ± 0.264 | 1.201 ± 0.287 | 0.721 ± 0.248 | 1.521 ± 0.325 | 2.001 ± 0.395 | 1.281 ± 0.301 | 1.601 ± 0.333 | 2.482 ± 0.491 | 0.56 ± 0.215 | 1.041 ± 0.329 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 46 proteins (12492 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
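Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of the replicate values serves as the error estimate. This is an illustrative sketch under stated assumptions, not the database's implementation: the function name `bootstrap_error` is hypothetical, sequences are assumed to use one-letter codes, and the use of the sample standard deviation as the reported ± value is an assumption, since the note does not say which spread measure is used.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate mean and spread of one dipeptide frequency (per mille).

    Assumptions: proteins are resampled with replacement as whole units,
    pairs are counted within each protein, and the spread is the sample
    standard deviation over the n_boot replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # One bootstrap replicate: draw as many proteins as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(a + b for a, b in zip(seq, seq[1:]))
        total = sum(counts.values())
        estimates.append(1000 * counts[pair] / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real phage sequences.
toy_mean, toy_sd = bootstrap_error(["MAAAK", "MAK", "MKKA"], "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects protein-to-protein variation in the proteome.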

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski