Amino acid dipepetide frequency for Vibrio phage VHML

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.879AlaAla: 10.879 ± 1.905
0.913AlaCys: 0.913 ± 0.269
6.228AlaAsp: 6.228 ± 0.741
8.138AlaGlu: 8.138 ± 0.783
3.488AlaPhe: 3.488 ± 0.514
7.308AlaGly: 7.308 ± 0.87
1.329AlaHis: 1.329 ± 0.278
5.066AlaIle: 5.066 ± 0.56
6.726AlaLys: 6.726 ± 0.756
7.474AlaLeu: 7.474 ± 0.69
2.325AlaMet: 2.325 ± 0.451
3.654AlaAsn: 3.654 ± 0.458
3.903AlaPro: 3.903 ± 0.569
3.737AlaGln: 3.737 ± 0.508
4.401AlaArg: 4.401 ± 0.534
4.9AlaSer: 4.9 ± 0.974
4.65AlaThr: 4.65 ± 0.641
5.896AlaVal: 5.896 ± 0.855
1.91AlaTrp: 1.91 ± 0.413
3.239AlaTyr: 3.239 ± 0.575
0.166AlaXaa: 0.166 ± 0.121
Cys
0.498CysAla: 0.498 ± 0.221
0.498CysCys: 0.498 ± 0.198
0.581CysAsp: 0.581 ± 0.206
0.581CysGlu: 0.581 ± 0.224
0.166CysPhe: 0.166 ± 0.096
1.08CysGly: 1.08 ± 0.464
0.415CysHis: 0.415 ± 0.175
0.581CysIle: 0.581 ± 0.263
0.415CysLys: 0.415 ± 0.201
0.498CysLeu: 0.498 ± 0.201
0.415CysMet: 0.415 ± 0.184
0.332CysAsn: 0.332 ± 0.173
0.498CysPro: 0.498 ± 0.248
0.415CysGln: 0.415 ± 0.176
1.578CysArg: 1.578 ± 0.363
0.83CysSer: 0.83 ± 0.311
0.664CysThr: 0.664 ± 0.308
1.08CysVal: 1.08 ± 0.3
0.166CysTrp: 0.166 ± 0.11
0.498CysTyr: 0.498 ± 0.242
0.083CysXaa: 0.083 ± 0.084
Asp
4.65AspAla: 4.65 ± 0.547
0.997AspCys: 0.997 ± 0.292
3.737AspAsp: 3.737 ± 0.891
4.235AspGlu: 4.235 ± 0.637
3.073AspPhe: 3.073 ± 0.568
4.401AspGly: 4.401 ± 0.577
0.913AspHis: 0.913 ± 0.263
3.322AspIle: 3.322 ± 0.461
3.239AspLys: 3.239 ± 0.647
6.394AspLeu: 6.394 ± 0.603
1.495AspMet: 1.495 ± 0.416
2.657AspAsn: 2.657 ± 0.5
2.408AspPro: 2.408 ± 0.481
2.74AspGln: 2.74 ± 0.43
3.239AspArg: 3.239 ± 0.539
2.657AspSer: 2.657 ± 0.53
2.076AspThr: 2.076 ± 0.406
4.235AspVal: 4.235 ± 0.655
0.83AspTrp: 0.83 ± 0.253
2.076AspTyr: 2.076 ± 0.379
0.0AspXaa: 0.0 ± 0.0
Glu
7.308GluAla: 7.308 ± 0.639
0.581GluCys: 0.581 ± 0.22
2.74GluAsp: 2.74 ± 0.562
4.733GluGlu: 4.733 ± 0.903
2.99GluPhe: 2.99 ± 0.373
4.484GluGly: 4.484 ± 0.549
1.578GluHis: 1.578 ± 0.291
3.156GluIle: 3.156 ± 0.639
2.491GluLys: 2.491 ± 0.466
7.889GluLeu: 7.889 ± 0.779
2.159GluMet: 2.159 ± 0.4
2.076GluAsn: 2.076 ± 0.393
2.823GluPro: 2.823 ± 0.426
5.481GluGln: 5.481 ± 0.62
4.235GluArg: 4.235 ± 0.681
3.488GluSer: 3.488 ± 0.525
4.733GluThr: 4.733 ± 0.519
4.235GluVal: 4.235 ± 0.663
1.246GluTrp: 1.246 ± 0.35
2.574GluTyr: 2.574 ± 0.513
0.0GluXaa: 0.0 ± 0.0
Phe
3.073PheAla: 3.073 ± 0.484
0.913PheCys: 0.913 ± 0.321
2.99PheAsp: 2.99 ± 0.483
2.325PheGlu: 2.325 ± 0.446
0.913PhePhe: 0.913 ± 0.275
2.99PheGly: 2.99 ± 0.573
0.664PheHis: 0.664 ± 0.257
1.412PheIle: 1.412 ± 0.33
2.491PheLys: 2.491 ± 0.378
1.91PheLeu: 1.91 ± 0.373
0.747PheMet: 0.747 ± 0.212
1.412PheAsn: 1.412 ± 0.344
0.83PhePro: 0.83 ± 0.271
0.83PheGln: 0.83 ± 0.22
1.744PheArg: 1.744 ± 0.458
2.823PheSer: 2.823 ± 0.526
1.495PheThr: 1.495 ± 0.371
2.325PheVal: 2.325 ± 0.456
0.415PheTrp: 0.415 ± 0.179
1.163PheTyr: 1.163 ± 0.338
0.0PheXaa: 0.0 ± 0.0
Gly
6.062GlyAla: 6.062 ± 0.824
0.581GlyCys: 0.581 ± 0.232
4.318GlyAsp: 4.318 ± 0.496
4.733GlyGlu: 4.733 ± 0.905
3.239GlyPhe: 3.239 ± 0.432
5.232GlyGly: 5.232 ± 0.579
1.661GlyHis: 1.661 ± 0.433
3.654GlyIle: 3.654 ± 0.473
4.65GlyLys: 4.65 ± 0.616
7.308GlyLeu: 7.308 ± 0.755
2.325GlyMet: 2.325 ± 0.365
2.242GlyAsn: 2.242 ± 0.41
1.744GlyPro: 1.744 ± 0.462
3.239GlyGln: 3.239 ± 0.607
3.571GlyArg: 3.571 ± 0.448
4.069GlySer: 4.069 ± 0.569
3.986GlyThr: 3.986 ± 0.792
5.896GlyVal: 5.896 ± 0.801
0.913GlyTrp: 0.913 ± 0.22
2.408GlyTyr: 2.408 ± 0.498
0.0GlyXaa: 0.0 ± 0.0
His
1.412HisAla: 1.412 ± 0.373
0.332HisCys: 0.332 ± 0.209
1.08HisAsp: 1.08 ± 0.313
1.661HisGlu: 1.661 ± 0.38
0.747HisPhe: 0.747 ± 0.221
1.329HisGly: 1.329 ± 0.364
0.581HisHis: 0.581 ± 0.202
1.495HisIle: 1.495 ± 0.343
0.913HisLys: 0.913 ± 0.292
2.491HisLeu: 2.491 ± 0.528
0.498HisMet: 0.498 ± 0.193
0.997HisAsn: 0.997 ± 0.307
0.664HisPro: 0.664 ± 0.21
0.83HisGln: 0.83 ± 0.284
1.08HisArg: 1.08 ± 0.286
0.581HisSer: 0.581 ± 0.212
0.997HisThr: 0.997 ± 0.274
1.495HisVal: 1.495 ± 0.301
0.166HisTrp: 0.166 ± 0.129
1.08HisTyr: 1.08 ± 0.345
0.0HisXaa: 0.0 ± 0.0
Ile
5.315IleAla: 5.315 ± 0.777
0.747IleCys: 0.747 ± 0.262
2.99IleAsp: 2.99 ± 0.547
3.156IleGlu: 3.156 ± 0.455
1.246IlePhe: 1.246 ± 0.319
3.571IleGly: 3.571 ± 0.448
1.163IleHis: 1.163 ± 0.328
2.491IleIle: 2.491 ± 0.422
3.405IleLys: 3.405 ± 0.546
3.986IleLeu: 3.986 ± 0.518
0.997IleMet: 0.997 ± 0.23
1.827IleAsn: 1.827 ± 0.429
1.827IlePro: 1.827 ± 0.401
1.827IleGln: 1.827 ± 0.408
2.657IleArg: 2.657 ± 0.45
3.488IleSer: 3.488 ± 0.475
3.488IleThr: 3.488 ± 0.478
3.073IleVal: 3.073 ± 0.535
0.332IleTrp: 0.332 ± 0.225
1.163IleTyr: 1.163 ± 0.293
0.0IleXaa: 0.0 ± 0.0
Lys
6.228LysAla: 6.228 ± 0.798
0.249LysCys: 0.249 ± 0.135
3.322LysAsp: 3.322 ± 0.468
4.235LysGlu: 4.235 ± 0.57
1.246LysPhe: 1.246 ± 0.348
4.484LysGly: 4.484 ± 0.467
1.578LysHis: 1.578 ± 0.361
1.744LysIle: 1.744 ± 0.355
2.906LysLys: 2.906 ± 0.581
3.405LysLeu: 3.405 ± 0.61
1.495LysMet: 1.495 ± 0.298
2.823LysAsn: 2.823 ± 0.436
2.076LysPro: 2.076 ± 0.49
1.827LysGln: 1.827 ± 0.485
3.073LysArg: 3.073 ± 0.436
4.401LysSer: 4.401 ± 0.613
4.235LysThr: 4.235 ± 0.667
5.73LysVal: 5.73 ± 0.852
1.08LysTrp: 1.08 ± 0.302
1.578LysTyr: 1.578 ± 0.383
0.0LysXaa: 0.0 ± 0.0
Leu
8.969LeuAla: 8.969 ± 1.188
0.747LeuCys: 0.747 ± 0.253
5.315LeuAsp: 5.315 ± 0.613
6.311LeuGlu: 6.311 ± 0.879
3.239LeuPhe: 3.239 ± 0.465
5.481LeuGly: 5.481 ± 0.656
1.661LeuHis: 1.661 ± 0.323
4.484LeuIle: 4.484 ± 0.553
5.398LeuLys: 5.398 ± 0.685
7.889LeuLeu: 7.889 ± 0.597
1.495LeuMet: 1.495 ± 0.311
5.149LeuAsn: 5.149 ± 0.748
4.65LeuPro: 4.65 ± 0.769
4.567LeuGln: 4.567 ± 0.731
5.896LeuArg: 5.896 ± 0.77
6.311LeuSer: 6.311 ± 0.761
4.65LeuThr: 4.65 ± 0.571
5.149LeuVal: 5.149 ± 0.526
1.412LeuTrp: 1.412 ± 0.29
1.91LeuTyr: 1.91 ± 0.287
0.0LeuXaa: 0.0 ± 0.0
Met
2.823MetAla: 2.823 ± 0.557
0.581MetCys: 0.581 ± 0.198
1.91MetAsp: 1.91 ± 0.367
1.578MetGlu: 1.578 ± 0.35
0.415MetPhe: 0.415 ± 0.158
1.91MetGly: 1.91 ± 0.395
0.332MetHis: 0.332 ± 0.145
0.913MetIle: 0.913 ± 0.266
1.495MetLys: 1.495 ± 0.315
2.242MetLeu: 2.242 ± 0.436
0.581MetMet: 0.581 ± 0.218
1.246MetAsn: 1.246 ± 0.27
0.997MetPro: 0.997 ± 0.31
0.83MetGln: 0.83 ± 0.272
1.163MetArg: 1.163 ± 0.214
1.661MetSer: 1.661 ± 0.453
1.993MetThr: 1.993 ± 0.382
1.578MetVal: 1.578 ± 0.393
0.249MetTrp: 0.249 ± 0.158
0.332MetTyr: 0.332 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
3.405AsnAla: 3.405 ± 0.474
0.166AsnCys: 0.166 ± 0.109
1.993AsnAsp: 1.993 ± 0.517
2.159AsnGlu: 2.159 ± 0.396
0.747AsnPhe: 0.747 ± 0.284
4.733AsnGly: 4.733 ± 0.69
1.246AsnHis: 1.246 ± 0.314
1.578AsnIle: 1.578 ± 0.444
1.412AsnLys: 1.412 ± 0.363
3.986AsnLeu: 3.986 ± 0.602
1.246AsnMet: 1.246 ± 0.361
1.827AsnAsn: 1.827 ± 0.406
2.242AsnPro: 2.242 ± 0.416
1.993AsnGln: 1.993 ± 0.378
1.993AsnArg: 1.993 ± 0.456
2.159AsnSer: 2.159 ± 0.394
2.242AsnThr: 2.242 ± 0.364
2.74AsnVal: 2.74 ± 0.522
0.664AsnTrp: 0.664 ± 0.298
1.495AsnTyr: 1.495 ± 0.345
0.083AsnXaa: 0.083 ± 0.076
Pro
3.737ProAla: 3.737 ± 0.498
0.747ProCys: 0.747 ± 0.327
2.906ProAsp: 2.906 ± 0.586
3.82ProGlu: 3.82 ± 0.727
0.747ProPhe: 0.747 ± 0.286
2.74ProGly: 2.74 ± 0.388
0.581ProHis: 0.581 ± 0.211
1.91ProIle: 1.91 ± 0.343
1.578ProLys: 1.578 ± 0.306
3.322ProLeu: 3.322 ± 0.503
0.747ProMet: 0.747 ± 0.217
1.246ProAsn: 1.246 ± 0.324
0.997ProPro: 0.997 ± 0.364
1.744ProGln: 1.744 ± 0.62
2.159ProArg: 2.159 ± 0.447
2.325ProSer: 2.325 ± 0.387
2.242ProThr: 2.242 ± 0.425
3.073ProVal: 3.073 ± 0.611
0.664ProTrp: 0.664 ± 0.296
0.913ProTyr: 0.913 ± 0.314
0.0ProXaa: 0.0 ± 0.0
Gln
5.564GlnAla: 5.564 ± 0.989
0.415GlnCys: 0.415 ± 0.21
1.578GlnAsp: 1.578 ± 0.308
4.65GlnGlu: 4.65 ± 0.777
1.578GlnPhe: 1.578 ± 0.438
2.242GlnGly: 2.242 ± 0.395
0.664GlnHis: 0.664 ± 0.28
1.993GlnIle: 1.993 ± 0.348
2.408GlnLys: 2.408 ± 0.452
4.9GlnLeu: 4.9 ± 0.808
1.08GlnMet: 1.08 ± 0.3
1.827GlnAsn: 1.827 ± 0.458
2.408GlnPro: 2.408 ± 0.434
2.159GlnGln: 2.159 ± 0.663
2.325GlnArg: 2.325 ± 0.359
2.491GlnSer: 2.491 ± 0.336
2.325GlnThr: 2.325 ± 0.45
4.401GlnVal: 4.401 ± 0.578
0.581GlnTrp: 0.581 ± 0.274
1.827GlnTyr: 1.827 ± 0.414
0.083GlnXaa: 0.083 ± 0.076
Arg
4.983ArgAla: 4.983 ± 0.595
0.415ArgCys: 0.415 ± 0.164
2.076ArgAsp: 2.076 ± 0.397
3.737ArgGlu: 3.737 ± 0.599
1.993ArgPhe: 1.993 ± 0.337
2.823ArgGly: 2.823 ± 0.523
1.163ArgHis: 1.163 ± 0.376
3.322ArgIle: 3.322 ± 0.602
4.484ArgLys: 4.484 ± 0.542
5.813ArgLeu: 5.813 ± 0.575
1.412ArgMet: 1.412 ± 0.306
1.993ArgAsn: 1.993 ± 0.466
1.744ArgPro: 1.744 ± 0.425
3.571ArgGln: 3.571 ± 0.55
3.82ArgArg: 3.82 ± 0.544
3.488ArgSer: 3.488 ± 0.523
2.242ArgThr: 2.242 ± 0.355
4.152ArgVal: 4.152 ± 0.546
1.246ArgTrp: 1.246 ± 0.393
2.076ArgTyr: 2.076 ± 0.392
0.166ArgXaa: 0.166 ± 0.118
Ser
6.477SerAla: 6.477 ± 0.877
0.747SerCys: 0.747 ± 0.253
3.82SerAsp: 3.82 ± 0.553
2.99SerGlu: 2.99 ± 0.493
1.578SerPhe: 1.578 ± 0.358
5.232SerGly: 5.232 ± 0.719
1.661SerHis: 1.661 ± 0.414
2.657SerIle: 2.657 ± 0.538
3.571SerLys: 3.571 ± 0.505
6.062SerLeu: 6.062 ± 0.639
2.325SerMet: 2.325 ± 0.506
2.242SerAsn: 2.242 ± 0.499
2.159SerPro: 2.159 ± 0.518
3.239SerGln: 3.239 ± 0.552
3.239SerArg: 3.239 ± 0.546
2.906SerSer: 2.906 ± 0.557
2.242SerThr: 2.242 ± 0.296
3.239SerVal: 3.239 ± 0.524
0.747SerTrp: 0.747 ± 0.192
1.91SerTyr: 1.91 ± 0.39
0.0SerXaa: 0.0 ± 0.0
Thr
5.564ThrAla: 5.564 ± 0.647
0.581ThrCys: 0.581 ± 0.243
3.073ThrAsp: 3.073 ± 0.51
4.401ThrGlu: 4.401 ± 0.612
1.163ThrPhe: 1.163 ± 0.229
4.733ThrGly: 4.733 ± 0.485
0.913ThrHis: 0.913 ± 0.284
2.242ThrIle: 2.242 ± 0.53
4.152ThrLys: 4.152 ± 0.56
4.567ThrLeu: 4.567 ± 0.551
0.664ThrMet: 0.664 ± 0.212
1.91ThrAsn: 1.91 ± 0.341
1.827ThrPro: 1.827 ± 0.395
2.574ThrGln: 2.574 ± 0.439
3.073ThrArg: 3.073 ± 0.466
3.239ThrSer: 3.239 ± 0.554
3.239ThrThr: 3.239 ± 0.434
3.903ThrVal: 3.903 ± 0.522
0.83ThrTrp: 0.83 ± 0.245
1.744ThrTyr: 1.744 ± 0.27
0.0ThrXaa: 0.0 ± 0.0
Val
5.315ValAla: 5.315 ± 0.91
0.913ValCys: 0.913 ± 0.259
4.733ValAsp: 4.733 ± 0.703
3.82ValGlu: 3.82 ± 0.494
2.99ValPhe: 2.99 ± 0.375
4.069ValGly: 4.069 ± 0.46
1.246ValHis: 1.246 ± 0.337
4.9ValIle: 4.9 ± 0.567
3.82ValLys: 3.82 ± 0.482
6.394ValLeu: 6.394 ± 0.627
1.91ValMet: 1.91 ± 0.366
2.99ValAsn: 2.99 ± 0.477
2.657ValPro: 2.657 ± 0.427
2.99ValGln: 2.99 ± 0.5
4.152ValArg: 4.152 ± 0.637
3.986ValSer: 3.986 ± 0.628
4.983ValThr: 4.983 ± 0.731
5.149ValVal: 5.149 ± 0.832
1.495ValTrp: 1.495 ± 0.296
2.076ValTyr: 2.076 ± 0.529
0.0ValXaa: 0.0 ± 0.0
Trp
1.246TrpAla: 1.246 ± 0.265
0.332TrpCys: 0.332 ± 0.161
1.246TrpAsp: 1.246 ± 0.311
1.246TrpGlu: 1.246 ± 0.368
0.997TrpPhe: 0.997 ± 0.291
0.913TrpGly: 0.913 ± 0.255
0.415TrpHis: 0.415 ± 0.208
0.664TrpIle: 0.664 ± 0.191
0.913TrpLys: 0.913 ± 0.324
1.163TrpLeu: 1.163 ± 0.23
0.083TrpMet: 0.083 ± 0.092
0.747TrpAsn: 0.747 ± 0.281
0.581TrpPro: 0.581 ± 0.223
0.415TrpGln: 0.415 ± 0.173
0.83TrpArg: 0.83 ± 0.266
1.412TrpSer: 1.412 ± 0.309
0.498TrpThr: 0.498 ± 0.258
1.495TrpVal: 1.495 ± 0.39
0.166TrpTrp: 0.166 ± 0.104
0.083TrpTyr: 0.083 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.74TyrAla: 2.74 ± 0.614
0.415TyrCys: 0.415 ± 0.162
2.657TyrAsp: 2.657 ± 0.388
2.408TyrGlu: 2.408 ± 0.535
0.913TyrPhe: 0.913 ± 0.327
1.993TyrGly: 1.993 ± 0.443
0.664TyrHis: 0.664 ± 0.279
1.163TyrIle: 1.163 ± 0.355
1.08TyrLys: 1.08 ± 0.253
2.99TyrLeu: 2.99 ± 0.532
0.83TyrMet: 0.83 ± 0.234
0.913TyrAsn: 0.913 ± 0.247
1.163TyrPro: 1.163 ± 0.372
2.491TyrGln: 2.491 ± 0.481
2.491TyrArg: 2.491 ± 0.429
1.827TyrSer: 1.827 ± 0.49
1.329TyrThr: 1.329 ± 0.345
1.744TyrVal: 1.744 ± 0.356
0.332TyrTrp: 0.332 ± 0.165
0.997TyrTyr: 0.997 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.083XaaAla: 0.083 ± 0.069
0.0XaaCys: 0.0 ± 0.0
0.083XaaAsp: 0.083 ± 0.089
0.083XaaGlu: 0.083 ± 0.093
0.0XaaPhe: 0.0 ± 0.0
0.083XaaGly: 0.083 ± 0.076
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.166XaaLys: 0.166 ± 0.101
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.083XaaThr: 0.083 ± 0.084
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12043 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski