Amino acid dipepetide frequency for Vibrio phage LP.2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.719AlaAla: 7.719 ± 1.885
0.965AlaCys: 0.965 ± 0.29
4.649AlaAsp: 4.649 ± 0.724
4.825AlaGlu: 4.825 ± 0.699
1.667AlaPhe: 1.667 ± 0.366
5.175AlaGly: 5.175 ± 0.867
1.667AlaHis: 1.667 ± 0.418
5.877AlaIle: 5.877 ± 0.632
6.053AlaLys: 6.053 ± 0.893
5.965AlaLeu: 5.965 ± 1.042
2.105AlaMet: 2.105 ± 0.457
2.719AlaAsn: 2.719 ± 0.45
2.368AlaPro: 2.368 ± 0.524
4.912AlaGln: 4.912 ± 0.716
3.333AlaArg: 3.333 ± 0.586
4.649AlaSer: 4.649 ± 0.934
4.211AlaThr: 4.211 ± 0.565
4.386AlaVal: 4.386 ± 0.728
1.14AlaTrp: 1.14 ± 0.324
2.193AlaTyr: 2.193 ± 0.4
0.0AlaXaa: 0.0 ± 0.0
Cys
1.228CysAla: 1.228 ± 0.36
0.175CysCys: 0.175 ± 0.119
0.965CysAsp: 0.965 ± 0.346
1.053CysGlu: 1.053 ± 0.388
0.439CysPhe: 0.439 ± 0.234
0.877CysGly: 0.877 ± 0.332
0.0CysHis: 0.0 ± 0.0
0.614CysIle: 0.614 ± 0.237
0.789CysLys: 0.789 ± 0.368
0.877CysLeu: 0.877 ± 0.351
0.614CysMet: 0.614 ± 0.274
1.053CysAsn: 1.053 ± 0.377
0.614CysPro: 0.614 ± 0.257
1.053CysGln: 1.053 ± 0.342
0.702CysArg: 0.702 ± 0.262
0.702CysSer: 0.702 ± 0.277
0.439CysThr: 0.439 ± 0.209
0.877CysVal: 0.877 ± 0.245
0.0CysTrp: 0.0 ± 0.0
0.175CysTyr: 0.175 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
4.737AspAla: 4.737 ± 0.65
1.228AspCys: 1.228 ± 0.353
4.123AspAsp: 4.123 ± 0.525
5.351AspGlu: 5.351 ± 0.847
2.368AspPhe: 2.368 ± 0.546
5.439AspGly: 5.439 ± 0.419
1.053AspHis: 1.053 ± 0.319
3.333AspIle: 3.333 ± 0.679
4.123AspLys: 4.123 ± 0.655
5.263AspLeu: 5.263 ± 0.601
2.368AspMet: 2.368 ± 0.371
3.07AspAsn: 3.07 ± 0.525
1.491AspPro: 1.491 ± 0.277
2.281AspGln: 2.281 ± 0.411
2.632AspArg: 2.632 ± 0.47
3.07AspSer: 3.07 ± 0.53
3.684AspThr: 3.684 ± 0.476
3.684AspVal: 3.684 ± 0.51
1.228AspTrp: 1.228 ± 0.322
2.368AspTyr: 2.368 ± 0.474
0.0AspXaa: 0.0 ± 0.0
Glu
6.14GluAla: 6.14 ± 0.872
0.702GluCys: 0.702 ± 0.245
3.246GluAsp: 3.246 ± 0.546
4.649GluGlu: 4.649 ± 0.729
3.947GluPhe: 3.947 ± 0.644
3.509GluGly: 3.509 ± 0.604
1.404GluHis: 1.404 ± 0.433
4.825GluIle: 4.825 ± 0.552
4.386GluLys: 4.386 ± 0.694
7.807GluLeu: 7.807 ± 0.978
1.93GluMet: 1.93 ± 0.469
3.158GluAsn: 3.158 ± 0.489
1.228GluPro: 1.228 ± 0.306
2.368GluGln: 2.368 ± 0.456
4.123GluArg: 4.123 ± 0.43
5.439GluSer: 5.439 ± 0.737
3.772GluThr: 3.772 ± 0.715
4.035GluVal: 4.035 ± 0.636
1.404GluTrp: 1.404 ± 0.469
3.07GluTyr: 3.07 ± 0.57
0.0GluXaa: 0.0 ± 0.0
Phe
2.281PheAla: 2.281 ± 0.426
0.526PheCys: 0.526 ± 0.228
2.895PheAsp: 2.895 ± 0.468
2.719PheGlu: 2.719 ± 0.461
1.404PhePhe: 1.404 ± 0.326
2.544PheGly: 2.544 ± 0.476
0.175PheHis: 0.175 ± 0.121
2.018PheIle: 2.018 ± 0.349
2.018PheLys: 2.018 ± 0.347
2.281PheLeu: 2.281 ± 0.44
1.404PheMet: 1.404 ± 0.349
2.456PheAsn: 2.456 ± 0.377
0.702PhePro: 0.702 ± 0.2
0.877PheGln: 0.877 ± 0.243
1.842PheArg: 1.842 ± 0.475
2.544PheSer: 2.544 ± 0.533
3.421PheThr: 3.421 ± 0.813
2.982PheVal: 2.982 ± 0.427
0.263PheTrp: 0.263 ± 0.215
0.965PheTyr: 0.965 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
4.561GlyAla: 4.561 ± 0.929
0.877GlyCys: 0.877 ± 0.276
3.509GlyAsp: 3.509 ± 0.479
4.649GlyGlu: 4.649 ± 0.531
3.246GlyPhe: 3.246 ± 0.526
5.614GlyGly: 5.614 ± 0.816
0.614GlyHis: 0.614 ± 0.209
4.035GlyIle: 4.035 ± 0.56
4.211GlyLys: 4.211 ± 0.666
5.0GlyLeu: 5.0 ± 0.659
1.754GlyMet: 1.754 ± 0.395
3.421GlyAsn: 3.421 ± 0.48
1.228GlyPro: 1.228 ± 0.422
2.632GlyGln: 2.632 ± 0.565
2.895GlyArg: 2.895 ± 0.463
4.649GlySer: 4.649 ± 0.554
4.386GlyThr: 4.386 ± 0.815
5.175GlyVal: 5.175 ± 0.541
0.789GlyTrp: 0.789 ± 0.277
2.982GlyTyr: 2.982 ± 0.457
0.0GlyXaa: 0.0 ± 0.0
His
0.702HisAla: 0.702 ± 0.222
0.263HisCys: 0.263 ± 0.16
0.877HisAsp: 0.877 ± 0.268
1.316HisGlu: 1.316 ± 0.364
0.965HisPhe: 0.965 ± 0.323
0.965HisGly: 0.965 ± 0.262
0.175HisHis: 0.175 ± 0.117
0.789HisIle: 0.789 ± 0.245
0.702HisLys: 0.702 ± 0.264
1.316HisLeu: 1.316 ± 0.358
0.439HisMet: 0.439 ± 0.203
1.053HisAsn: 1.053 ± 0.294
0.965HisPro: 0.965 ± 0.327
0.526HisGln: 0.526 ± 0.179
1.14HisArg: 1.14 ± 0.356
0.614HisSer: 0.614 ± 0.299
0.877HisThr: 0.877 ± 0.291
0.789HisVal: 0.789 ± 0.341
0.175HisTrp: 0.175 ± 0.127
0.965HisTyr: 0.965 ± 0.295
0.0HisXaa: 0.0 ± 0.0
Ile
4.912IleAla: 4.912 ± 0.638
0.965IleCys: 0.965 ± 0.277
4.561IleAsp: 4.561 ± 0.507
4.912IleGlu: 4.912 ± 0.584
1.316IlePhe: 1.316 ± 0.297
5.088IleGly: 5.088 ± 0.566
0.526IleHis: 0.526 ± 0.207
3.158IleIle: 3.158 ± 0.594
3.684IleLys: 3.684 ± 0.564
4.298IleLeu: 4.298 ± 0.581
0.789IleMet: 0.789 ± 0.218
3.333IleAsn: 3.333 ± 0.635
3.246IlePro: 3.246 ± 0.43
2.105IleGln: 2.105 ± 0.467
3.246IleArg: 3.246 ± 0.492
4.474IleSer: 4.474 ± 0.62
4.737IleThr: 4.737 ± 0.693
3.246IleVal: 3.246 ± 0.478
0.877IleTrp: 0.877 ± 0.345
2.632IleTyr: 2.632 ± 0.462
0.0IleXaa: 0.0 ± 0.0
Lys
4.123LysAla: 4.123 ± 0.701
0.789LysCys: 0.789 ± 0.301
3.246LysAsp: 3.246 ± 0.565
5.526LysGlu: 5.526 ± 0.743
2.368LysPhe: 2.368 ± 0.413
3.421LysGly: 3.421 ± 0.501
1.93LysHis: 1.93 ± 0.489
3.509LysIle: 3.509 ± 0.511
4.561LysLys: 4.561 ± 0.845
6.14LysLeu: 6.14 ± 0.748
2.193LysMet: 2.193 ± 0.502
2.807LysAsn: 2.807 ± 0.657
2.632LysPro: 2.632 ± 0.466
3.246LysGln: 3.246 ± 0.626
3.421LysArg: 3.421 ± 0.599
4.737LysSer: 4.737 ± 0.772
2.719LysThr: 2.719 ± 0.421
4.474LysVal: 4.474 ± 0.52
0.439LysTrp: 0.439 ± 0.226
2.456LysTyr: 2.456 ± 0.588
0.0LysXaa: 0.0 ± 0.0
Leu
5.526LeuAla: 5.526 ± 0.624
1.316LeuCys: 1.316 ± 0.409
4.825LeuAsp: 4.825 ± 0.565
6.14LeuGlu: 6.14 ± 0.734
2.193LeuPhe: 2.193 ± 0.392
4.561LeuGly: 4.561 ± 0.503
1.14LeuHis: 1.14 ± 0.299
5.175LeuIle: 5.175 ± 0.636
5.0LeuLys: 5.0 ± 0.682
6.14LeuLeu: 6.14 ± 0.744
1.842LeuMet: 1.842 ± 0.505
4.912LeuAsn: 4.912 ± 0.666
3.333LeuPro: 3.333 ± 0.491
2.895LeuGln: 2.895 ± 0.598
3.333LeuArg: 3.333 ± 0.564
6.754LeuSer: 6.754 ± 0.704
6.579LeuThr: 6.579 ± 0.838
4.737LeuVal: 4.737 ± 0.552
0.263LeuTrp: 0.263 ± 0.159
2.018LeuTyr: 2.018 ± 0.358
0.0LeuXaa: 0.0 ± 0.0
Met
2.719MetAla: 2.719 ± 0.487
0.526MetCys: 0.526 ± 0.22
1.228MetAsp: 1.228 ± 0.368
1.579MetGlu: 1.579 ± 0.45
1.14MetPhe: 1.14 ± 0.36
1.14MetGly: 1.14 ± 0.293
0.526MetHis: 0.526 ± 0.24
2.105MetIle: 2.105 ± 0.552
2.544MetLys: 2.544 ± 0.556
1.754MetLeu: 1.754 ± 0.347
0.702MetMet: 0.702 ± 0.264
1.667MetAsn: 1.667 ± 0.542
0.702MetPro: 0.702 ± 0.283
0.877MetGln: 0.877 ± 0.272
1.667MetArg: 1.667 ± 0.367
1.93MetSer: 1.93 ± 0.462
2.105MetThr: 2.105 ± 0.419
1.14MetVal: 1.14 ± 0.273
0.175MetTrp: 0.175 ± 0.11
1.053MetTyr: 1.053 ± 0.263
0.0MetXaa: 0.0 ± 0.0
Asn
5.088AsnAla: 5.088 ± 0.76
0.702AsnCys: 0.702 ± 0.269
2.982AsnAsp: 2.982 ± 0.401
3.596AsnGlu: 3.596 ± 0.723
1.667AsnPhe: 1.667 ± 0.355
4.737AsnGly: 4.737 ± 0.81
0.789AsnHis: 0.789 ± 0.252
3.333AsnIle: 3.333 ± 0.589
3.421AsnLys: 3.421 ± 0.608
3.246AsnLeu: 3.246 ± 0.469
1.404AsnMet: 1.404 ± 0.328
2.544AsnAsn: 2.544 ± 0.583
2.719AsnPro: 2.719 ± 0.479
2.895AsnGln: 2.895 ± 0.5
2.544AsnArg: 2.544 ± 0.5
3.947AsnSer: 3.947 ± 0.458
2.982AsnThr: 2.982 ± 0.609
2.982AsnVal: 2.982 ± 0.492
0.965AsnTrp: 0.965 ± 0.256
1.842AsnTyr: 1.842 ± 0.278
0.0AsnXaa: 0.0 ± 0.0
Pro
2.281ProAla: 2.281 ± 0.412
0.263ProCys: 0.263 ± 0.147
2.018ProAsp: 2.018 ± 0.419
2.368ProGlu: 2.368 ± 0.393
0.702ProPhe: 0.702 ± 0.25
1.228ProGly: 1.228 ± 0.31
0.614ProHis: 0.614 ± 0.229
2.368ProIle: 2.368 ± 0.517
2.018ProLys: 2.018 ± 0.434
2.281ProLeu: 2.281 ± 0.424
0.614ProMet: 0.614 ± 0.278
2.456ProAsn: 2.456 ± 0.529
1.053ProPro: 1.053 ± 0.298
1.842ProGln: 1.842 ± 0.544
1.404ProArg: 1.404 ± 0.367
2.456ProSer: 2.456 ± 0.439
2.193ProThr: 2.193 ± 0.442
2.193ProVal: 2.193 ± 0.388
0.088ProTrp: 0.088 ± 0.088
1.228ProTyr: 1.228 ± 0.263
0.0ProXaa: 0.0 ± 0.0
Gln
4.386GlnAla: 4.386 ± 0.764
0.877GlnCys: 0.877 ± 0.246
2.632GlnAsp: 2.632 ± 0.365
3.684GlnGlu: 3.684 ± 0.659
1.316GlnPhe: 1.316 ± 0.273
1.667GlnGly: 1.667 ± 0.369
0.263GlnHis: 0.263 ± 0.177
2.456GlnIle: 2.456 ± 0.397
2.632GlnLys: 2.632 ± 0.53
3.947GlnLeu: 3.947 ± 0.589
1.14GlnMet: 1.14 ± 0.303
3.421GlnAsn: 3.421 ± 0.617
1.491GlnPro: 1.491 ± 0.328
3.684GlnGln: 3.684 ± 0.997
2.193GlnArg: 2.193 ± 0.44
2.544GlnSer: 2.544 ± 0.451
2.632GlnThr: 2.632 ± 0.527
2.456GlnVal: 2.456 ± 0.324
0.789GlnTrp: 0.789 ± 0.24
2.632GlnTyr: 2.632 ± 0.448
0.0GlnXaa: 0.0 ± 0.0
Arg
3.596ArgAla: 3.596 ± 0.555
0.526ArgCys: 0.526 ± 0.217
3.596ArgAsp: 3.596 ± 0.611
2.807ArgGlu: 2.807 ± 0.429
2.368ArgPhe: 2.368 ± 0.505
2.719ArgGly: 2.719 ± 0.567
0.702ArgHis: 0.702 ± 0.207
2.895ArgIle: 2.895 ± 0.463
4.035ArgLys: 4.035 ± 0.9
4.123ArgLeu: 4.123 ± 0.448
0.965ArgMet: 0.965 ± 0.314
1.842ArgAsn: 1.842 ± 0.388
1.491ArgPro: 1.491 ± 0.348
2.193ArgGln: 2.193 ± 0.537
2.368ArgArg: 2.368 ± 0.437
2.281ArgSer: 2.281 ± 0.33
2.193ArgThr: 2.193 ± 0.357
4.123ArgVal: 4.123 ± 0.495
0.965ArgTrp: 0.965 ± 0.292
1.842ArgTyr: 1.842 ± 0.421
0.0ArgXaa: 0.0 ± 0.0
Ser
5.088SerAla: 5.088 ± 0.624
0.702SerCys: 0.702 ± 0.327
3.86SerAsp: 3.86 ± 0.501
5.088SerGlu: 5.088 ± 0.612
3.684SerPhe: 3.684 ± 0.641
5.877SerGly: 5.877 ± 0.828
1.228SerHis: 1.228 ± 0.361
4.211SerIle: 4.211 ± 0.543
3.421SerLys: 3.421 ± 0.584
5.088SerLeu: 5.088 ± 0.733
2.193SerMet: 2.193 ± 0.38
4.123SerAsn: 4.123 ± 0.667
1.491SerPro: 1.491 ± 0.403
4.211SerGln: 4.211 ± 0.538
2.632SerArg: 2.632 ± 0.531
5.526SerSer: 5.526 ± 0.718
4.123SerThr: 4.123 ± 0.636
3.246SerVal: 3.246 ± 0.458
0.526SerTrp: 0.526 ± 0.205
2.719SerTyr: 2.719 ± 0.462
0.0SerXaa: 0.0 ± 0.0
Thr
4.298ThrAla: 4.298 ± 0.855
0.263ThrCys: 0.263 ± 0.138
4.298ThrAsp: 4.298 ± 0.616
3.333ThrGlu: 3.333 ± 0.585
1.93ThrPhe: 1.93 ± 0.528
6.228ThrGly: 6.228 ± 1.076
1.14ThrHis: 1.14 ± 0.36
4.298ThrIle: 4.298 ± 0.649
3.772ThrLys: 3.772 ± 0.584
4.912ThrLeu: 4.912 ± 0.663
1.053ThrMet: 1.053 ± 0.283
4.035ThrAsn: 4.035 ± 0.729
2.018ThrPro: 2.018 ± 0.409
3.07ThrGln: 3.07 ± 0.539
2.456ThrArg: 2.456 ± 0.374
4.123ThrSer: 4.123 ± 0.59
3.596ThrThr: 3.596 ± 0.615
4.737ThrVal: 4.737 ± 0.617
0.965ThrTrp: 0.965 ± 0.325
1.93ThrTyr: 1.93 ± 0.324
0.0ThrXaa: 0.0 ± 0.0
Val
4.298ValAla: 4.298 ± 0.61
0.789ValCys: 0.789 ± 0.226
6.053ValAsp: 6.053 ± 0.657
3.684ValGlu: 3.684 ± 0.605
1.754ValPhe: 1.754 ± 0.319
3.07ValGly: 3.07 ± 0.549
0.351ValHis: 0.351 ± 0.17
4.298ValIle: 4.298 ± 0.613
3.684ValLys: 3.684 ± 0.542
4.035ValLeu: 4.035 ± 0.645
1.93ValMet: 1.93 ± 0.464
3.421ValAsn: 3.421 ± 0.538
1.579ValPro: 1.579 ± 0.331
2.456ValGln: 2.456 ± 0.483
3.421ValArg: 3.421 ± 0.512
5.351ValSer: 5.351 ± 0.643
4.386ValThr: 4.386 ± 0.818
4.561ValVal: 4.561 ± 0.63
1.14ValTrp: 1.14 ± 0.388
2.193ValTyr: 2.193 ± 0.569
0.0ValXaa: 0.0 ± 0.0
Trp
0.526TrpAla: 0.526 ± 0.19
0.175TrpCys: 0.175 ± 0.12
0.789TrpAsp: 0.789 ± 0.241
1.228TrpGlu: 1.228 ± 0.323
0.526TrpPhe: 0.526 ± 0.222
0.263TrpGly: 0.263 ± 0.161
0.351TrpHis: 0.351 ± 0.159
0.965TrpIle: 0.965 ± 0.319
0.877TrpLys: 0.877 ± 0.308
1.14TrpLeu: 1.14 ± 0.399
0.439TrpMet: 0.439 ± 0.215
0.702TrpAsn: 0.702 ± 0.232
0.263TrpPro: 0.263 ± 0.157
0.526TrpGln: 0.526 ± 0.245
0.526TrpArg: 0.526 ± 0.173
0.789TrpSer: 0.789 ± 0.247
0.702TrpThr: 0.702 ± 0.23
0.789TrpVal: 0.789 ± 0.285
0.351TrpTrp: 0.351 ± 0.17
0.789TrpTyr: 0.789 ± 0.239
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.368TyrAla: 2.368 ± 0.627
0.614TyrCys: 0.614 ± 0.271
2.544TyrAsp: 2.544 ± 0.515
2.632TyrGlu: 2.632 ± 0.648
1.316TyrPhe: 1.316 ± 0.358
2.105TyrGly: 2.105 ± 0.455
0.965TyrHis: 0.965 ± 0.3
1.93TyrIle: 1.93 ± 0.346
2.807TyrLys: 2.807 ± 0.542
3.07TyrLeu: 3.07 ± 0.651
1.316TyrMet: 1.316 ± 0.48
2.193TyrAsn: 2.193 ± 0.424
1.053TyrPro: 1.053 ± 0.27
2.193TyrGln: 2.193 ± 0.381
1.754TyrArg: 1.754 ± 0.394
2.456TyrSer: 2.456 ± 0.406
2.719TyrThr: 2.719 ± 0.455
1.842TyrVal: 1.842 ± 0.458
0.175TyrTrp: 0.175 ± 0.131
1.93TyrTyr: 1.93 ± 0.434
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (11401 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski