Amino acid dipepetide frequency for Vibrio phage AS51

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.43AlaAla: 8.43 ± 1.579
0.655AlaCys: 0.655 ± 0.238
5.975AlaAsp: 5.975 ± 0.475
6.138AlaGlu: 6.138 ± 0.883
2.946AlaPhe: 2.946 ± 0.472
5.402AlaGly: 5.402 ± 0.596
1.637AlaHis: 1.637 ± 0.44
5.156AlaIle: 5.156 ± 0.758
5.484AlaLys: 5.484 ± 0.576
6.138AlaLeu: 6.138 ± 0.717
2.701AlaMet: 2.701 ± 0.456
3.438AlaAsn: 3.438 ± 0.571
3.11AlaPro: 3.11 ± 0.573
3.192AlaGln: 3.192 ± 0.528
4.911AlaArg: 4.911 ± 0.877
3.765AlaSer: 3.765 ± 0.843
3.11AlaThr: 3.11 ± 0.537
4.911AlaVal: 4.911 ± 0.679
0.982AlaTrp: 0.982 ± 0.336
3.519AlaTyr: 3.519 ± 0.572
0.0AlaXaa: 0.0 ± 0.0
Cys
0.409CysAla: 0.409 ± 0.175
0.0CysCys: 0.0 ± 0.0
0.409CysAsp: 0.409 ± 0.188
0.982CysGlu: 0.982 ± 0.346
0.327CysPhe: 0.327 ± 0.143
0.655CysGly: 0.655 ± 0.322
0.246CysHis: 0.246 ± 0.143
0.737CysIle: 0.737 ± 0.224
0.818CysLys: 0.818 ± 0.239
0.9CysLeu: 0.9 ± 0.264
0.246CysMet: 0.246 ± 0.141
0.491CysAsn: 0.491 ± 0.169
0.573CysPro: 0.573 ± 0.161
0.409CysGln: 0.409 ± 0.184
0.818CysArg: 0.818 ± 0.352
0.491CysSer: 0.491 ± 0.203
0.327CysThr: 0.327 ± 0.125
0.491CysVal: 0.491 ± 0.228
0.246CysTrp: 0.246 ± 0.114
0.573CysTyr: 0.573 ± 0.197
0.0CysXaa: 0.0 ± 0.0
Asp
6.138AspAla: 6.138 ± 0.627
0.737AspCys: 0.737 ± 0.234
3.847AspAsp: 3.847 ± 0.383
4.01AspGlu: 4.01 ± 0.609
2.865AspPhe: 2.865 ± 0.401
4.665AspGly: 4.665 ± 0.776
0.818AspHis: 0.818 ± 0.224
3.847AspIle: 3.847 ± 0.722
5.566AspLys: 5.566 ± 0.618
4.665AspLeu: 4.665 ± 0.613
1.31AspMet: 1.31 ± 0.397
3.519AspAsn: 3.519 ± 0.538
2.783AspPro: 2.783 ± 0.428
1.064AspGln: 1.064 ± 0.385
2.783AspArg: 2.783 ± 0.375
3.438AspSer: 3.438 ± 0.61
3.274AspThr: 3.274 ± 0.496
4.583AspVal: 4.583 ± 0.605
1.228AspTrp: 1.228 ± 0.459
3.028AspTyr: 3.028 ± 0.393
0.0AspXaa: 0.0 ± 0.0
Glu
6.057GluAla: 6.057 ± 1.061
0.737GluCys: 0.737 ± 0.371
4.01GluAsp: 4.01 ± 0.712
8.021GluGlu: 8.021 ± 1.224
3.601GluPhe: 3.601 ± 0.475
4.665GluGly: 4.665 ± 0.566
1.637GluHis: 1.637 ± 0.391
3.11GluIle: 3.11 ± 0.421
4.829GluLys: 4.829 ± 0.675
6.957GluLeu: 6.957 ± 0.915
2.537GluMet: 2.537 ± 0.558
2.701GluAsn: 2.701 ± 0.357
2.374GluPro: 2.374 ± 0.521
4.256GluGln: 4.256 ± 0.63
3.601GluArg: 3.601 ± 0.474
4.502GluSer: 4.502 ± 0.723
2.537GluThr: 2.537 ± 0.341
4.338GluVal: 4.338 ± 0.654
1.719GluTrp: 1.719 ± 0.398
2.865GluTyr: 2.865 ± 0.463
0.0GluXaa: 0.0 ± 0.0
Phe
3.028PheAla: 3.028 ± 0.4
0.573PheCys: 0.573 ± 0.254
3.192PheAsp: 3.192 ± 0.749
2.619PheGlu: 2.619 ± 0.424
1.228PhePhe: 1.228 ± 0.307
2.374PheGly: 2.374 ± 0.472
0.818PheHis: 0.818 ± 0.241
2.128PheIle: 2.128 ± 0.385
2.701PheLys: 2.701 ± 0.524
2.701PheLeu: 2.701 ± 0.437
1.637PheMet: 1.637 ± 0.375
2.21PheAsn: 2.21 ± 0.298
1.555PhePro: 1.555 ± 0.412
1.146PheGln: 1.146 ± 0.218
2.292PheArg: 2.292 ± 0.424
2.783PheSer: 2.783 ± 0.398
2.046PheThr: 2.046 ± 0.464
2.619PheVal: 2.619 ± 0.481
0.655PheTrp: 0.655 ± 0.247
1.146PheTyr: 1.146 ± 0.251
0.0PheXaa: 0.0 ± 0.0
Gly
4.42GlyAla: 4.42 ± 0.659
0.9GlyCys: 0.9 ± 0.3
4.42GlyAsp: 4.42 ± 0.451
5.729GlyGlu: 5.729 ± 0.571
2.374GlyPhe: 2.374 ± 0.392
5.647GlyGly: 5.647 ± 0.731
1.801GlyHis: 1.801 ± 0.464
3.929GlyIle: 3.929 ± 0.652
7.775GlyLys: 7.775 ± 0.788
4.911GlyLeu: 4.911 ± 0.597
2.128GlyMet: 2.128 ± 0.303
3.274GlyAsn: 3.274 ± 0.503
0.0GlyPro: 0.0 ± 0.0
2.292GlyGln: 2.292 ± 0.323
3.438GlyArg: 3.438 ± 0.555
4.092GlySer: 4.092 ± 0.713
4.583GlyThr: 4.583 ± 0.789
3.601GlyVal: 3.601 ± 0.615
1.146GlyTrp: 1.146 ± 0.298
3.274GlyTyr: 3.274 ± 0.434
0.0GlyXaa: 0.0 ± 0.0
His
1.473HisAla: 1.473 ± 0.458
0.246HisCys: 0.246 ± 0.126
1.637HisAsp: 1.637 ± 0.323
1.228HisGlu: 1.228 ± 0.344
0.737HisPhe: 0.737 ± 0.259
1.146HisGly: 1.146 ± 0.355
0.818HisHis: 0.818 ± 0.302
1.146HisIle: 1.146 ± 0.292
1.473HisLys: 1.473 ± 0.337
2.21HisLeu: 2.21 ± 0.46
0.409HisMet: 0.409 ± 0.148
0.982HisAsn: 0.982 ± 0.322
0.737HisPro: 0.737 ± 0.277
1.064HisGln: 1.064 ± 0.297
0.818HisArg: 0.818 ± 0.266
1.391HisSer: 1.391 ± 0.262
0.818HisThr: 0.818 ± 0.248
1.146HisVal: 1.146 ± 0.295
0.164HisTrp: 0.164 ± 0.105
0.818HisTyr: 0.818 ± 0.332
0.0HisXaa: 0.0 ± 0.0
Ile
4.42IleAla: 4.42 ± 0.659
0.409IleCys: 0.409 ± 0.175
4.338IleAsp: 4.338 ± 0.586
3.765IleGlu: 3.765 ± 0.472
1.637IlePhe: 1.637 ± 0.387
4.338IleGly: 4.338 ± 0.526
1.391IleHis: 1.391 ± 0.364
3.11IleIle: 3.11 ± 0.665
4.993IleLys: 4.993 ± 0.686
3.11IleLeu: 3.11 ± 0.499
1.473IleMet: 1.473 ± 0.263
2.537IleAsn: 2.537 ± 0.606
1.719IlePro: 1.719 ± 0.318
2.21IleGln: 2.21 ± 0.497
2.783IleArg: 2.783 ± 0.506
2.783IleSer: 2.783 ± 0.557
3.519IleThr: 3.519 ± 0.58
3.274IleVal: 3.274 ± 0.465
0.818IleTrp: 0.818 ± 0.276
1.882IleTyr: 1.882 ± 0.45
0.0IleXaa: 0.0 ± 0.0
Lys
6.957LysAla: 6.957 ± 0.965
0.737LysCys: 0.737 ± 0.237
5.647LysAsp: 5.647 ± 0.839
6.384LysGlu: 6.384 ± 0.79
2.783LysPhe: 2.783 ± 0.341
4.747LysGly: 4.747 ± 0.84
1.473LysHis: 1.473 ± 0.362
1.964LysIle: 1.964 ± 0.356
3.929LysLys: 3.929 ± 0.709
6.63LysLeu: 6.63 ± 0.652
1.964LysMet: 1.964 ± 0.399
3.601LysAsn: 3.601 ± 0.506
3.274LysPro: 3.274 ± 0.579
3.438LysGln: 3.438 ± 0.388
3.929LysArg: 3.929 ± 0.532
3.438LysSer: 3.438 ± 0.536
4.256LysThr: 4.256 ± 0.609
3.683LysVal: 3.683 ± 0.663
0.655LysTrp: 0.655 ± 0.219
3.028LysTyr: 3.028 ± 0.427
0.0LysXaa: 0.0 ± 0.0
Leu
6.302LeuAla: 6.302 ± 0.696
0.409LeuCys: 0.409 ± 0.209
5.729LeuAsp: 5.729 ± 0.624
5.811LeuGlu: 5.811 ± 0.721
2.537LeuPhe: 2.537 ± 0.397
5.893LeuGly: 5.893 ± 0.424
2.128LeuHis: 2.128 ± 0.456
4.993LeuIle: 4.993 ± 0.705
5.32LeuLys: 5.32 ± 0.691
6.138LeuLeu: 6.138 ± 0.75
2.783LeuMet: 2.783 ± 0.4
3.028LeuAsn: 3.028 ± 0.556
3.765LeuPro: 3.765 ± 0.459
3.192LeuGln: 3.192 ± 0.557
4.747LeuArg: 4.747 ± 0.935
4.911LeuSer: 4.911 ± 0.698
4.502LeuThr: 4.502 ± 0.517
5.893LeuVal: 5.893 ± 0.718
0.737LeuTrp: 0.737 ± 0.229
2.619LeuTyr: 2.619 ± 0.415
0.0LeuXaa: 0.0 ± 0.0
Met
2.619MetAla: 2.619 ± 0.429
0.0MetCys: 0.0 ± 0.0
0.982MetAsp: 0.982 ± 0.261
1.964MetGlu: 1.964 ± 0.452
1.31MetPhe: 1.31 ± 0.356
1.801MetGly: 1.801 ± 0.456
0.655MetHis: 0.655 ± 0.205
1.391MetIle: 1.391 ± 0.261
2.374MetLys: 2.374 ± 0.398
3.438MetLeu: 3.438 ± 0.593
0.573MetMet: 0.573 ± 0.247
1.228MetAsn: 1.228 ± 0.342
1.228MetPro: 1.228 ± 0.363
1.719MetGln: 1.719 ± 0.345
2.128MetArg: 2.128 ± 0.389
2.374MetSer: 2.374 ± 0.414
2.292MetThr: 2.292 ± 0.512
1.228MetVal: 1.228 ± 0.236
0.082MetTrp: 0.082 ± 0.075
1.146MetTyr: 1.146 ± 0.327
0.0MetXaa: 0.0 ± 0.0
Asn
3.765AsnAla: 3.765 ± 0.627
0.246AsnCys: 0.246 ± 0.131
2.783AsnAsp: 2.783 ± 0.507
2.292AsnGlu: 2.292 ± 0.404
1.882AsnPhe: 1.882 ± 0.274
3.765AsnGly: 3.765 ± 0.61
0.655AsnHis: 0.655 ± 0.214
2.865AsnIle: 2.865 ± 0.54
4.338AsnLys: 4.338 ± 0.716
5.402AsnLeu: 5.402 ± 0.693
1.146AsnMet: 1.146 ± 0.303
2.865AsnAsn: 2.865 ± 0.425
2.374AsnPro: 2.374 ± 0.43
1.801AsnGln: 1.801 ± 0.381
2.128AsnArg: 2.128 ± 0.393
2.619AsnSer: 2.619 ± 0.446
2.701AsnThr: 2.701 ± 0.356
2.783AsnVal: 2.783 ± 0.42
0.573AsnTrp: 0.573 ± 0.23
1.555AsnTyr: 1.555 ± 0.444
0.0AsnXaa: 0.0 ± 0.0
Pro
2.946ProAla: 2.946 ± 0.475
0.491ProCys: 0.491 ± 0.203
2.455ProAsp: 2.455 ± 0.499
3.847ProGlu: 3.847 ± 0.713
1.719ProPhe: 1.719 ± 0.439
0.246ProGly: 0.246 ± 0.135
0.491ProHis: 0.491 ± 0.157
2.128ProIle: 2.128 ± 0.456
2.128ProLys: 2.128 ± 0.434
3.192ProLeu: 3.192 ± 0.48
0.982ProMet: 0.982 ± 0.222
2.455ProAsn: 2.455 ± 0.311
1.228ProPro: 1.228 ± 0.325
1.473ProGln: 1.473 ± 0.296
1.146ProArg: 1.146 ± 0.379
2.946ProSer: 2.946 ± 0.492
1.637ProThr: 1.637 ± 0.298
2.292ProVal: 2.292 ± 0.539
0.573ProTrp: 0.573 ± 0.169
1.228ProTyr: 1.228 ± 0.323
0.0ProXaa: 0.0 ± 0.0
Gln
4.01GlnAla: 4.01 ± 0.56
0.409GlnCys: 0.409 ± 0.167
1.31GlnAsp: 1.31 ± 0.313
3.929GlnGlu: 3.929 ± 0.651
1.964GlnPhe: 1.964 ± 0.362
2.21GlnGly: 2.21 ± 0.426
0.573GlnHis: 0.573 ± 0.179
1.801GlnIle: 1.801 ± 0.429
1.555GlnLys: 1.555 ± 0.392
3.274GlnLeu: 3.274 ± 0.512
1.555GlnMet: 1.555 ± 0.312
2.537GlnAsn: 2.537 ± 0.367
1.146GlnPro: 1.146 ± 0.32
2.701GlnGln: 2.701 ± 0.63
2.865GlnArg: 2.865 ± 0.624
2.701GlnSer: 2.701 ± 0.528
2.128GlnThr: 2.128 ± 0.445
2.619GlnVal: 2.619 ± 0.468
0.327GlnTrp: 0.327 ± 0.174
1.31GlnTyr: 1.31 ± 0.339
0.0GlnXaa: 0.0 ± 0.0
Arg
4.092ArgAla: 4.092 ± 0.724
0.409ArgCys: 0.409 ± 0.218
3.274ArgAsp: 3.274 ± 0.505
3.438ArgGlu: 3.438 ± 0.528
1.801ArgPhe: 1.801 ± 0.394
4.01ArgGly: 4.01 ± 0.549
0.818ArgHis: 0.818 ± 0.23
2.865ArgIle: 2.865 ± 0.439
4.01ArgLys: 4.01 ± 0.613
4.338ArgLeu: 4.338 ± 0.514
1.637ArgMet: 1.637 ± 0.309
2.619ArgAsn: 2.619 ± 0.633
1.801ArgPro: 1.801 ± 0.386
2.046ArgGln: 2.046 ± 0.396
3.028ArgArg: 3.028 ± 0.573
2.619ArgSer: 2.619 ± 0.457
2.701ArgThr: 2.701 ± 0.396
3.519ArgVal: 3.519 ± 0.329
1.064ArgTrp: 1.064 ± 0.247
2.128ArgTyr: 2.128 ± 0.406
0.0ArgXaa: 0.0 ± 0.0
Ser
4.256SerAla: 4.256 ± 0.616
0.737SerCys: 0.737 ± 0.24
3.11SerAsp: 3.11 ± 0.462
4.092SerGlu: 4.092 ± 0.498
2.701SerPhe: 2.701 ± 0.645
5.156SerGly: 5.156 ± 0.732
1.146SerHis: 1.146 ± 0.338
3.929SerIle: 3.929 ± 0.459
4.338SerLys: 4.338 ± 0.571
5.156SerLeu: 5.156 ± 0.566
2.21SerMet: 2.21 ± 0.431
2.701SerAsn: 2.701 ± 0.505
1.801SerPro: 1.801 ± 0.384
1.964SerGln: 1.964 ± 0.439
2.128SerArg: 2.128 ± 0.425
3.192SerSer: 3.192 ± 0.584
3.274SerThr: 3.274 ± 0.524
3.519SerVal: 3.519 ± 0.613
0.655SerTrp: 0.655 ± 0.206
2.455SerTyr: 2.455 ± 0.629
0.0SerXaa: 0.0 ± 0.0
Thr
3.765ThrAla: 3.765 ± 0.511
0.655ThrCys: 0.655 ± 0.226
3.765ThrAsp: 3.765 ± 0.532
4.174ThrGlu: 4.174 ± 0.67
2.783ThrPhe: 2.783 ± 0.403
4.829ThrGly: 4.829 ± 0.683
0.818ThrHis: 0.818 ± 0.244
3.438ThrIle: 3.438 ± 0.46
2.946ThrLys: 2.946 ± 0.442
4.01ThrLeu: 4.01 ± 0.56
1.391ThrMet: 1.391 ± 0.351
2.21ThrAsn: 2.21 ± 0.348
2.046ThrPro: 2.046 ± 0.436
2.455ThrGln: 2.455 ± 0.408
2.374ThrArg: 2.374 ± 0.366
2.619ThrSer: 2.619 ± 0.646
3.519ThrThr: 3.519 ± 0.725
3.11ThrVal: 3.11 ± 0.39
0.491ThrTrp: 0.491 ± 0.184
1.637ThrTyr: 1.637 ± 0.321
0.0ThrXaa: 0.0 ± 0.0
Val
4.911ValAla: 4.911 ± 0.645
0.737ValCys: 0.737 ± 0.279
4.42ValAsp: 4.42 ± 0.549
3.274ValGlu: 3.274 ± 0.629
2.128ValPhe: 2.128 ± 0.566
4.829ValGly: 4.829 ± 0.49
1.064ValHis: 1.064 ± 0.28
3.601ValIle: 3.601 ± 0.627
4.42ValLys: 4.42 ± 0.65
4.42ValLeu: 4.42 ± 0.659
1.964ValMet: 1.964 ± 0.521
3.028ValAsn: 3.028 ± 0.493
2.21ValPro: 2.21 ± 0.355
2.21ValGln: 2.21 ± 0.458
3.683ValArg: 3.683 ± 0.686
3.765ValSer: 3.765 ± 0.734
3.192ValThr: 3.192 ± 0.735
4.502ValVal: 4.502 ± 0.657
0.491ValTrp: 0.491 ± 0.261
1.473ValTyr: 1.473 ± 0.447
0.0ValXaa: 0.0 ± 0.0
Trp
0.9TrpAla: 0.9 ± 0.263
0.409TrpCys: 0.409 ± 0.193
0.491TrpAsp: 0.491 ± 0.178
0.982TrpGlu: 0.982 ± 0.357
0.573TrpPhe: 0.573 ± 0.229
0.655TrpGly: 0.655 ± 0.21
0.573TrpHis: 0.573 ± 0.256
0.573TrpIle: 0.573 ± 0.213
1.228TrpLys: 1.228 ± 0.448
1.228TrpLeu: 1.228 ± 0.392
0.655TrpMet: 0.655 ± 0.219
0.573TrpAsn: 0.573 ± 0.184
0.327TrpPro: 0.327 ± 0.161
0.655TrpGln: 0.655 ± 0.225
0.327TrpArg: 0.327 ± 0.183
1.31TrpSer: 1.31 ± 0.335
0.737TrpThr: 0.737 ± 0.303
0.491TrpVal: 0.491 ± 0.18
0.164TrpTrp: 0.164 ± 0.11
0.737TrpTyr: 0.737 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.701TyrAla: 2.701 ± 0.465
0.655TyrCys: 0.655 ± 0.204
2.374TyrAsp: 2.374 ± 0.451
2.374TyrGlu: 2.374 ± 0.447
1.555TyrPhe: 1.555 ± 0.305
2.701TyrGly: 2.701 ± 0.408
0.9TyrHis: 0.9 ± 0.276
1.801TyrIle: 1.801 ± 0.335
2.292TyrLys: 2.292 ± 0.33
2.619TyrLeu: 2.619 ± 0.301
1.146TyrMet: 1.146 ± 0.254
2.537TyrAsn: 2.537 ± 0.374
1.637TyrPro: 1.637 ± 0.415
1.637TyrGln: 1.637 ± 0.358
2.292TyrArg: 2.292 ± 0.376
2.783TyrSer: 2.783 ± 0.586
1.882TyrThr: 1.882 ± 0.375
1.719TyrVal: 1.719 ± 0.383
0.818TyrTrp: 0.818 ± 0.282
1.228TyrTyr: 1.228 ± 0.346
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 34 proteins (12219 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski