Amino acid dipepetide frequency for Vibrio phage Seahorse

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.309AlaAla: 6.309 ± 1.119
0.36AlaCys: 0.36 ± 0.183
4.236AlaAsp: 4.236 ± 0.516
6.128AlaGlu: 6.128 ± 1.281
2.073AlaPhe: 2.073 ± 0.321
4.236AlaGly: 4.236 ± 0.728
1.262AlaHis: 1.262 ± 0.232
4.957AlaIle: 4.957 ± 0.604
6.218AlaLys: 6.218 ± 0.985
4.596AlaLeu: 4.596 ± 0.945
2.704AlaMet: 2.704 ± 0.529
4.056AlaAsn: 4.056 ± 0.861
2.253AlaPro: 2.253 ± 0.477
2.433AlaGln: 2.433 ± 0.49
4.146AlaArg: 4.146 ± 0.997
4.686AlaSer: 4.686 ± 0.886
5.858AlaThr: 5.858 ± 1.092
3.875AlaVal: 3.875 ± 0.592
0.901AlaTrp: 0.901 ± 0.323
2.433AlaTyr: 2.433 ± 0.514
0.0AlaXaa: 0.0 ± 0.0
Cys
0.901CysAla: 0.901 ± 0.246
0.36CysCys: 0.36 ± 0.173
0.901CysAsp: 0.901 ± 0.272
1.172CysGlu: 1.172 ± 0.316
0.631CysPhe: 0.631 ± 0.223
2.253CysGly: 2.253 ± 0.508
0.36CysHis: 0.36 ± 0.165
0.631CysIle: 0.631 ± 0.25
0.991CysLys: 0.991 ± 0.247
0.901CysLeu: 0.901 ± 0.344
0.27CysMet: 0.27 ± 0.15
0.36CysAsn: 0.36 ± 0.17
0.451CysPro: 0.451 ± 0.182
0.27CysGln: 0.27 ± 0.141
0.18CysArg: 0.18 ± 0.117
0.901CysSer: 0.901 ± 0.317
0.541CysThr: 0.541 ± 0.246
0.541CysVal: 0.541 ± 0.288
0.18CysTrp: 0.18 ± 0.134
0.541CysTyr: 0.541 ± 0.186
0.0CysXaa: 0.0 ± 0.0
Asp
4.686AspAla: 4.686 ± 0.746
1.352AspCys: 1.352 ± 0.428
5.497AspAsp: 5.497 ± 0.658
4.146AspGlu: 4.146 ± 0.538
3.154AspPhe: 3.154 ± 0.531
5.317AspGly: 5.317 ± 1.013
1.622AspHis: 1.622 ± 0.379
3.965AspIle: 3.965 ± 0.55
4.776AspLys: 4.776 ± 0.659
5.497AspLeu: 5.497 ± 0.585
1.983AspMet: 1.983 ± 0.484
2.523AspAsn: 2.523 ± 0.436
2.253AspPro: 2.253 ± 0.433
2.073AspGln: 2.073 ± 0.583
3.605AspArg: 3.605 ± 0.501
3.515AspSer: 3.515 ± 0.503
3.785AspThr: 3.785 ± 0.737
3.425AspVal: 3.425 ± 0.492
1.532AspTrp: 1.532 ± 0.414
2.343AspTyr: 2.343 ± 0.442
0.0AspXaa: 0.0 ± 0.0
Glu
6.669GluAla: 6.669 ± 1.132
1.081GluCys: 1.081 ± 0.374
2.974GluAsp: 2.974 ± 0.389
7.03GluGlu: 7.03 ± 1.877
3.785GluPhe: 3.785 ± 0.594
4.056GluGly: 4.056 ± 0.611
1.262GluHis: 1.262 ± 0.302
6.038GluIle: 6.038 ± 0.623
6.849GluLys: 6.849 ± 1.123
6.128GluLeu: 6.128 ± 0.817
3.064GluMet: 3.064 ± 0.558
2.704GluAsn: 2.704 ± 0.391
2.253GluPro: 2.253 ± 0.549
3.695GluGln: 3.695 ± 0.703
4.596GluArg: 4.596 ± 0.942
5.317GluSer: 5.317 ± 0.72
3.605GluThr: 3.605 ± 0.396
3.605GluVal: 3.605 ± 0.64
1.532GluTrp: 1.532 ± 0.317
2.343GluTyr: 2.343 ± 0.545
0.0GluXaa: 0.0 ± 0.0
Phe
1.893PheAla: 1.893 ± 0.459
0.721PheCys: 0.721 ± 0.254
3.605PheAsp: 3.605 ± 0.584
2.614PheGlu: 2.614 ± 0.381
1.172PhePhe: 1.172 ± 0.355
4.056PheGly: 4.056 ± 0.529
1.081PheHis: 1.081 ± 0.388
2.433PheIle: 2.433 ± 0.436
2.253PheLys: 2.253 ± 0.471
1.081PheLeu: 1.081 ± 0.264
0.811PheMet: 0.811 ± 0.298
1.442PheAsn: 1.442 ± 0.396
0.27PhePro: 0.27 ± 0.144
1.172PheGln: 1.172 ± 0.27
1.442PheArg: 1.442 ± 0.273
2.253PheSer: 2.253 ± 0.496
2.974PheThr: 2.974 ± 0.499
2.343PheVal: 2.343 ± 0.511
0.541PheTrp: 0.541 ± 0.195
0.811PheTyr: 0.811 ± 0.262
0.0PheXaa: 0.0 ± 0.0
Gly
4.867GlyAla: 4.867 ± 0.965
0.721GlyCys: 0.721 ± 0.303
4.957GlyAsp: 4.957 ± 0.77
5.047GlyGlu: 5.047 ± 0.685
1.802GlyPhe: 1.802 ± 0.42
5.588GlyGly: 5.588 ± 0.824
0.901GlyHis: 0.901 ± 0.3
3.875GlyIle: 3.875 ± 0.708
5.948GlyLys: 5.948 ± 0.642
5.317GlyLeu: 5.317 ± 0.552
1.802GlyMet: 1.802 ± 0.384
4.416GlyAsn: 4.416 ± 0.74
0.991GlyPro: 0.991 ± 0.351
2.163GlyGln: 2.163 ± 0.395
2.523GlyArg: 2.523 ± 0.44
3.695GlySer: 3.695 ± 0.579
3.425GlyThr: 3.425 ± 0.598
6.399GlyVal: 6.399 ± 0.859
1.622GlyTrp: 1.622 ± 0.422
3.244GlyTyr: 3.244 ± 0.586
0.0GlyXaa: 0.0 ± 0.0
His
1.262HisAla: 1.262 ± 0.394
0.811HisCys: 0.811 ± 0.279
1.172HisAsp: 1.172 ± 0.336
1.172HisGlu: 1.172 ± 0.396
1.262HisPhe: 1.262 ± 0.39
1.081HisGly: 1.081 ± 0.292
0.541HisHis: 0.541 ± 0.177
1.442HisIle: 1.442 ± 0.307
1.172HisLys: 1.172 ± 0.314
1.712HisLeu: 1.712 ± 0.326
0.27HisMet: 0.27 ± 0.162
0.721HisAsn: 0.721 ± 0.294
0.541HisPro: 0.541 ± 0.202
0.451HisGln: 0.451 ± 0.199
0.811HisArg: 0.811 ± 0.303
0.901HisSer: 0.901 ± 0.322
0.991HisThr: 0.991 ± 0.276
0.811HisVal: 0.811 ± 0.256
0.27HisTrp: 0.27 ± 0.114
0.991HisTyr: 0.991 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
4.596IleAla: 4.596 ± 0.725
0.721IleCys: 0.721 ± 0.249
5.227IleAsp: 5.227 ± 0.655
6.399IleGlu: 6.399 ± 0.622
1.352IlePhe: 1.352 ± 0.372
4.416IleGly: 4.416 ± 0.636
1.532IleHis: 1.532 ± 0.386
2.794IleIle: 2.794 ± 0.51
4.416IleLys: 4.416 ± 0.727
3.515IleLeu: 3.515 ± 0.531
1.802IleMet: 1.802 ± 0.306
3.335IleAsn: 3.335 ± 0.572
3.154IlePro: 3.154 ± 0.623
2.433IleGln: 2.433 ± 0.625
2.343IleArg: 2.343 ± 0.394
3.335IleSer: 3.335 ± 0.587
3.785IleThr: 3.785 ± 0.673
3.515IleVal: 3.515 ± 0.638
0.451IleTrp: 0.451 ± 0.209
2.433IleTyr: 2.433 ± 0.617
0.0IleXaa: 0.0 ± 0.0
Lys
5.948LysAla: 5.948 ± 1.052
0.811LysCys: 0.811 ± 0.258
2.974LysAsp: 2.974 ± 0.611
5.497LysGlu: 5.497 ± 0.782
2.253LysPhe: 2.253 ± 0.394
4.596LysGly: 4.596 ± 0.734
1.532LysHis: 1.532 ± 0.357
3.875LysIle: 3.875 ± 0.669
4.686LysLys: 4.686 ± 0.846
5.588LysLeu: 5.588 ± 0.615
1.983LysMet: 1.983 ± 0.463
3.154LysAsn: 3.154 ± 0.666
2.704LysPro: 2.704 ± 0.609
3.064LysGln: 3.064 ± 0.633
4.957LysArg: 4.957 ± 0.833
5.317LysSer: 5.317 ± 0.63
4.957LysThr: 4.957 ± 0.716
4.596LysVal: 4.596 ± 0.51
0.901LysTrp: 0.901 ± 0.289
2.704LysTyr: 2.704 ± 0.433
0.0LysXaa: 0.0 ± 0.0
Leu
5.227LeuAla: 5.227 ± 0.695
0.541LeuCys: 0.541 ± 0.244
5.137LeuAsp: 5.137 ± 0.571
5.137LeuGlu: 5.137 ± 0.673
1.983LeuPhe: 1.983 ± 0.445
4.596LeuGly: 4.596 ± 0.618
1.081LeuHis: 1.081 ± 0.297
4.506LeuIle: 4.506 ± 0.569
4.506LeuLys: 4.506 ± 0.756
5.137LeuLeu: 5.137 ± 0.592
1.532LeuMet: 1.532 ± 0.288
4.776LeuAsn: 4.776 ± 0.606
2.433LeuPro: 2.433 ± 0.39
2.433LeuGln: 2.433 ± 0.572
3.875LeuArg: 3.875 ± 0.531
5.317LeuSer: 5.317 ± 0.835
4.236LeuThr: 4.236 ± 0.668
4.957LeuVal: 4.957 ± 0.514
0.901LeuTrp: 0.901 ± 0.252
2.343LeuTyr: 2.343 ± 0.48
0.0LeuXaa: 0.0 ± 0.0
Met
2.704MetAla: 2.704 ± 0.427
0.451MetCys: 0.451 ± 0.191
1.081MetAsp: 1.081 ± 0.376
1.352MetGlu: 1.352 ± 0.321
0.991MetPhe: 0.991 ± 0.29
1.802MetGly: 1.802 ± 0.341
0.27MetHis: 0.27 ± 0.134
1.712MetIle: 1.712 ± 0.345
1.893MetLys: 1.893 ± 0.37
1.802MetLeu: 1.802 ± 0.413
0.631MetMet: 0.631 ± 0.272
0.991MetAsn: 0.991 ± 0.327
1.172MetPro: 1.172 ± 0.352
0.811MetGln: 0.811 ± 0.274
1.262MetArg: 1.262 ± 0.376
2.523MetSer: 2.523 ± 0.472
2.433MetThr: 2.433 ± 0.438
1.712MetVal: 1.712 ± 0.44
0.36MetTrp: 0.36 ± 0.201
0.631MetTyr: 0.631 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
4.056AsnAla: 4.056 ± 0.608
0.811AsnCys: 0.811 ± 0.245
2.433AsnAsp: 2.433 ± 0.468
4.686AsnGlu: 4.686 ± 0.662
1.712AsnPhe: 1.712 ± 0.349
5.317AsnGly: 5.317 ± 0.694
1.081AsnHis: 1.081 ± 0.278
2.523AsnIle: 2.523 ± 0.482
3.335AsnLys: 3.335 ± 0.446
4.056AsnLeu: 4.056 ± 0.544
0.451AsnMet: 0.451 ± 0.189
2.253AsnAsn: 2.253 ± 0.435
1.532AsnPro: 1.532 ± 0.362
2.884AsnGln: 2.884 ± 0.488
1.893AsnArg: 1.893 ± 0.429
3.785AsnSer: 3.785 ± 0.498
3.154AsnThr: 3.154 ± 0.44
3.335AsnVal: 3.335 ± 0.446
0.991AsnTrp: 0.991 ± 0.352
2.614AsnTyr: 2.614 ± 0.526
0.0AsnXaa: 0.0 ± 0.0
Pro
2.343ProAla: 2.343 ± 0.395
0.451ProCys: 0.451 ± 0.217
2.884ProAsp: 2.884 ± 0.508
2.794ProGlu: 2.794 ± 0.553
1.081ProPhe: 1.081 ± 0.319
1.352ProGly: 1.352 ± 0.355
0.27ProHis: 0.27 ± 0.197
2.343ProIle: 2.343 ± 0.357
1.893ProLys: 1.893 ± 0.419
3.244ProLeu: 3.244 ± 0.479
0.811ProMet: 0.811 ± 0.314
2.163ProAsn: 2.163 ± 0.386
0.631ProPro: 0.631 ± 0.253
1.442ProGln: 1.442 ± 0.33
1.262ProArg: 1.262 ± 0.297
2.343ProSer: 2.343 ± 0.462
2.253ProThr: 2.253 ± 0.394
2.343ProVal: 2.343 ± 0.381
0.541ProTrp: 0.541 ± 0.207
1.262ProTyr: 1.262 ± 0.423
0.0ProXaa: 0.0 ± 0.0
Gln
3.064GlnAla: 3.064 ± 0.613
0.451GlnCys: 0.451 ± 0.192
1.622GlnAsp: 1.622 ± 0.275
2.614GlnGlu: 2.614 ± 0.615
1.532GlnPhe: 1.532 ± 0.411
2.523GlnGly: 2.523 ± 0.442
0.451GlnHis: 0.451 ± 0.193
3.064GlnIle: 3.064 ± 0.641
2.704GlnLys: 2.704 ± 0.546
3.064GlnLeu: 3.064 ± 0.434
1.172GlnMet: 1.172 ± 0.281
1.622GlnAsn: 1.622 ± 0.294
2.253GlnPro: 2.253 ± 0.41
1.893GlnGln: 1.893 ± 0.506
2.163GlnArg: 2.163 ± 0.574
3.335GlnSer: 3.335 ± 0.552
1.802GlnThr: 1.802 ± 0.382
2.073GlnVal: 2.073 ± 0.444
0.451GlnTrp: 0.451 ± 0.204
1.802GlnTyr: 1.802 ± 0.376
0.0GlnXaa: 0.0 ± 0.0
Arg
2.704ArgAla: 2.704 ± 0.636
0.721ArgCys: 0.721 ± 0.255
3.785ArgAsp: 3.785 ± 0.625
4.416ArgGlu: 4.416 ± 0.805
1.712ArgPhe: 1.712 ± 0.359
2.073ArgGly: 2.073 ± 0.387
1.081ArgHis: 1.081 ± 0.259
2.523ArgIle: 2.523 ± 0.496
4.506ArgLys: 4.506 ± 1.165
3.335ArgLeu: 3.335 ± 0.591
1.352ArgMet: 1.352 ± 0.337
3.244ArgAsn: 3.244 ± 0.583
1.262ArgPro: 1.262 ± 0.325
2.614ArgGln: 2.614 ± 0.799
2.433ArgArg: 2.433 ± 0.565
3.064ArgSer: 3.064 ± 0.493
1.893ArgThr: 1.893 ± 0.47
2.794ArgVal: 2.794 ± 0.428
1.442ArgTrp: 1.442 ± 0.345
1.081ArgTyr: 1.081 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
4.506SerAla: 4.506 ± 0.939
0.36SerCys: 0.36 ± 0.167
4.056SerAsp: 4.056 ± 0.432
5.948SerGlu: 5.948 ± 0.6
2.523SerPhe: 2.523 ± 0.498
4.596SerGly: 4.596 ± 0.658
0.451SerHis: 0.451 ± 0.198
4.236SerIle: 4.236 ± 0.613
5.317SerLys: 5.317 ± 0.569
4.596SerLeu: 4.596 ± 0.645
1.893SerMet: 1.893 ± 0.437
4.776SerAsn: 4.776 ± 0.936
2.343SerPro: 2.343 ± 0.426
2.523SerGln: 2.523 ± 0.513
2.614SerArg: 2.614 ± 0.43
3.785SerSer: 3.785 ± 0.785
2.974SerThr: 2.974 ± 0.396
5.227SerVal: 5.227 ± 0.79
0.36SerTrp: 0.36 ± 0.165
1.622SerTyr: 1.622 ± 0.359
0.0SerXaa: 0.0 ± 0.0
Thr
4.867ThrAla: 4.867 ± 0.633
0.811ThrCys: 0.811 ± 0.283
4.236ThrAsp: 4.236 ± 0.693
3.785ThrGlu: 3.785 ± 0.676
2.073ThrPhe: 2.073 ± 0.529
3.965ThrGly: 3.965 ± 0.547
1.081ThrHis: 1.081 ± 0.304
2.974ThrIle: 2.974 ± 0.447
3.425ThrLys: 3.425 ± 0.45
4.867ThrLeu: 4.867 ± 0.557
1.983ThrMet: 1.983 ± 0.517
3.154ThrAsn: 3.154 ± 0.601
3.515ThrPro: 3.515 ± 0.455
2.614ThrGln: 2.614 ± 0.464
3.244ThrArg: 3.244 ± 0.533
2.884ThrSer: 2.884 ± 0.479
3.605ThrThr: 3.605 ± 0.783
3.785ThrVal: 3.785 ± 0.711
0.631ThrTrp: 0.631 ± 0.204
2.704ThrTyr: 2.704 ± 0.434
0.0ThrXaa: 0.0 ± 0.0
Val
4.236ValAla: 4.236 ± 0.664
0.631ValCys: 0.631 ± 0.21
5.588ValAsp: 5.588 ± 0.509
5.678ValGlu: 5.678 ± 0.765
1.532ValPhe: 1.532 ± 0.335
4.867ValGly: 4.867 ± 0.794
0.991ValHis: 0.991 ± 0.319
5.047ValIle: 5.047 ± 0.502
3.695ValLys: 3.695 ± 0.523
3.605ValLeu: 3.605 ± 0.644
1.081ValMet: 1.081 ± 0.288
4.326ValAsn: 4.326 ± 0.504
1.532ValPro: 1.532 ± 0.329
2.614ValGln: 2.614 ± 0.646
1.983ValArg: 1.983 ± 0.336
3.785ValSer: 3.785 ± 0.61
4.506ValThr: 4.506 ± 0.835
3.965ValVal: 3.965 ± 0.539
0.991ValTrp: 0.991 ± 0.353
2.253ValTyr: 2.253 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
0.721TrpAla: 0.721 ± 0.243
0.451TrpCys: 0.451 ± 0.247
1.442TrpAsp: 1.442 ± 0.369
0.811TrpGlu: 0.811 ± 0.207
1.081TrpPhe: 1.081 ± 0.317
0.811TrpGly: 0.811 ± 0.277
0.631TrpHis: 0.631 ± 0.221
0.901TrpIle: 0.901 ± 0.271
1.622TrpLys: 1.622 ± 0.399
1.172TrpLeu: 1.172 ± 0.329
0.18TrpMet: 0.18 ± 0.136
0.721TrpAsn: 0.721 ± 0.24
0.27TrpPro: 0.27 ± 0.145
0.721TrpGln: 0.721 ± 0.195
0.451TrpArg: 0.451 ± 0.173
0.631TrpSer: 0.631 ± 0.217
0.721TrpThr: 0.721 ± 0.268
1.172TrpVal: 1.172 ± 0.377
0.27TrpTrp: 0.27 ± 0.164
0.451TrpTyr: 0.451 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.163TyrAla: 2.163 ± 0.48
0.721TyrCys: 0.721 ± 0.241
3.244TyrAsp: 3.244 ± 0.624
2.163TyrGlu: 2.163 ± 0.507
1.262TyrPhe: 1.262 ± 0.314
1.802TyrGly: 1.802 ± 0.305
0.901TyrHis: 0.901 ± 0.253
1.712TyrIle: 1.712 ± 0.355
1.893TyrLys: 1.893 ± 0.37
1.442TyrLeu: 1.442 ± 0.284
0.721TyrMet: 0.721 ± 0.262
2.253TyrAsn: 2.253 ± 0.402
1.893TyrPro: 1.893 ± 0.357
1.352TyrGln: 1.352 ± 0.308
2.253TyrArg: 2.253 ± 0.533
3.335TyrSer: 3.335 ± 0.525
2.704TyrThr: 2.704 ± 0.428
2.433TyrVal: 2.433 ± 0.4
0.36TyrTrp: 0.36 ± 0.161
1.802TyrTyr: 1.802 ± 0.506
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (11097 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski