Amino acid dipepetide frequency for Vibrio phage Athena1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.809AlaAla: 5.809 ± 1.125
1.196AlaCys: 1.196 ± 0.353
4.784AlaAsp: 4.784 ± 0.533
5.467AlaGlu: 5.467 ± 0.676
2.477AlaPhe: 2.477 ± 0.472
4.698AlaGly: 4.698 ± 0.679
1.196AlaHis: 1.196 ± 0.259
4.955AlaIle: 4.955 ± 0.768
4.442AlaLys: 4.442 ± 0.757
6.578AlaLeu: 6.578 ± 0.753
1.623AlaMet: 1.623 ± 0.517
3.246AlaAsn: 3.246 ± 0.584
1.794AlaPro: 1.794 ± 0.408
2.563AlaGln: 2.563 ± 0.651
2.819AlaArg: 2.819 ± 0.658
5.467AlaSer: 5.467 ± 0.637
3.673AlaThr: 3.673 ± 0.74
4.955AlaVal: 4.955 ± 0.633
1.025AlaTrp: 1.025 ± 0.319
2.221AlaTyr: 2.221 ± 0.441
0.0AlaXaa: 0.0 ± 0.0
Cys
0.342CysAla: 0.342 ± 0.163
0.256CysCys: 0.256 ± 0.142
0.513CysAsp: 0.513 ± 0.245
1.367CysGlu: 1.367 ± 0.311
0.427CysPhe: 0.427 ± 0.199
0.598CysGly: 0.598 ± 0.206
0.171CysHis: 0.171 ± 0.124
0.427CysIle: 0.427 ± 0.153
0.683CysLys: 0.683 ± 0.235
0.683CysLeu: 0.683 ± 0.218
0.171CysMet: 0.171 ± 0.117
0.598CysAsn: 0.598 ± 0.294
0.171CysPro: 0.171 ± 0.123
0.342CysGln: 0.342 ± 0.176
1.025CysArg: 1.025 ± 0.273
1.025CysSer: 1.025 ± 0.412
0.427CysThr: 0.427 ± 0.222
0.342CysVal: 0.342 ± 0.167
0.256CysTrp: 0.256 ± 0.119
0.683CysTyr: 0.683 ± 0.284
0.0CysXaa: 0.0 ± 0.0
Asp
5.211AspAla: 5.211 ± 0.738
0.598AspCys: 0.598 ± 0.214
4.271AspAsp: 4.271 ± 0.535
5.211AspGlu: 5.211 ± 0.604
4.186AspPhe: 4.186 ± 0.452
4.869AspGly: 4.869 ± 0.617
1.111AspHis: 1.111 ± 0.285
3.417AspIle: 3.417 ± 0.551
3.844AspLys: 3.844 ± 0.487
4.869AspLeu: 4.869 ± 0.773
1.281AspMet: 1.281 ± 0.346
3.075AspAsn: 3.075 ± 0.619
2.221AspPro: 2.221 ± 0.474
2.392AspGln: 2.392 ± 0.409
1.794AspArg: 1.794 ± 0.321
4.784AspSer: 4.784 ± 0.62
3.332AspThr: 3.332 ± 0.62
3.588AspVal: 3.588 ± 0.611
1.879AspTrp: 1.879 ± 0.43
2.05AspTyr: 2.05 ± 0.47
0.0AspXaa: 0.0 ± 0.0
Glu
3.759GluAla: 3.759 ± 0.494
0.427GluCys: 0.427 ± 0.196
3.417GluAsp: 3.417 ± 0.563
5.296GluGlu: 5.296 ± 1.06
3.844GluPhe: 3.844 ± 0.853
3.332GluGly: 3.332 ± 0.589
1.111GluHis: 1.111 ± 0.317
4.955GluIle: 4.955 ± 0.624
6.065GluLys: 6.065 ± 1.342
7.774GluLeu: 7.774 ± 0.982
1.965GluMet: 1.965 ± 0.458
4.186GluAsn: 4.186 ± 0.535
2.136GluPro: 2.136 ± 0.595
3.844GluGln: 3.844 ± 0.561
3.759GluArg: 3.759 ± 0.668
6.492GluSer: 6.492 ± 0.819
3.844GluThr: 3.844 ± 0.586
4.271GluVal: 4.271 ± 0.666
1.025GluTrp: 1.025 ± 0.31
2.648GluTyr: 2.648 ± 0.501
0.0GluXaa: 0.0 ± 0.0
Phe
3.332PheAla: 3.332 ± 0.562
0.769PheCys: 0.769 ± 0.243
3.075PheAsp: 3.075 ± 0.572
3.246PheGlu: 3.246 ± 0.565
1.367PhePhe: 1.367 ± 0.366
3.588PheGly: 3.588 ± 0.624
0.683PheHis: 0.683 ± 0.243
2.99PheIle: 2.99 ± 0.64
2.904PheLys: 2.904 ± 0.524
2.392PheLeu: 2.392 ± 0.464
0.94PheMet: 0.94 ± 0.32
2.136PheAsn: 2.136 ± 0.44
1.111PhePro: 1.111 ± 0.409
1.025PheGln: 1.025 ± 0.278
1.538PheArg: 1.538 ± 0.359
4.528PheSer: 4.528 ± 0.7
2.307PheThr: 2.307 ± 0.369
2.477PheVal: 2.477 ± 0.532
0.513PheTrp: 0.513 ± 0.188
1.025PheTyr: 1.025 ± 0.269
0.0PheXaa: 0.0 ± 0.0
Gly
5.724GlyAla: 5.724 ± 1.284
0.171GlyCys: 0.171 ± 0.131
4.955GlyAsp: 4.955 ± 0.851
4.613GlyGlu: 4.613 ± 0.554
4.1GlyPhe: 4.1 ± 0.704
6.663GlyGly: 6.663 ± 1.148
0.683GlyHis: 0.683 ± 0.309
3.246GlyIle: 3.246 ± 0.592
5.382GlyLys: 5.382 ± 0.611
4.955GlyLeu: 4.955 ± 0.815
2.221GlyMet: 2.221 ± 0.437
2.563GlyAsn: 2.563 ± 0.449
0.427GlyPro: 0.427 ± 0.17
2.563GlyGln: 2.563 ± 0.462
3.332GlyArg: 3.332 ± 0.537
5.553GlySer: 5.553 ± 0.704
4.271GlyThr: 4.271 ± 0.761
6.065GlyVal: 6.065 ± 0.692
0.513GlyTrp: 0.513 ± 0.2
3.588GlyTyr: 3.588 ± 0.45
0.0GlyXaa: 0.0 ± 0.0
His
1.025HisAla: 1.025 ± 0.243
0.342HisCys: 0.342 ± 0.163
1.025HisAsp: 1.025 ± 0.35
0.854HisGlu: 0.854 ± 0.304
0.513HisPhe: 0.513 ± 0.19
1.281HisGly: 1.281 ± 0.401
0.171HisHis: 0.171 ± 0.119
1.111HisIle: 1.111 ± 0.351
1.196HisLys: 1.196 ± 0.32
1.879HisLeu: 1.879 ± 0.493
0.171HisMet: 0.171 ± 0.127
0.598HisAsn: 0.598 ± 0.262
0.598HisPro: 0.598 ± 0.227
0.342HisGln: 0.342 ± 0.164
0.513HisArg: 0.513 ± 0.229
0.683HisSer: 0.683 ± 0.225
1.281HisThr: 1.281 ± 0.284
1.196HisVal: 1.196 ± 0.268
0.256HisTrp: 0.256 ± 0.137
0.513HisTyr: 0.513 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
3.673IleAla: 3.673 ± 0.505
0.598IleCys: 0.598 ± 0.219
4.955IleAsp: 4.955 ± 0.674
4.698IleGlu: 4.698 ± 0.514
1.538IlePhe: 1.538 ± 0.449
4.442IleGly: 4.442 ± 0.804
0.598IleHis: 0.598 ± 0.212
2.734IleIle: 2.734 ± 0.36
4.955IleLys: 4.955 ± 0.697
4.357IleLeu: 4.357 ± 0.555
1.111IleMet: 1.111 ± 0.239
3.673IleAsn: 3.673 ± 0.46
3.075IlePro: 3.075 ± 0.537
2.819IleGln: 2.819 ± 0.556
2.563IleArg: 2.563 ± 0.44
4.015IleSer: 4.015 ± 0.665
4.271IleThr: 4.271 ± 0.49
2.819IleVal: 2.819 ± 0.443
0.683IleTrp: 0.683 ± 0.233
2.563IleTyr: 2.563 ± 0.463
0.0IleXaa: 0.0 ± 0.0
Lys
5.467LysAla: 5.467 ± 0.884
0.854LysCys: 0.854 ± 0.314
3.673LysAsp: 3.673 ± 0.643
5.467LysGlu: 5.467 ± 0.884
4.015LysPhe: 4.015 ± 0.637
3.844LysGly: 3.844 ± 0.556
1.709LysHis: 1.709 ± 0.444
4.357LysIle: 4.357 ± 0.618
4.869LysLys: 4.869 ± 0.918
5.638LysLeu: 5.638 ± 0.705
1.709LysMet: 1.709 ± 0.442
3.332LysAsn: 3.332 ± 0.437
3.417LysPro: 3.417 ± 0.533
2.392LysGln: 2.392 ± 0.472
2.734LysArg: 2.734 ± 0.498
5.638LysSer: 5.638 ± 0.772
3.673LysThr: 3.673 ± 0.51
3.93LysVal: 3.93 ± 0.626
0.769LysTrp: 0.769 ± 0.293
2.563LysTyr: 2.563 ± 0.531
0.0LysXaa: 0.0 ± 0.0
Leu
5.467LeuAla: 5.467 ± 0.58
0.598LeuCys: 0.598 ± 0.278
4.698LeuAsp: 4.698 ± 0.541
5.98LeuGlu: 5.98 ± 0.83
1.879LeuPhe: 1.879 ± 0.359
5.467LeuGly: 5.467 ± 0.724
1.025LeuHis: 1.025 ± 0.454
5.724LeuIle: 5.724 ± 0.617
5.98LeuLys: 5.98 ± 0.858
5.382LeuLeu: 5.382 ± 0.607
1.794LeuMet: 1.794 ± 0.366
5.126LeuAsn: 5.126 ± 0.601
2.563LeuPro: 2.563 ± 0.471
2.477LeuGln: 2.477 ± 0.471
3.93LeuArg: 3.93 ± 0.588
5.98LeuSer: 5.98 ± 0.679
5.382LeuThr: 5.382 ± 0.503
5.211LeuVal: 5.211 ± 0.54
1.111LeuTrp: 1.111 ± 0.36
2.136LeuTyr: 2.136 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
2.221MetAla: 2.221 ± 0.437
0.256MetCys: 0.256 ± 0.151
0.854MetAsp: 0.854 ± 0.287
1.196MetGlu: 1.196 ± 0.279
0.513MetPhe: 0.513 ± 0.192
1.025MetGly: 1.025 ± 0.326
0.427MetHis: 0.427 ± 0.166
1.452MetIle: 1.452 ± 0.354
1.879MetLys: 1.879 ± 0.432
1.623MetLeu: 1.623 ± 0.329
0.427MetMet: 0.427 ± 0.214
1.196MetAsn: 1.196 ± 0.332
0.769MetPro: 0.769 ± 0.246
0.854MetGln: 0.854 ± 0.282
1.281MetArg: 1.281 ± 0.28
2.904MetSer: 2.904 ± 0.494
1.196MetThr: 1.196 ± 0.336
1.281MetVal: 1.281 ± 0.34
0.256MetTrp: 0.256 ± 0.159
1.025MetTyr: 1.025 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
4.271AsnAla: 4.271 ± 0.792
0.256AsnCys: 0.256 ± 0.138
3.161AsnAsp: 3.161 ± 0.53
3.246AsnGlu: 3.246 ± 0.438
2.05AsnPhe: 2.05 ± 0.392
5.724AsnGly: 5.724 ± 0.871
1.196AsnHis: 1.196 ± 0.338
2.392AsnIle: 2.392 ± 0.296
2.477AsnLys: 2.477 ± 0.438
4.357AsnLeu: 4.357 ± 0.571
0.854AsnMet: 0.854 ± 0.261
3.246AsnAsn: 3.246 ± 0.517
2.477AsnPro: 2.477 ± 0.495
2.819AsnGln: 2.819 ± 0.477
2.136AsnArg: 2.136 ± 0.458
4.015AsnSer: 4.015 ± 0.53
2.648AsnThr: 2.648 ± 0.607
2.477AsnVal: 2.477 ± 0.513
0.598AsnTrp: 0.598 ± 0.219
1.196AsnTyr: 1.196 ± 0.331
0.0AsnXaa: 0.0 ± 0.0
Pro
2.307ProAla: 2.307 ± 0.477
0.342ProCys: 0.342 ± 0.185
2.563ProAsp: 2.563 ± 0.533
3.93ProGlu: 3.93 ± 0.932
1.538ProPhe: 1.538 ± 0.405
1.196ProGly: 1.196 ± 0.404
0.598ProHis: 0.598 ± 0.226
2.136ProIle: 2.136 ± 0.48
2.136ProLys: 2.136 ± 0.485
1.879ProLeu: 1.879 ± 0.408
0.683ProMet: 0.683 ± 0.305
2.136ProAsn: 2.136 ± 0.403
0.94ProPro: 0.94 ± 0.401
1.111ProGln: 1.111 ± 0.277
1.111ProArg: 1.111 ± 0.339
2.99ProSer: 2.99 ± 0.421
1.965ProThr: 1.965 ± 0.468
3.161ProVal: 3.161 ± 0.502
0.0ProTrp: 0.0 ± 0.0
1.281ProTyr: 1.281 ± 0.382
0.0ProXaa: 0.0 ± 0.0
Gln
3.161GlnAla: 3.161 ± 0.488
0.085GlnCys: 0.085 ± 0.084
1.452GlnAsp: 1.452 ± 0.454
3.246GlnGlu: 3.246 ± 0.581
1.196GlnPhe: 1.196 ± 0.227
2.477GlnGly: 2.477 ± 0.455
0.342GlnHis: 0.342 ± 0.17
2.99GlnIle: 2.99 ± 0.503
2.307GlnLys: 2.307 ± 0.528
3.075GlnLeu: 3.075 ± 0.518
0.769GlnMet: 0.769 ± 0.284
1.281GlnAsn: 1.281 ± 0.305
0.94GlnPro: 0.94 ± 0.253
2.392GlnGln: 2.392 ± 0.49
1.965GlnArg: 1.965 ± 0.42
3.075GlnSer: 3.075 ± 0.559
2.307GlnThr: 2.307 ± 0.413
2.307GlnVal: 2.307 ± 0.583
0.683GlnTrp: 0.683 ± 0.304
1.965GlnTyr: 1.965 ± 0.396
0.0GlnXaa: 0.0 ± 0.0
Arg
3.246ArgAla: 3.246 ± 0.539
0.94ArgCys: 0.94 ± 0.338
2.392ArgAsp: 2.392 ± 0.483
2.734ArgGlu: 2.734 ± 0.454
1.794ArgPhe: 1.794 ± 0.383
2.819ArgGly: 2.819 ± 0.531
0.769ArgHis: 0.769 ± 0.312
2.221ArgIle: 2.221 ± 0.43
2.819ArgLys: 2.819 ± 0.653
3.673ArgLeu: 3.673 ± 0.696
1.623ArgMet: 1.623 ± 0.429
2.477ArgAsn: 2.477 ± 0.423
1.196ArgPro: 1.196 ± 0.361
1.879ArgGln: 1.879 ± 0.406
2.392ArgArg: 2.392 ± 0.537
2.904ArgSer: 2.904 ± 0.704
1.879ArgThr: 1.879 ± 0.507
3.332ArgVal: 3.332 ± 0.666
0.342ArgTrp: 0.342 ± 0.197
1.794ArgTyr: 1.794 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
6.065SerAla: 6.065 ± 1.427
0.256SerCys: 0.256 ± 0.148
5.809SerAsp: 5.809 ± 0.612
6.236SerGlu: 6.236 ± 0.687
3.417SerPhe: 3.417 ± 0.498
6.834SerGly: 6.834 ± 0.955
1.025SerHis: 1.025 ± 0.256
5.126SerIle: 5.126 ± 0.597
5.724SerLys: 5.724 ± 0.69
5.382SerLeu: 5.382 ± 0.693
2.05SerMet: 2.05 ± 0.36
3.673SerAsn: 3.673 ± 0.593
3.417SerPro: 3.417 ± 0.643
2.563SerGln: 2.563 ± 0.441
2.904SerArg: 2.904 ± 0.56
6.834SerSer: 6.834 ± 0.89
3.93SerThr: 3.93 ± 0.559
6.749SerVal: 6.749 ± 0.787
0.598SerTrp: 0.598 ± 0.212
2.221SerTyr: 2.221 ± 0.376
0.0SerXaa: 0.0 ± 0.0
Thr
2.734ThrAla: 2.734 ± 0.594
0.598ThrCys: 0.598 ± 0.202
3.417ThrAsp: 3.417 ± 0.792
4.528ThrGlu: 4.528 ± 0.58
2.477ThrPhe: 2.477 ± 0.613
5.211ThrGly: 5.211 ± 0.82
0.94ThrHis: 0.94 ± 0.273
4.186ThrIle: 4.186 ± 0.624
4.186ThrLys: 4.186 ± 0.561
4.869ThrLeu: 4.869 ± 0.835
1.025ThrMet: 1.025 ± 0.269
3.588ThrAsn: 3.588 ± 0.442
3.332ThrPro: 3.332 ± 0.539
1.879ThrGln: 1.879 ± 0.359
1.709ThrArg: 1.709 ± 0.421
3.417ThrSer: 3.417 ± 0.554
3.588ThrThr: 3.588 ± 0.77
4.442ThrVal: 4.442 ± 0.873
0.513ThrTrp: 0.513 ± 0.201
1.538ThrTyr: 1.538 ± 0.458
0.0ThrXaa: 0.0 ± 0.0
Val
4.442ValAla: 4.442 ± 0.669
0.683ValCys: 0.683 ± 0.239
5.296ValAsp: 5.296 ± 0.664
3.417ValGlu: 3.417 ± 0.543
2.563ValPhe: 2.563 ± 0.49
4.1ValGly: 4.1 ± 0.678
0.854ValHis: 0.854 ± 0.25
4.1ValIle: 4.1 ± 0.491
4.357ValLys: 4.357 ± 0.574
5.382ValLeu: 5.382 ± 0.814
1.111ValMet: 1.111 ± 0.292
3.417ValAsn: 3.417 ± 0.581
1.623ValPro: 1.623 ± 0.383
2.221ValGln: 2.221 ± 0.464
2.734ValArg: 2.734 ± 0.441
7.261ValSer: 7.261 ± 0.708
4.528ValThr: 4.528 ± 0.892
4.613ValVal: 4.613 ± 0.853
0.769ValTrp: 0.769 ± 0.287
2.819ValTyr: 2.819 ± 0.442
0.0ValXaa: 0.0 ± 0.0
Trp
0.769TrpAla: 0.769 ± 0.259
0.256TrpCys: 0.256 ± 0.151
0.854TrpAsp: 0.854 ± 0.315
1.025TrpGlu: 1.025 ± 0.259
0.427TrpPhe: 0.427 ± 0.229
0.513TrpGly: 0.513 ± 0.217
0.342TrpHis: 0.342 ± 0.168
0.171TrpIle: 0.171 ± 0.119
1.538TrpLys: 1.538 ± 0.431
1.025TrpLeu: 1.025 ± 0.365
0.342TrpMet: 0.342 ± 0.149
0.769TrpAsn: 0.769 ± 0.328
0.769TrpPro: 0.769 ± 0.269
0.427TrpGln: 0.427 ± 0.192
0.598TrpArg: 0.598 ± 0.23
0.94TrpSer: 0.94 ± 0.302
0.854TrpThr: 0.854 ± 0.253
0.598TrpVal: 0.598 ± 0.208
0.0TrpTrp: 0.0 ± 0.0
0.342TrpTyr: 0.342 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.965TyrAla: 1.965 ± 0.32
1.025TyrCys: 1.025 ± 0.286
2.819TyrAsp: 2.819 ± 0.43
2.392TyrGlu: 2.392 ± 0.469
1.623TyrPhe: 1.623 ± 0.626
2.904TyrGly: 2.904 ± 0.594
0.598TyrHis: 0.598 ± 0.303
1.538TyrIle: 1.538 ± 0.342
2.307TyrLys: 2.307 ± 0.631
2.136TyrLeu: 2.136 ± 0.461
0.683TyrMet: 0.683 ± 0.282
1.281TyrAsn: 1.281 ± 0.287
1.025TyrPro: 1.025 ± 0.355
1.111TyrGln: 1.111 ± 0.304
2.392TyrArg: 2.392 ± 0.375
2.307TyrSer: 2.307 ± 0.381
2.819TyrThr: 2.819 ± 0.502
2.477TyrVal: 2.477 ± 0.461
0.769TyrTrp: 0.769 ± 0.257
1.709TyrTyr: 1.709 ± 0.406
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11707 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski