Amino acid dipepetide frequency for Streptococcus phage P5652

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.716AlaAla: 2.716 ± 0.889
0.194AlaCys: 0.194 ± 0.149
4.073AlaAsp: 4.073 ± 0.646
3.297AlaGlu: 3.297 ± 0.494
1.843AlaPhe: 1.843 ± 0.451
4.17AlaGly: 4.17 ± 0.722
0.97AlaHis: 0.97 ± 0.336
4.849AlaIle: 4.849 ± 0.816
6.11AlaLys: 6.11 ± 0.959
6.11AlaLeu: 6.11 ± 0.654
1.552AlaMet: 1.552 ± 0.364
5.043AlaAsn: 5.043 ± 0.836
1.552AlaPro: 1.552 ± 0.328
2.813AlaGln: 2.813 ± 0.488
2.716AlaArg: 2.716 ± 0.447
4.558AlaSer: 4.558 ± 0.701
4.655AlaThr: 4.655 ± 0.789
3.588AlaVal: 3.588 ± 0.663
1.164AlaTrp: 1.164 ± 0.28
2.328AlaTyr: 2.328 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
0.097CysAla: 0.097 ± 0.085
0.0CysCys: 0.0 ± 0.0
0.776CysAsp: 0.776 ± 0.287
0.194CysGlu: 0.194 ± 0.126
0.485CysPhe: 0.485 ± 0.26
0.291CysGly: 0.291 ± 0.167
0.097CysHis: 0.097 ± 0.082
0.194CysIle: 0.194 ± 0.147
0.388CysLys: 0.388 ± 0.215
0.485CysLeu: 0.485 ± 0.294
0.097CysMet: 0.097 ± 0.105
0.388CysAsn: 0.388 ± 0.164
0.291CysPro: 0.291 ± 0.184
0.194CysGln: 0.194 ± 0.137
0.388CysArg: 0.388 ± 0.268
0.194CysSer: 0.194 ± 0.146
0.291CysThr: 0.291 ± 0.17
0.097CysVal: 0.097 ± 0.076
0.194CysTrp: 0.194 ± 0.155
0.194CysTyr: 0.194 ± 0.123
0.0CysXaa: 0.0 ± 0.0
Asp
3.879AspAla: 3.879 ± 0.573
0.485AspCys: 0.485 ± 0.206
4.073AspAsp: 4.073 ± 0.833
4.073AspGlu: 4.073 ± 0.593
3.394AspPhe: 3.394 ± 0.619
6.886AspGly: 6.886 ± 1.24
1.261AspHis: 1.261 ± 0.347
5.043AspIle: 5.043 ± 0.902
5.237AspLys: 5.237 ± 0.615
3.588AspLeu: 3.588 ± 0.875
2.522AspMet: 2.522 ± 0.386
3.879AspAsn: 3.879 ± 0.713
2.328AspPro: 2.328 ± 0.479
1.358AspGln: 1.358 ± 0.299
2.134AspArg: 2.134 ± 0.493
3.976AspSer: 3.976 ± 0.616
4.946AspThr: 4.946 ± 0.642
3.782AspVal: 3.782 ± 0.827
1.261AspTrp: 1.261 ± 0.321
3.103AspTyr: 3.103 ± 0.53
0.0AspXaa: 0.0 ± 0.0
Glu
3.685GluAla: 3.685 ± 0.465
0.291GluCys: 0.291 ± 0.138
3.2GluAsp: 3.2 ± 0.674
4.364GluGlu: 4.364 ± 0.844
2.716GluPhe: 2.716 ± 0.6
3.103GluGly: 3.103 ± 0.412
1.067GluHis: 1.067 ± 0.323
5.14GluIle: 5.14 ± 0.892
3.297GluLys: 3.297 ± 0.609
6.692GluLeu: 6.692 ± 0.841
1.94GluMet: 1.94 ± 0.413
4.461GluAsn: 4.461 ± 0.731
1.94GluPro: 1.94 ± 0.636
2.619GluGln: 2.619 ± 0.549
3.2GluArg: 3.2 ± 0.573
3.297GluSer: 3.297 ± 0.416
3.2GluThr: 3.2 ± 0.506
4.461GluVal: 4.461 ± 0.556
0.97GluTrp: 0.97 ± 0.325
3.2GluTyr: 3.2 ± 0.633
0.0GluXaa: 0.0 ± 0.0
Phe
2.91PheAla: 2.91 ± 0.441
0.194PheCys: 0.194 ± 0.152
3.588PheAsp: 3.588 ± 0.564
2.231PheGlu: 2.231 ± 0.405
1.649PhePhe: 1.649 ± 0.273
3.394PheGly: 3.394 ± 0.636
0.485PheHis: 0.485 ± 0.149
2.619PheIle: 2.619 ± 0.525
3.879PheLys: 3.879 ± 0.703
2.813PheLeu: 2.813 ± 0.538
0.194PheMet: 0.194 ± 0.137
3.297PheAsn: 3.297 ± 0.832
0.679PhePro: 0.679 ± 0.225
1.067PheGln: 1.067 ± 0.271
1.358PheArg: 1.358 ± 0.348
2.716PheSer: 2.716 ± 0.544
3.394PheThr: 3.394 ± 0.649
2.813PheVal: 2.813 ± 0.467
0.582PheTrp: 0.582 ± 0.209
2.425PheTyr: 2.425 ± 0.468
0.0PheXaa: 0.0 ± 0.0
Gly
3.491GlyAla: 3.491 ± 0.701
0.291GlyCys: 0.291 ± 0.177
4.558GlyAsp: 4.558 ± 0.535
3.782GlyGlu: 3.782 ± 0.65
3.588GlyPhe: 3.588 ± 0.459
4.752GlyGly: 4.752 ± 0.893
0.97GlyHis: 0.97 ± 0.279
5.237GlyIle: 5.237 ± 0.842
6.692GlyLys: 6.692 ± 0.893
5.625GlyLeu: 5.625 ± 0.764
1.455GlyMet: 1.455 ± 0.397
4.461GlyAsn: 4.461 ± 0.674
0.582GlyPro: 0.582 ± 0.223
2.813GlyGln: 2.813 ± 0.648
3.103GlyArg: 3.103 ± 0.501
4.849GlySer: 4.849 ± 0.873
4.558GlyThr: 4.558 ± 0.904
3.976GlyVal: 3.976 ± 0.726
1.261GlyTrp: 1.261 ± 0.308
2.813GlyTyr: 2.813 ± 0.607
0.0GlyXaa: 0.0 ± 0.0
His
0.388HisAla: 0.388 ± 0.22
0.097HisCys: 0.097 ± 0.105
0.873HisAsp: 0.873 ± 0.214
0.582HisGlu: 0.582 ± 0.231
0.679HisPhe: 0.679 ± 0.275
1.067HisGly: 1.067 ± 0.307
0.582HisHis: 0.582 ± 0.178
1.067HisIle: 1.067 ± 0.316
1.067HisLys: 1.067 ± 0.306
1.261HisLeu: 1.261 ± 0.298
0.388HisMet: 0.388 ± 0.175
0.679HisAsn: 0.679 ± 0.314
0.582HisPro: 0.582 ± 0.23
0.582HisGln: 0.582 ± 0.244
0.873HisArg: 0.873 ± 0.273
0.873HisSer: 0.873 ± 0.264
0.873HisThr: 0.873 ± 0.293
1.552HisVal: 1.552 ± 0.289
0.097HisTrp: 0.097 ± 0.113
0.97HisTyr: 0.97 ± 0.312
0.0HisXaa: 0.0 ± 0.0
Ile
5.334IleAla: 5.334 ± 0.788
0.485IleCys: 0.485 ± 0.24
5.722IleAsp: 5.722 ± 0.672
3.879IleGlu: 3.879 ± 0.759
1.455IlePhe: 1.455 ± 0.377
4.073IleGly: 4.073 ± 0.489
0.485IleHis: 0.485 ± 0.221
3.297IleIle: 3.297 ± 0.785
6.789IleLys: 6.789 ± 0.682
4.461IleLeu: 4.461 ± 0.765
2.037IleMet: 2.037 ± 0.548
3.976IleAsn: 3.976 ± 0.541
2.813IlePro: 2.813 ± 0.483
2.425IleGln: 2.425 ± 0.426
3.103IleArg: 3.103 ± 0.483
4.17IleSer: 4.17 ± 0.515
4.17IleThr: 4.17 ± 0.643
3.491IleVal: 3.491 ± 0.592
0.97IleTrp: 0.97 ± 0.345
2.425IleTyr: 2.425 ± 0.507
0.0IleXaa: 0.0 ± 0.0
Lys
4.946LysAla: 4.946 ± 0.55
0.388LysCys: 0.388 ± 0.215
4.267LysAsp: 4.267 ± 0.824
6.207LysGlu: 6.207 ± 0.759
3.782LysPhe: 3.782 ± 0.865
5.722LysGly: 5.722 ± 0.751
1.455LysHis: 1.455 ± 0.577
4.655LysIle: 4.655 ± 0.676
6.304LysLys: 6.304 ± 0.983
6.401LysLeu: 6.401 ± 0.908
1.94LysMet: 1.94 ± 0.519
4.946LysAsn: 4.946 ± 0.587
3.491LysPro: 3.491 ± 0.36
3.588LysGln: 3.588 ± 0.536
3.394LysArg: 3.394 ± 0.516
4.558LysSer: 4.558 ± 0.557
5.625LysThr: 5.625 ± 0.773
4.364LysVal: 4.364 ± 0.716
1.261LysTrp: 1.261 ± 0.262
3.006LysTyr: 3.006 ± 0.656
0.0LysXaa: 0.0 ± 0.0
Leu
6.498LeuAla: 6.498 ± 0.537
0.679LeuCys: 0.679 ± 0.306
6.11LeuAsp: 6.11 ± 0.755
6.498LeuGlu: 6.498 ± 0.932
3.006LeuPhe: 3.006 ± 0.321
5.722LeuGly: 5.722 ± 0.986
0.97LeuHis: 0.97 ± 0.342
3.879LeuIle: 3.879 ± 0.52
6.789LeuLys: 6.789 ± 0.692
4.849LeuLeu: 4.849 ± 0.754
2.619LeuMet: 2.619 ± 0.476
5.625LeuAsn: 5.625 ± 0.752
2.813LeuPro: 2.813 ± 0.429
2.716LeuGln: 2.716 ± 0.519
3.782LeuArg: 3.782 ± 0.823
4.364LeuSer: 4.364 ± 0.823
6.11LeuThr: 6.11 ± 0.973
3.782LeuVal: 3.782 ± 0.502
0.776LeuTrp: 0.776 ± 0.243
2.134LeuTyr: 2.134 ± 0.556
0.0LeuXaa: 0.0 ± 0.0
Met
2.037MetAla: 2.037 ± 0.364
0.097MetCys: 0.097 ± 0.094
1.067MetAsp: 1.067 ± 0.297
1.261MetGlu: 1.261 ± 0.415
1.455MetPhe: 1.455 ± 0.302
0.582MetGly: 0.582 ± 0.232
0.291MetHis: 0.291 ± 0.159
1.649MetIle: 1.649 ± 0.34
3.006MetLys: 3.006 ± 0.59
2.037MetLeu: 2.037 ± 0.346
0.582MetMet: 0.582 ± 0.274
0.97MetAsn: 0.97 ± 0.274
1.067MetPro: 1.067 ± 0.231
0.679MetGln: 0.679 ± 0.19
0.873MetArg: 0.873 ± 0.234
1.649MetSer: 1.649 ± 0.409
2.037MetThr: 2.037 ± 0.365
2.134MetVal: 2.134 ± 0.508
0.194MetTrp: 0.194 ± 0.153
0.679MetTyr: 0.679 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
4.946AsnAla: 4.946 ± 1.05
0.194AsnCys: 0.194 ± 0.14
3.976AsnAsp: 3.976 ± 0.478
3.782AsnGlu: 3.782 ± 0.776
2.619AsnPhe: 2.619 ± 0.526
7.177AsnGly: 7.177 ± 1.212
1.164AsnHis: 1.164 ± 0.304
4.073AsnIle: 4.073 ± 0.623
3.394AsnLys: 3.394 ± 0.486
5.431AsnLeu: 5.431 ± 0.653
1.261AsnMet: 1.261 ± 0.327
5.14AsnAsn: 5.14 ± 0.935
3.006AsnPro: 3.006 ± 0.527
2.522AsnGln: 2.522 ± 0.448
2.425AsnArg: 2.425 ± 0.564
3.491AsnSer: 3.491 ± 0.543
3.394AsnThr: 3.394 ± 0.569
3.394AsnVal: 3.394 ± 0.433
1.843AsnTrp: 1.843 ± 0.31
1.746AsnTyr: 1.746 ± 0.408
0.0AsnXaa: 0.0 ± 0.0
Pro
1.843ProAla: 1.843 ± 0.42
0.0ProCys: 0.0 ± 0.0
1.843ProAsp: 1.843 ± 0.399
2.522ProGlu: 2.522 ± 0.519
1.552ProPhe: 1.552 ± 0.367
0.873ProGly: 0.873 ± 0.27
0.485ProHis: 0.485 ± 0.198
1.843ProIle: 1.843 ± 0.329
3.394ProLys: 3.394 ± 0.587
2.425ProLeu: 2.425 ± 0.524
0.291ProMet: 0.291 ± 0.169
2.716ProAsn: 2.716 ± 0.45
0.679ProPro: 0.679 ± 0.31
1.261ProGln: 1.261 ± 0.355
0.776ProArg: 0.776 ± 0.219
2.522ProSer: 2.522 ± 0.385
2.328ProThr: 2.328 ± 0.455
1.649ProVal: 1.649 ± 0.391
0.582ProTrp: 0.582 ± 0.195
0.776ProTyr: 0.776 ± 0.261
0.0ProXaa: 0.0 ± 0.0
Gln
3.297GlnAla: 3.297 ± 0.598
0.194GlnCys: 0.194 ± 0.13
1.746GlnAsp: 1.746 ± 0.388
2.716GlnGlu: 2.716 ± 0.567
1.261GlnPhe: 1.261 ± 0.43
3.2GlnGly: 3.2 ± 0.741
0.388GlnHis: 0.388 ± 0.199
2.619GlnIle: 2.619 ± 0.471
3.103GlnLys: 3.103 ± 0.569
3.491GlnLeu: 3.491 ± 0.463
1.261GlnMet: 1.261 ± 0.356
2.716GlnAsn: 2.716 ± 0.432
0.291GlnPro: 0.291 ± 0.141
2.91GlnGln: 2.91 ± 0.567
1.649GlnArg: 1.649 ± 0.332
2.619GlnSer: 2.619 ± 0.407
2.328GlnThr: 2.328 ± 0.49
1.94GlnVal: 1.94 ± 0.487
0.485GlnTrp: 0.485 ± 0.178
2.037GlnTyr: 2.037 ± 0.491
0.0GlnXaa: 0.0 ± 0.0
Arg
2.231ArgAla: 2.231 ± 0.406
0.194ArgCys: 0.194 ± 0.144
3.006ArgAsp: 3.006 ± 0.526
2.328ArgGlu: 2.328 ± 0.47
2.425ArgPhe: 2.425 ± 0.497
2.522ArgGly: 2.522 ± 0.644
0.97ArgHis: 0.97 ± 0.328
3.103ArgIle: 3.103 ± 0.689
2.522ArgLys: 2.522 ± 0.508
3.394ArgLeu: 3.394 ± 0.58
1.358ArgMet: 1.358 ± 0.391
2.813ArgAsn: 2.813 ± 0.345
1.164ArgPro: 1.164 ± 0.299
2.231ArgGln: 2.231 ± 0.361
1.358ArgArg: 1.358 ± 0.401
1.94ArgSer: 1.94 ± 0.39
2.619ArgThr: 2.619 ± 0.712
3.2ArgVal: 3.2 ± 0.646
0.873ArgTrp: 0.873 ± 0.234
2.328ArgTyr: 2.328 ± 0.492
0.0ArgXaa: 0.0 ± 0.0
Ser
3.2SerAla: 3.2 ± 0.61
0.485SerCys: 0.485 ± 0.307
4.364SerAsp: 4.364 ± 0.707
3.2SerGlu: 3.2 ± 0.453
2.522SerPhe: 2.522 ± 0.44
4.849SerGly: 4.849 ± 0.653
0.388SerHis: 0.388 ± 0.221
4.558SerIle: 4.558 ± 0.491
4.267SerLys: 4.267 ± 0.671
4.849SerLeu: 4.849 ± 0.45
1.455SerMet: 1.455 ± 0.275
4.267SerAsn: 4.267 ± 0.594
2.037SerPro: 2.037 ± 0.432
2.91SerGln: 2.91 ± 0.584
3.103SerArg: 3.103 ± 0.71
3.491SerSer: 3.491 ± 0.606
3.782SerThr: 3.782 ± 0.656
5.237SerVal: 5.237 ± 0.715
0.679SerTrp: 0.679 ± 0.32
1.94SerTyr: 1.94 ± 0.424
0.0SerXaa: 0.0 ± 0.0
Thr
5.043ThrAla: 5.043 ± 0.715
0.291ThrCys: 0.291 ± 0.15
4.558ThrAsp: 4.558 ± 0.578
3.491ThrGlu: 3.491 ± 0.459
3.2ThrPhe: 3.2 ± 0.596
4.073ThrGly: 4.073 ± 0.492
1.552ThrHis: 1.552 ± 0.293
4.655ThrIle: 4.655 ± 0.735
5.14ThrLys: 5.14 ± 0.756
6.595ThrLeu: 6.595 ± 1.22
1.067ThrMet: 1.067 ± 0.251
3.782ThrAsn: 3.782 ± 0.47
1.649ThrPro: 1.649 ± 0.377
2.231ThrGln: 2.231 ± 0.554
2.037ThrArg: 2.037 ± 0.377
3.491ThrSer: 3.491 ± 0.459
3.782ThrThr: 3.782 ± 0.583
4.364ThrVal: 4.364 ± 0.46
1.164ThrTrp: 1.164 ± 0.403
3.685ThrTyr: 3.685 ± 0.585
0.0ThrXaa: 0.0 ± 0.0
Val
4.461ValAla: 4.461 ± 0.761
0.194ValCys: 0.194 ± 0.123
5.431ValAsp: 5.431 ± 0.568
4.461ValGlu: 4.461 ± 0.805
2.037ValPhe: 2.037 ± 0.406
4.267ValGly: 4.267 ± 0.609
0.388ValHis: 0.388 ± 0.176
4.364ValIle: 4.364 ± 0.593
5.237ValLys: 5.237 ± 0.646
3.879ValLeu: 3.879 ± 0.933
1.067ValMet: 1.067 ± 0.332
3.782ValAsn: 3.782 ± 0.74
1.746ValPro: 1.746 ± 0.426
1.843ValGln: 1.843 ± 0.347
2.425ValArg: 2.425 ± 0.538
4.752ValSer: 4.752 ± 0.768
4.946ValThr: 4.946 ± 0.8
3.976ValVal: 3.976 ± 0.667
1.164ValTrp: 1.164 ± 0.288
1.649ValTyr: 1.649 ± 0.402
0.0ValXaa: 0.0 ± 0.0
Trp
0.582TrpAla: 0.582 ± 0.19
0.097TrpCys: 0.097 ± 0.082
1.552TrpAsp: 1.552 ± 0.462
1.067TrpGlu: 1.067 ± 0.234
0.776TrpPhe: 0.776 ± 0.299
0.485TrpGly: 0.485 ± 0.171
0.388TrpHis: 0.388 ± 0.189
0.776TrpIle: 0.776 ± 0.271
0.873TrpLys: 0.873 ± 0.28
1.649TrpLeu: 1.649 ± 0.333
0.194TrpMet: 0.194 ± 0.134
0.485TrpAsn: 0.485 ± 0.204
0.194TrpPro: 0.194 ± 0.158
1.067TrpGln: 1.067 ± 0.22
0.873TrpArg: 0.873 ± 0.225
1.746TrpSer: 1.746 ± 0.567
1.164TrpThr: 1.164 ± 0.286
1.358TrpVal: 1.358 ± 0.301
0.194TrpTrp: 0.194 ± 0.12
0.388TrpTyr: 0.388 ± 0.265
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.716TyrAla: 2.716 ± 0.407
0.485TyrCys: 0.485 ± 0.287
2.425TyrAsp: 2.425 ± 0.512
2.91TyrGlu: 2.91 ± 0.428
1.843TyrPhe: 1.843 ± 0.365
1.649TyrGly: 1.649 ± 0.562
0.776TyrHis: 0.776 ± 0.235
1.94TyrIle: 1.94 ± 0.378
2.813TyrLys: 2.813 ± 0.49
3.782TyrLeu: 3.782 ± 0.593
0.97TyrMet: 0.97 ± 0.258
1.552TyrAsn: 1.552 ± 0.342
1.455TyrPro: 1.455 ± 0.396
2.425TyrGln: 2.425 ± 0.337
3.006TyrArg: 3.006 ± 0.582
2.328TyrSer: 2.328 ± 0.55
1.649TyrThr: 1.649 ± 0.416
2.91TyrVal: 2.91 ± 0.415
0.194TyrTrp: 0.194 ± 0.141
2.716TyrTyr: 2.716 ± 0.731
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (10312 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski