Amino acid dipepetide frequency for Streptococcus phage Javan263

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.174AlaAla: 4.174 ± 1.107
0.485AlaCys: 0.485 ± 0.164
4.271AlaAsp: 4.271 ± 0.547
5.532AlaGlu: 5.532 ± 0.557
2.329AlaPhe: 2.329 ± 0.588
3.785AlaGly: 3.785 ± 0.947
1.262AlaHis: 1.262 ± 0.287
5.532AlaIle: 5.532 ± 1.065
6.503AlaLys: 6.503 ± 0.712
5.338AlaLeu: 5.338 ± 0.592
1.941AlaMet: 1.941 ± 0.578
4.659AlaAsn: 4.659 ± 0.764
1.359AlaPro: 1.359 ± 0.34
2.621AlaGln: 2.621 ± 0.635
2.621AlaArg: 2.621 ± 0.517
4.853AlaSer: 4.853 ± 0.882
4.174AlaThr: 4.174 ± 0.714
5.144AlaVal: 5.144 ± 0.596
1.553AlaTrp: 1.553 ± 0.427
2.329AlaTyr: 2.329 ± 0.6
0.0AlaXaa: 0.0 ± 0.0
Cys
0.097CysAla: 0.097 ± 0.092
0.097CysCys: 0.097 ± 0.116
0.485CysAsp: 0.485 ± 0.235
0.388CysGlu: 0.388 ± 0.181
0.097CysPhe: 0.097 ± 0.116
0.874CysGly: 0.874 ± 0.354
0.291CysHis: 0.291 ± 0.218
0.097CysIle: 0.097 ± 0.096
0.388CysLys: 0.388 ± 0.172
0.388CysLeu: 0.388 ± 0.159
0.194CysMet: 0.194 ± 0.159
0.388CysAsn: 0.388 ± 0.213
0.194CysPro: 0.194 ± 0.135
0.679CysGln: 0.679 ± 0.306
0.388CysArg: 0.388 ± 0.172
0.485CysSer: 0.485 ± 0.178
0.291CysThr: 0.291 ± 0.141
0.194CysVal: 0.194 ± 0.113
0.0CysTrp: 0.0 ± 0.0
0.582CysTyr: 0.582 ± 0.245
0.0CysXaa: 0.0 ± 0.0
Asp
2.718AspAla: 2.718 ± 0.46
0.291AspCys: 0.291 ± 0.181
4.659AspAsp: 4.659 ± 0.705
4.853AspGlu: 4.853 ± 1.092
2.912AspPhe: 2.912 ± 0.506
5.726AspGly: 5.726 ± 0.709
1.068AspHis: 1.068 ± 0.37
3.397AspIle: 3.397 ± 0.523
4.174AspLys: 4.174 ± 0.523
4.853AspLeu: 4.853 ± 0.664
1.941AspMet: 1.941 ± 0.358
5.824AspAsn: 5.824 ± 0.617
1.359AspPro: 1.359 ± 0.382
0.874AspGln: 0.874 ± 0.264
2.524AspArg: 2.524 ± 0.492
5.241AspSer: 5.241 ± 0.813
3.494AspThr: 3.494 ± 0.479
3.882AspVal: 3.882 ± 0.601
0.971AspTrp: 0.971 ± 0.28
3.203AspTyr: 3.203 ± 0.542
0.0AspXaa: 0.0 ± 0.0
Glu
6.212GluAla: 6.212 ± 0.799
0.194GluCys: 0.194 ± 0.143
4.271GluAsp: 4.271 ± 0.763
3.591GluGlu: 3.591 ± 0.656
3.785GluPhe: 3.785 ± 0.823
3.203GluGly: 3.203 ± 0.396
0.776GluHis: 0.776 ± 0.256
6.115GluIle: 6.115 ± 0.877
6.115GluLys: 6.115 ± 0.852
7.474GluLeu: 7.474 ± 0.889
2.524GluMet: 2.524 ± 0.563
5.435GluAsn: 5.435 ± 0.928
2.329GluPro: 2.329 ± 0.558
3.009GluGln: 3.009 ± 0.738
2.718GluArg: 2.718 ± 0.479
2.426GluSer: 2.426 ± 0.551
4.853GluThr: 4.853 ± 0.686
3.979GluVal: 3.979 ± 0.677
1.165GluTrp: 1.165 ± 0.389
2.718GluTyr: 2.718 ± 0.461
0.0GluXaa: 0.0 ± 0.0
Phe
2.135PheAla: 2.135 ± 0.517
0.194PheCys: 0.194 ± 0.14
2.621PheAsp: 2.621 ± 0.483
3.3PheGlu: 3.3 ± 0.52
1.65PhePhe: 1.65 ± 0.407
2.232PheGly: 2.232 ± 0.396
0.679PheHis: 0.679 ± 0.253
2.524PheIle: 2.524 ± 0.555
3.009PheLys: 3.009 ± 0.508
2.038PheLeu: 2.038 ± 0.396
0.582PheMet: 0.582 ± 0.232
2.718PheAsn: 2.718 ± 0.411
0.776PhePro: 0.776 ± 0.239
1.262PheGln: 1.262 ± 0.374
1.553PheArg: 1.553 ± 0.398
2.718PheSer: 2.718 ± 0.904
2.135PheThr: 2.135 ± 0.522
3.397PheVal: 3.397 ± 0.509
0.485PheTrp: 0.485 ± 0.277
1.359PheTyr: 1.359 ± 0.446
0.0PheXaa: 0.0 ± 0.0
Gly
3.106GlyAla: 3.106 ± 0.613
0.388GlyCys: 0.388 ± 0.199
3.591GlyAsp: 3.591 ± 0.488
3.882GlyGlu: 3.882 ± 0.524
2.621GlyPhe: 2.621 ± 0.51
4.465GlyGly: 4.465 ± 0.82
1.165GlyHis: 1.165 ± 0.338
4.174GlyIle: 4.174 ± 0.747
5.144GlyLys: 5.144 ± 0.766
6.6GlyLeu: 6.6 ± 0.991
1.553GlyMet: 1.553 ± 0.482
3.3GlyAsn: 3.3 ± 0.411
0.582GlyPro: 0.582 ± 0.281
3.106GlyGln: 3.106 ± 0.577
1.747GlyArg: 1.747 ± 0.373
3.979GlySer: 3.979 ± 0.748
4.465GlyThr: 4.465 ± 0.74
3.494GlyVal: 3.494 ± 0.526
0.971GlyTrp: 0.971 ± 0.267
3.591GlyTyr: 3.591 ± 0.604
0.0GlyXaa: 0.0 ± 0.0
His
0.582HisAla: 0.582 ± 0.202
0.291HisCys: 0.291 ± 0.18
0.485HisAsp: 0.485 ± 0.211
0.971HisGlu: 0.971 ± 0.37
0.485HisPhe: 0.485 ± 0.197
1.262HisGly: 1.262 ± 0.302
0.194HisHis: 0.194 ± 0.123
1.844HisIle: 1.844 ± 0.367
1.068HisLys: 1.068 ± 0.281
0.971HisLeu: 0.971 ± 0.28
0.0HisMet: 0.0 ± 0.0
0.679HisAsn: 0.679 ± 0.227
0.679HisPro: 0.679 ± 0.334
0.776HisGln: 0.776 ± 0.247
0.388HisArg: 0.388 ± 0.222
1.262HisSer: 1.262 ± 0.317
0.776HisThr: 0.776 ± 0.312
0.679HisVal: 0.679 ± 0.235
0.388HisTrp: 0.388 ± 0.246
0.971HisTyr: 0.971 ± 0.311
0.0HisXaa: 0.0 ± 0.0
Ile
3.882IleAla: 3.882 ± 0.669
0.679IleCys: 0.679 ± 0.267
5.824IleAsp: 5.824 ± 0.717
6.697IleGlu: 6.697 ± 1.071
1.65IlePhe: 1.65 ± 0.327
3.397IleGly: 3.397 ± 0.708
0.776IleHis: 0.776 ± 0.261
4.465IleIle: 4.465 ± 0.629
5.921IleLys: 5.921 ± 0.661
5.047IleLeu: 5.047 ± 0.653
1.359IleMet: 1.359 ± 0.419
3.494IleAsn: 3.494 ± 0.42
2.426IlePro: 2.426 ± 0.463
3.106IleGln: 3.106 ± 0.766
1.941IleArg: 1.941 ± 0.377
4.756IleSer: 4.756 ± 0.608
4.465IleThr: 4.465 ± 0.839
3.882IleVal: 3.882 ± 0.548
0.388IleTrp: 0.388 ± 0.203
3.009IleTyr: 3.009 ± 0.505
0.0IleXaa: 0.0 ± 0.0
Lys
6.212LysAla: 6.212 ± 0.862
0.582LysCys: 0.582 ± 0.259
5.629LysAsp: 5.629 ± 0.513
7.474LysGlu: 7.474 ± 1.197
1.844LysPhe: 1.844 ± 0.565
3.979LysGly: 3.979 ± 0.719
0.971LysHis: 0.971 ± 0.322
5.338LysIle: 5.338 ± 0.763
5.824LysLys: 5.824 ± 0.959
6.697LysLeu: 6.697 ± 0.714
2.038LysMet: 2.038 ± 0.561
4.95LysAsn: 4.95 ± 0.637
2.135LysPro: 2.135 ± 0.48
3.203LysGln: 3.203 ± 0.629
3.688LysArg: 3.688 ± 0.681
5.241LysSer: 5.241 ± 0.858
5.921LysThr: 5.921 ± 0.906
5.532LysVal: 5.532 ± 0.786
0.776LysTrp: 0.776 ± 0.266
2.815LysTyr: 2.815 ± 0.493
0.0LysXaa: 0.0 ± 0.0
Leu
7.474LeuAla: 7.474 ± 0.804
0.194LeuCys: 0.194 ± 0.138
4.853LeuAsp: 4.853 ± 0.684
6.697LeuGlu: 6.697 ± 0.854
2.815LeuPhe: 2.815 ± 0.538
4.853LeuGly: 4.853 ± 0.798
1.262LeuHis: 1.262 ± 0.33
3.397LeuIle: 3.397 ± 0.674
8.929LeuLys: 8.929 ± 0.9
7.085LeuLeu: 7.085 ± 0.869
1.553LeuMet: 1.553 ± 0.309
6.115LeuAsn: 6.115 ± 0.801
2.329LeuPro: 2.329 ± 0.384
3.494LeuGln: 3.494 ± 0.721
2.426LeuArg: 2.426 ± 0.635
5.144LeuSer: 5.144 ± 0.607
5.241LeuThr: 5.241 ± 0.676
4.756LeuVal: 4.756 ± 0.502
0.776LeuTrp: 0.776 ± 0.276
2.038LeuTyr: 2.038 ± 0.439
0.0LeuXaa: 0.0 ± 0.0
Met
1.941MetAla: 1.941 ± 0.579
0.097MetCys: 0.097 ± 0.092
1.359MetAsp: 1.359 ± 0.386
0.679MetGlu: 0.679 ± 0.298
1.068MetPhe: 1.068 ± 0.348
1.068MetGly: 1.068 ± 0.391
0.291MetHis: 0.291 ± 0.216
1.262MetIle: 1.262 ± 0.382
1.844MetLys: 1.844 ± 0.416
1.165MetLeu: 1.165 ± 0.305
0.291MetMet: 0.291 ± 0.161
2.232MetAsn: 2.232 ± 0.425
0.679MetPro: 0.679 ± 0.294
0.971MetGln: 0.971 ± 0.332
0.776MetArg: 0.776 ± 0.287
2.524MetSer: 2.524 ± 0.649
2.621MetThr: 2.621 ± 0.499
0.971MetVal: 0.971 ± 0.267
0.291MetTrp: 0.291 ± 0.131
0.582MetTyr: 0.582 ± 0.298
0.0MetXaa: 0.0 ± 0.0
Asn
4.659AsnAla: 4.659 ± 0.67
0.388AsnCys: 0.388 ± 0.187
3.785AsnAsp: 3.785 ± 0.591
3.494AsnGlu: 3.494 ± 0.521
2.426AsnPhe: 2.426 ± 0.447
4.95AsnGly: 4.95 ± 0.678
0.874AsnHis: 0.874 ± 0.244
3.785AsnIle: 3.785 ± 0.603
3.882AsnLys: 3.882 ± 0.74
4.853AsnLeu: 4.853 ± 0.686
1.262AsnMet: 1.262 ± 0.304
3.494AsnAsn: 3.494 ± 0.531
2.426AsnPro: 2.426 ± 0.669
3.009AsnGln: 3.009 ± 0.549
2.815AsnArg: 2.815 ± 0.592
5.144AsnSer: 5.144 ± 0.971
3.203AsnThr: 3.203 ± 0.643
4.95AsnVal: 4.95 ± 0.593
1.553AsnTrp: 1.553 ± 0.449
3.106AsnTyr: 3.106 ± 0.476
0.0AsnXaa: 0.0 ± 0.0
Pro
2.135ProAla: 2.135 ± 0.519
0.097ProCys: 0.097 ± 0.081
1.844ProAsp: 1.844 ± 0.428
2.232ProGlu: 2.232 ± 0.626
1.262ProPhe: 1.262 ± 0.449
0.971ProGly: 0.971 ± 0.254
0.388ProHis: 0.388 ± 0.258
2.329ProIle: 2.329 ± 0.61
3.009ProLys: 3.009 ± 0.691
1.941ProLeu: 1.941 ± 0.367
0.582ProMet: 0.582 ± 0.27
1.553ProAsn: 1.553 ± 0.566
0.874ProPro: 0.874 ± 0.256
0.971ProGln: 0.971 ± 0.284
0.874ProArg: 0.874 ± 0.275
1.844ProSer: 1.844 ± 0.399
1.941ProThr: 1.941 ± 0.505
2.621ProVal: 2.621 ± 0.664
0.194ProTrp: 0.194 ± 0.125
0.582ProTyr: 0.582 ± 0.216
0.0ProXaa: 0.0 ± 0.0
Gln
3.882GlnAla: 3.882 ± 0.575
0.291GlnCys: 0.291 ± 0.166
2.038GlnAsp: 2.038 ± 0.499
3.3GlnGlu: 3.3 ± 0.59
1.262GlnPhe: 1.262 ± 0.355
1.941GlnGly: 1.941 ± 0.432
0.485GlnHis: 0.485 ± 0.198
2.718GlnIle: 2.718 ± 0.419
3.203GlnLys: 3.203 ± 0.728
3.203GlnLeu: 3.203 ± 0.453
1.068GlnMet: 1.068 ± 0.36
2.621GlnAsn: 2.621 ± 0.656
1.553GlnPro: 1.553 ± 0.334
1.456GlnGln: 1.456 ± 0.378
1.553GlnArg: 1.553 ± 0.407
3.203GlnSer: 3.203 ± 0.902
3.009GlnThr: 3.009 ± 0.693
1.65GlnVal: 1.65 ± 0.426
0.291GlnTrp: 0.291 ± 0.156
1.747GlnTyr: 1.747 ± 0.53
0.0GlnXaa: 0.0 ± 0.0
Arg
2.329ArgAla: 2.329 ± 0.578
0.388ArgCys: 0.388 ± 0.211
2.524ArgAsp: 2.524 ± 0.437
2.426ArgGlu: 2.426 ± 0.523
1.553ArgPhe: 1.553 ± 0.32
1.165ArgGly: 1.165 ± 0.343
0.679ArgHis: 0.679 ± 0.308
2.718ArgIle: 2.718 ± 0.51
4.076ArgLys: 4.076 ± 0.931
3.203ArgLeu: 3.203 ± 0.553
0.485ArgMet: 0.485 ± 0.225
2.135ArgAsn: 2.135 ± 0.476
1.456ArgPro: 1.456 ± 0.372
1.165ArgGln: 1.165 ± 0.272
1.262ArgArg: 1.262 ± 0.491
1.456ArgSer: 1.456 ± 0.392
2.135ArgThr: 2.135 ± 0.506
2.524ArgVal: 2.524 ± 0.45
0.485ArgTrp: 0.485 ± 0.226
1.941ArgTyr: 1.941 ± 0.386
0.0ArgXaa: 0.0 ± 0.0
Ser
4.853SerAla: 4.853 ± 1.244
0.485SerCys: 0.485 ± 0.201
4.756SerAsp: 4.756 ± 0.706
4.465SerGlu: 4.465 ± 0.623
3.591SerPhe: 3.591 ± 0.639
5.726SerGly: 5.726 ± 1.008
1.359SerHis: 1.359 ± 0.352
4.368SerIle: 4.368 ± 0.78
5.338SerLys: 5.338 ± 0.868
6.309SerLeu: 6.309 ± 1.002
1.65SerMet: 1.65 ± 0.521
4.368SerAsn: 4.368 ± 0.947
1.456SerPro: 1.456 ± 0.338
2.815SerGln: 2.815 ± 0.466
2.329SerArg: 2.329 ± 0.54
4.174SerSer: 4.174 ± 1.1
3.591SerThr: 3.591 ± 0.846
4.562SerVal: 4.562 ± 0.745
0.679SerTrp: 0.679 ± 0.212
2.038SerTyr: 2.038 ± 0.327
0.0SerXaa: 0.0 ± 0.0
Thr
6.212ThrAla: 6.212 ± 2.092
0.485ThrCys: 0.485 ± 0.197
3.688ThrAsp: 3.688 ± 0.609
5.338ThrGlu: 5.338 ± 0.635
2.232ThrPhe: 2.232 ± 0.57
4.368ThrGly: 4.368 ± 0.68
0.971ThrHis: 0.971 ± 0.261
5.338ThrIle: 5.338 ± 0.624
3.785ThrLys: 3.785 ± 1.018
5.047ThrLeu: 5.047 ± 0.763
0.874ThrMet: 0.874 ± 0.229
3.009ThrAsn: 3.009 ± 0.697
1.844ThrPro: 1.844 ± 0.404
3.009ThrGln: 3.009 ± 0.542
1.359ThrArg: 1.359 ± 0.423
4.076ThrSer: 4.076 ± 0.807
5.338ThrThr: 5.338 ± 1.209
6.406ThrVal: 6.406 ± 0.816
1.068ThrTrp: 1.068 ± 0.412
1.747ThrTyr: 1.747 ± 0.332
0.0ThrXaa: 0.0 ± 0.0
Val
4.368ValAla: 4.368 ± 0.568
0.291ValCys: 0.291 ± 0.186
3.882ValAsp: 3.882 ± 0.737
3.785ValGlu: 3.785 ± 0.657
2.038ValPhe: 2.038 ± 0.356
4.174ValGly: 4.174 ± 0.487
0.485ValHis: 0.485 ± 0.247
4.271ValIle: 4.271 ± 0.781
6.018ValLys: 6.018 ± 0.762
4.659ValLeu: 4.659 ± 0.701
0.874ValMet: 0.874 ± 0.262
3.494ValAsn: 3.494 ± 0.584
2.232ValPro: 2.232 ± 0.466
2.912ValGln: 2.912 ± 0.824
2.718ValArg: 2.718 ± 0.496
6.406ValSer: 6.406 ± 0.636
5.435ValThr: 5.435 ± 0.914
3.3ValVal: 3.3 ± 0.486
0.485ValTrp: 0.485 ± 0.195
1.747ValTyr: 1.747 ± 0.461
0.0ValXaa: 0.0 ± 0.0
Trp
0.874TrpAla: 0.874 ± 0.254
0.291TrpCys: 0.291 ± 0.138
0.097TrpAsp: 0.097 ± 0.111
0.874TrpGlu: 0.874 ± 0.246
0.485TrpPhe: 0.485 ± 0.227
1.068TrpGly: 1.068 ± 0.247
0.194TrpHis: 0.194 ± 0.129
0.971TrpIle: 0.971 ± 0.273
1.068TrpLys: 1.068 ± 0.308
1.456TrpLeu: 1.456 ± 0.301
0.291TrpMet: 0.291 ± 0.206
1.068TrpAsn: 1.068 ± 0.268
0.097TrpPro: 0.097 ± 0.092
0.582TrpGln: 0.582 ± 0.252
0.582TrpArg: 0.582 ± 0.317
0.971TrpSer: 0.971 ± 0.267
1.165TrpThr: 1.165 ± 0.375
0.582TrpVal: 0.582 ± 0.19
0.097TrpTrp: 0.097 ± 0.087
0.582TrpTyr: 0.582 ± 0.24
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.621TyrAla: 2.621 ± 0.558
0.388TyrCys: 0.388 ± 0.187
3.3TyrAsp: 3.3 ± 0.55
2.912TyrGlu: 2.912 ± 0.586
1.359TyrPhe: 1.359 ± 0.458
2.718TyrGly: 2.718 ± 0.623
0.582TyrHis: 0.582 ± 0.189
2.524TyrIle: 2.524 ± 0.562
1.553TyrLys: 1.553 ± 0.421
3.106TyrLeu: 3.106 ± 0.614
1.553TyrMet: 1.553 ± 0.385
2.524TyrAsn: 2.524 ± 0.441
1.456TyrPro: 1.456 ± 0.452
1.456TyrGln: 1.456 ± 0.32
1.844TyrArg: 1.844 ± 0.44
3.009TyrSer: 3.009 ± 0.468
1.941TyrThr: 1.941 ± 0.495
1.068TyrVal: 1.068 ± 0.262
0.776TyrTrp: 0.776 ± 0.289
2.232TyrTyr: 2.232 ± 0.512
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10304 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski