Amino acid dipepetide frequency for Streptococcus satellite phage Javan395

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.276AlaAla: 0.276 ± 0.271
1.38AlaCys: 1.38 ± 0.598
2.761AlaAsp: 2.761 ± 0.936
3.589AlaGlu: 3.589 ± 1.286
2.485AlaPhe: 2.485 ± 0.639
1.104AlaGly: 1.104 ± 0.636
0.552AlaHis: 0.552 ± 0.405
4.97AlaIle: 4.97 ± 0.896
4.97AlaLys: 4.97 ± 1.561
3.865AlaLeu: 3.865 ± 0.961
1.657AlaMet: 1.657 ± 0.539
1.933AlaAsn: 1.933 ± 1.111
0.0AlaPro: 0.0 ± 0.0
0.828AlaGln: 0.828 ± 0.515
2.761AlaArg: 2.761 ± 1.089
3.313AlaSer: 3.313 ± 1.014
2.485AlaThr: 2.485 ± 0.814
3.037AlaVal: 3.037 ± 0.763
0.276AlaTrp: 0.276 ± 0.208
2.209AlaTyr: 2.209 ± 0.664
0.0AlaXaa: 0.0 ± 0.0
Cys
1.104CysAla: 1.104 ± 0.555
0.0CysCys: 0.0 ± 0.0
0.552CysAsp: 0.552 ± 0.409
0.276CysGlu: 0.276 ± 0.295
0.552CysPhe: 0.552 ± 0.374
0.552CysGly: 0.552 ± 0.364
0.276CysHis: 0.276 ± 0.256
0.828CysIle: 0.828 ± 0.506
0.552CysLys: 0.552 ± 0.378
0.828CysLeu: 0.828 ± 0.677
0.276CysMet: 0.276 ± 0.259
0.828CysAsn: 0.828 ± 0.331
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.276CysArg: 0.276 ± 0.273
0.552CysSer: 0.552 ± 0.338
0.0CysThr: 0.0 ± 0.0
0.276CysVal: 0.276 ± 0.295
0.0CysTrp: 0.0 ± 0.0
0.276CysTyr: 0.276 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
2.209AspAla: 2.209 ± 0.796
0.276AspCys: 0.276 ± 0.256
3.865AspAsp: 3.865 ± 0.967
5.522AspGlu: 5.522 ± 1.223
2.761AspPhe: 2.761 ± 0.93
2.761AspGly: 2.761 ± 0.796
1.657AspHis: 1.657 ± 0.401
6.626AspIle: 6.626 ± 1.467
3.865AspLys: 3.865 ± 0.937
4.97AspLeu: 4.97 ± 1.167
1.104AspMet: 1.104 ± 0.522
4.97AspAsn: 4.97 ± 0.795
0.828AspPro: 0.828 ± 0.447
1.104AspGln: 1.104 ± 0.639
1.933AspArg: 1.933 ± 0.648
3.037AspSer: 3.037 ± 1.067
3.037AspThr: 3.037 ± 0.876
3.037AspVal: 3.037 ± 0.931
0.552AspTrp: 0.552 ± 0.347
3.865AspTyr: 3.865 ± 0.983
0.0AspXaa: 0.0 ± 0.0
Glu
4.141GluAla: 4.141 ± 1.031
0.828GluCys: 0.828 ± 0.467
4.417GluAsp: 4.417 ± 1.126
8.283GluGlu: 8.283 ± 2.236
3.037GluPhe: 3.037 ± 1.05
0.828GluGly: 0.828 ± 0.465
1.104GluHis: 1.104 ± 0.515
4.694GluIle: 4.694 ± 1.135
8.559GluLys: 8.559 ± 1.395
11.596GluLeu: 11.596 ± 1.242
3.865GluMet: 3.865 ± 0.878
5.522GluAsn: 5.522 ± 1.227
1.933GluPro: 1.933 ± 0.616
3.313GluGln: 3.313 ± 0.937
3.589GluArg: 3.589 ± 1.16
3.313GluSer: 3.313 ± 1.011
3.037GluThr: 3.037 ± 1.027
2.761GluVal: 2.761 ± 0.744
0.828GluTrp: 0.828 ± 0.42
4.694GluTyr: 4.694 ± 1.102
0.0GluXaa: 0.0 ± 0.0
Phe
2.209PheAla: 2.209 ± 0.803
0.0PheCys: 0.0 ± 0.0
3.037PheAsp: 3.037 ± 0.776
3.865PheGlu: 3.865 ± 0.955
3.313PhePhe: 3.313 ± 1.041
2.209PheGly: 2.209 ± 0.555
0.552PheHis: 0.552 ± 0.347
3.865PheIle: 3.865 ± 0.993
3.589PheLys: 3.589 ± 0.606
4.97PheLeu: 4.97 ± 1.428
0.828PheMet: 0.828 ± 0.462
3.589PheAsn: 3.589 ± 0.889
0.828PhePro: 0.828 ± 0.503
0.552PheGln: 0.552 ± 0.313
1.38PheArg: 1.38 ± 0.618
3.589PheSer: 3.589 ± 0.894
1.933PheThr: 1.933 ± 0.553
4.694PheVal: 4.694 ± 1.337
0.828PheTrp: 0.828 ± 0.431
1.104PheTyr: 1.104 ± 0.662
0.0PheXaa: 0.0 ± 0.0
Gly
3.037GlyAla: 3.037 ± 0.677
0.828GlyCys: 0.828 ± 0.528
1.657GlyAsp: 1.657 ± 0.562
3.589GlyGlu: 3.589 ± 0.953
1.657GlyPhe: 1.657 ± 0.564
2.485GlyGly: 2.485 ± 0.847
0.828GlyHis: 0.828 ± 0.567
3.589GlyIle: 3.589 ± 0.924
4.141GlyLys: 4.141 ± 1.032
6.074GlyLeu: 6.074 ± 1.51
1.933GlyMet: 1.933 ± 0.648
2.209GlyAsn: 2.209 ± 0.763
0.0GlyPro: 0.0 ± 0.0
0.828GlyGln: 0.828 ± 0.371
1.657GlyArg: 1.657 ± 0.732
2.485GlySer: 2.485 ± 0.691
3.313GlyThr: 3.313 ± 1.081
3.313GlyVal: 3.313 ± 0.87
0.276GlyTrp: 0.276 ± 0.271
1.657GlyTyr: 1.657 ± 0.608
0.0GlyXaa: 0.0 ± 0.0
His
0.828HisAla: 0.828 ± 0.768
0.0HisCys: 0.0 ± 0.0
0.828HisAsp: 0.828 ± 0.615
0.828HisGlu: 0.828 ± 0.416
1.104HisPhe: 1.104 ± 0.509
0.552HisGly: 0.552 ± 0.338
0.552HisHis: 0.552 ± 0.409
1.104HisIle: 1.104 ± 0.556
0.276HisLys: 0.276 ± 0.269
2.209HisLeu: 2.209 ± 0.69
0.828HisMet: 0.828 ± 0.383
0.828HisAsn: 0.828 ± 0.465
0.552HisPro: 0.552 ± 0.298
0.552HisGln: 0.552 ± 0.338
0.276HisArg: 0.276 ± 0.31
0.552HisSer: 0.552 ± 0.346
0.276HisThr: 0.276 ± 0.256
0.828HisVal: 0.828 ± 0.428
0.0HisTrp: 0.0 ± 0.0
0.276HisTyr: 0.276 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
4.694IleAla: 4.694 ± 1.285
0.552IleCys: 0.552 ± 0.405
5.798IleAsp: 5.798 ± 1.448
6.902IleGlu: 6.902 ± 0.839
6.074IlePhe: 6.074 ± 1.868
4.417IleGly: 4.417 ± 0.997
0.828IleHis: 0.828 ± 0.396
8.835IleIle: 8.835 ± 1.896
4.417IleLys: 4.417 ± 1.221
6.35IleLeu: 6.35 ± 1.526
0.828IleMet: 0.828 ± 0.491
6.902IleAsn: 6.902 ± 1.144
1.38IlePro: 1.38 ± 0.554
4.694IleGln: 4.694 ± 0.936
4.694IleArg: 4.694 ± 1.032
8.559IleSer: 8.559 ± 1.489
3.865IleThr: 3.865 ± 0.949
4.97IleVal: 4.97 ± 1.307
0.552IleTrp: 0.552 ± 0.362
2.485IleTyr: 2.485 ± 0.482
0.0IleXaa: 0.0 ± 0.0
Lys
5.246LysAla: 5.246 ± 1.854
0.0LysCys: 0.0 ± 0.0
4.97LysAsp: 4.97 ± 0.93
8.559LysGlu: 8.559 ± 1.38
2.485LysPhe: 2.485 ± 0.925
3.313LysGly: 3.313 ± 0.974
1.104LysHis: 1.104 ± 0.629
9.111LysIle: 9.111 ± 1.496
11.32LysLys: 11.32 ± 2.144
7.178LysLeu: 7.178 ± 1.515
2.485LysMet: 2.485 ± 0.594
4.97LysAsn: 4.97 ± 1.29
1.657LysPro: 1.657 ± 0.589
5.246LysGln: 5.246 ± 1.213
3.313LysArg: 3.313 ± 0.744
6.35LysSer: 6.35 ± 1.031
7.178LysThr: 7.178 ± 1.62
7.731LysVal: 7.731 ± 1.545
0.276LysTrp: 0.276 ± 0.319
3.589LysTyr: 3.589 ± 0.856
0.0LysXaa: 0.0 ± 0.0
Leu
5.246LeuAla: 5.246 ± 1.1
0.552LeuCys: 0.552 ± 0.346
6.902LeuAsp: 6.902 ± 1.445
6.35LeuGlu: 6.35 ± 1.324
4.694LeuPhe: 4.694 ± 1.178
7.178LeuGly: 7.178 ± 2.007
1.38LeuHis: 1.38 ± 0.671
10.768LeuIle: 10.768 ± 1.982
8.007LeuLys: 8.007 ± 1.312
10.215LeuLeu: 10.215 ± 1.664
2.761LeuMet: 2.761 ± 0.785
6.074LeuAsn: 6.074 ± 0.908
2.761LeuPro: 2.761 ± 0.754
6.074LeuGln: 6.074 ± 1.18
3.037LeuArg: 3.037 ± 0.821
4.97LeuSer: 4.97 ± 1.103
3.865LeuThr: 3.865 ± 0.937
5.522LeuVal: 5.522 ± 1.188
0.276LeuTrp: 0.276 ± 0.256
3.589LeuTyr: 3.589 ± 0.876
0.0LeuXaa: 0.0 ± 0.0
Met
1.657MetAla: 1.657 ± 0.565
0.276MetCys: 0.276 ± 0.256
1.104MetAsp: 1.104 ± 0.569
1.933MetGlu: 1.933 ± 0.851
0.828MetPhe: 0.828 ± 0.511
0.828MetGly: 0.828 ± 0.348
0.0MetHis: 0.0 ± 0.0
3.037MetIle: 3.037 ± 1.314
4.417MetLys: 4.417 ± 1.245
2.209MetLeu: 2.209 ± 0.807
1.104MetMet: 1.104 ± 0.436
2.209MetAsn: 2.209 ± 0.578
0.552MetPro: 0.552 ± 0.362
2.761MetGln: 2.761 ± 0.829
0.828MetArg: 0.828 ± 0.406
0.552MetSer: 0.552 ± 0.578
0.828MetThr: 0.828 ± 0.41
1.933MetVal: 1.933 ± 0.645
0.0MetTrp: 0.0 ± 0.0
1.38MetTyr: 1.38 ± 0.526
0.0MetXaa: 0.0 ± 0.0
Asn
3.037AsnAla: 3.037 ± 1.294
0.0AsnCys: 0.0 ± 0.0
3.313AsnAsp: 3.313 ± 0.941
6.626AsnGlu: 6.626 ± 1.659
2.485AsnPhe: 2.485 ± 0.585
3.313AsnGly: 3.313 ± 0.979
0.552AsnHis: 0.552 ± 0.389
4.97AsnIle: 4.97 ± 1.345
9.663AsnLys: 9.663 ± 1.066
6.626AsnLeu: 6.626 ± 1.296
1.104AsnMet: 1.104 ± 0.719
2.485AsnAsn: 2.485 ± 0.612
1.933AsnPro: 1.933 ± 0.806
2.485AsnGln: 2.485 ± 0.827
3.037AsnArg: 3.037 ± 0.57
4.694AsnSer: 4.694 ± 1.642
2.209AsnThr: 2.209 ± 0.936
1.933AsnVal: 1.933 ± 0.632
1.38AsnTrp: 1.38 ± 0.504
1.38AsnTyr: 1.38 ± 0.496
0.0AsnXaa: 0.0 ± 0.0
Pro
1.38ProAla: 1.38 ± 0.626
0.0ProCys: 0.0 ± 0.0
1.657ProAsp: 1.657 ± 0.68
1.657ProGlu: 1.657 ± 0.574
0.552ProPhe: 0.552 ± 0.339
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
0.828ProIle: 0.828 ± 0.366
1.38ProLys: 1.38 ± 0.748
2.485ProLeu: 2.485 ± 0.984
0.0ProMet: 0.0 ± 0.0
2.761ProAsn: 2.761 ± 0.835
1.933ProPro: 1.933 ± 0.729
0.552ProGln: 0.552 ± 0.334
1.38ProArg: 1.38 ± 0.557
1.38ProSer: 1.38 ± 0.47
1.933ProThr: 1.933 ± 0.643
1.104ProVal: 1.104 ± 0.629
0.0ProTrp: 0.0 ± 0.0
1.657ProTyr: 1.657 ± 0.591
0.0ProXaa: 0.0 ± 0.0
Gln
1.933GlnAla: 1.933 ± 0.555
0.828GlnCys: 0.828 ± 0.513
1.933GlnAsp: 1.933 ± 0.669
1.933GlnGlu: 1.933 ± 0.59
1.38GlnPhe: 1.38 ± 0.53
2.761GlnGly: 2.761 ± 0.761
0.552GlnHis: 0.552 ± 0.349
3.865GlnIle: 3.865 ± 1.178
5.246GlnLys: 5.246 ± 1.162
4.417GlnLeu: 4.417 ± 1.206
1.104GlnMet: 1.104 ± 0.606
1.38GlnAsn: 1.38 ± 0.439
0.828GlnPro: 0.828 ± 0.423
1.104GlnGln: 1.104 ± 0.481
2.209GlnArg: 2.209 ± 0.711
2.761GlnSer: 2.761 ± 1.018
2.209GlnThr: 2.209 ± 1.02
1.657GlnVal: 1.657 ± 0.628
0.276GlnTrp: 0.276 ± 0.328
1.38GlnTyr: 1.38 ± 0.48
0.0GlnXaa: 0.0 ± 0.0
Arg
1.104ArgAla: 1.104 ± 0.739
0.276ArgCys: 0.276 ± 0.273
1.933ArgAsp: 1.933 ± 0.909
4.417ArgGlu: 4.417 ± 1.188
2.209ArgPhe: 2.209 ± 0.763
1.657ArgGly: 1.657 ± 0.563
0.552ArgHis: 0.552 ± 0.278
3.589ArgIle: 3.589 ± 1.069
3.037ArgLys: 3.037 ± 0.77
3.037ArgLeu: 3.037 ± 0.812
1.104ArgMet: 1.104 ± 0.484
2.485ArgAsn: 2.485 ± 0.779
1.38ArgPro: 1.38 ± 0.648
3.589ArgGln: 3.589 ± 0.814
0.552ArgArg: 0.552 ± 0.349
2.485ArgSer: 2.485 ± 0.625
2.761ArgThr: 2.761 ± 0.632
2.761ArgVal: 2.761 ± 0.637
0.828ArgTrp: 0.828 ± 0.431
1.38ArgTyr: 1.38 ± 0.462
0.0ArgXaa: 0.0 ± 0.0
Ser
3.037SerAla: 3.037 ± 0.896
0.552SerCys: 0.552 ± 0.381
4.141SerAsp: 4.141 ± 1.01
2.761SerGlu: 2.761 ± 0.752
2.761SerPhe: 2.761 ± 0.886
2.761SerGly: 2.761 ± 0.782
1.104SerHis: 1.104 ± 0.447
6.074SerIle: 6.074 ± 1.467
6.074SerLys: 6.074 ± 1.637
6.626SerLeu: 6.626 ± 1.296
3.865SerMet: 3.865 ± 0.835
3.589SerAsn: 3.589 ± 0.751
0.828SerPro: 0.828 ± 0.454
1.657SerGln: 1.657 ± 0.531
2.485SerArg: 2.485 ± 0.7
2.761SerSer: 2.761 ± 1.16
2.209SerThr: 2.209 ± 0.78
3.037SerVal: 3.037 ± 0.82
1.104SerTrp: 1.104 ± 0.66
4.694SerTyr: 4.694 ± 1.25
0.0SerXaa: 0.0 ± 0.0
Thr
1.657ThrAla: 1.657 ± 0.624
0.276ThrCys: 0.276 ± 0.208
2.485ThrAsp: 2.485 ± 1.218
4.97ThrGlu: 4.97 ± 1.275
1.933ThrPhe: 1.933 ± 0.754
2.761ThrGly: 2.761 ± 0.824
0.276ThrHis: 0.276 ± 0.256
3.589ThrIle: 3.589 ± 0.913
3.313ThrLys: 3.313 ± 1.163
6.074ThrLeu: 6.074 ± 1.311
0.828ThrMet: 0.828 ± 0.386
3.865ThrAsn: 3.865 ± 0.977
1.104ThrPro: 1.104 ± 0.534
1.38ThrGln: 1.38 ± 0.696
2.485ThrArg: 2.485 ± 0.81
1.657ThrSer: 1.657 ± 0.673
3.313ThrThr: 3.313 ± 1.127
3.865ThrVal: 3.865 ± 0.947
1.104ThrTrp: 1.104 ± 0.556
2.485ThrTyr: 2.485 ± 0.613
0.0ThrXaa: 0.0 ± 0.0
Val
1.38ValAla: 1.38 ± 0.479
0.828ValCys: 0.828 ± 0.428
4.141ValAsp: 4.141 ± 1.29
4.417ValGlu: 4.417 ± 0.944
1.657ValPhe: 1.657 ± 0.688
2.209ValGly: 2.209 ± 0.602
0.828ValHis: 0.828 ± 0.605
3.313ValIle: 3.313 ± 1.141
5.522ValLys: 5.522 ± 1.004
5.798ValLeu: 5.798 ± 0.96
1.104ValMet: 1.104 ± 0.439
3.589ValAsn: 3.589 ± 0.77
2.761ValPro: 2.761 ± 0.991
1.104ValGln: 1.104 ± 0.542
2.209ValArg: 2.209 ± 0.98
5.246ValSer: 5.246 ± 0.797
3.313ValThr: 3.313 ± 0.654
3.865ValVal: 3.865 ± 0.876
0.276ValTrp: 0.276 ± 0.208
4.97ValTyr: 4.97 ± 1.262
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.276TrpAsp: 0.276 ± 0.273
1.657TrpGlu: 1.657 ± 0.515
0.828TrpPhe: 0.828 ± 0.587
0.828TrpGly: 0.828 ± 0.39
0.0TrpHis: 0.0 ± 0.0
1.104TrpIle: 1.104 ± 0.524
0.276TrpLys: 0.276 ± 0.271
1.104TrpLeu: 1.104 ± 0.496
0.552TrpMet: 0.552 ± 0.407
0.276TrpAsn: 0.276 ± 0.246
0.0TrpPro: 0.0 ± 0.0
0.276TrpGln: 0.276 ± 0.242
0.552TrpArg: 0.552 ± 0.298
0.552TrpSer: 0.552 ± 0.278
0.276TrpThr: 0.276 ± 0.269
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.552TrpTyr: 0.552 ± 0.299
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.276TyrAla: 0.276 ± 0.242
0.552TyrCys: 0.552 ± 0.36
2.209TyrAsp: 2.209 ± 0.618
2.209TyrGlu: 2.209 ± 0.743
4.141TyrPhe: 4.141 ± 0.888
3.037TyrGly: 3.037 ± 0.747
0.552TyrHis: 0.552 ± 0.38
3.037TyrIle: 3.037 ± 0.826
7.454TyrLys: 7.454 ± 1.257
3.865TyrLeu: 3.865 ± 0.986
0.828TyrMet: 0.828 ± 0.382
2.761TyrAsn: 2.761 ± 0.979
1.38TyrPro: 1.38 ± 0.614
1.657TyrGln: 1.657 ± 0.618
2.485TyrArg: 2.485 ± 0.827
3.313TyrSer: 3.313 ± 0.875
1.38TyrThr: 1.38 ± 0.805
2.209TyrVal: 2.209 ± 0.64
0.276TyrTrp: 0.276 ± 0.271
1.38TyrTyr: 1.38 ± 0.581
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3623 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski