Amino acid dipepetide frequency for Streptococcus phage Javan155

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.94AlaAla: 3.94 ± 1.049
0.296AlaCys: 0.296 ± 0.162
5.122AlaAsp: 5.122 ± 0.626
5.713AlaGlu: 5.713 ± 0.748
2.66AlaPhe: 2.66 ± 0.52
4.827AlaGly: 4.827 ± 1.115
0.788AlaHis: 0.788 ± 0.321
6.994AlaIle: 6.994 ± 0.995
6.994AlaLys: 6.994 ± 0.818
5.812AlaLeu: 5.812 ± 0.597
1.478AlaMet: 1.478 ± 0.422
4.728AlaAsn: 4.728 ± 0.707
1.675AlaPro: 1.675 ± 0.361
2.266AlaGln: 2.266 ± 0.41
2.266AlaArg: 2.266 ± 0.459
3.546AlaSer: 3.546 ± 0.598
4.039AlaThr: 4.039 ± 0.682
4.827AlaVal: 4.827 ± 0.683
0.788AlaTrp: 0.788 ± 0.292
2.069AlaTyr: 2.069 ± 0.404
0.0AlaXaa: 0.0 ± 0.0
Cys
0.197CysAla: 0.197 ± 0.148
0.197CysCys: 0.197 ± 0.132
0.296CysAsp: 0.296 ± 0.215
0.69CysGlu: 0.69 ± 0.243
0.493CysPhe: 0.493 ± 0.198
0.394CysGly: 0.394 ± 0.213
0.197CysHis: 0.197 ± 0.138
0.296CysIle: 0.296 ± 0.219
0.591CysLys: 0.591 ± 0.263
0.887CysLeu: 0.887 ± 0.29
0.197CysMet: 0.197 ± 0.161
0.296CysAsn: 0.296 ± 0.185
0.0CysPro: 0.0 ± 0.0
0.099CysGln: 0.099 ± 0.1
0.099CysArg: 0.099 ± 0.101
0.394CysSer: 0.394 ± 0.188
0.0CysThr: 0.0 ± 0.0
0.394CysVal: 0.394 ± 0.237
0.0CysTrp: 0.0 ± 0.0
0.296CysTyr: 0.296 ± 0.203
0.0CysXaa: 0.0 ± 0.0
Asp
3.645AspAla: 3.645 ± 0.74
0.591AspCys: 0.591 ± 0.312
4.531AspAsp: 4.531 ± 0.534
4.531AspGlu: 4.531 ± 0.844
2.857AspPhe: 2.857 ± 0.447
5.812AspGly: 5.812 ± 0.84
0.591AspHis: 0.591 ± 0.18
4.236AspIle: 4.236 ± 0.46
6.304AspLys: 6.304 ± 0.765
6.797AspLeu: 6.797 ± 0.873
1.084AspMet: 1.084 ± 0.335
3.94AspAsn: 3.94 ± 0.455
1.576AspPro: 1.576 ± 0.415
1.182AspGln: 1.182 ± 0.323
2.266AspArg: 2.266 ± 0.438
3.645AspSer: 3.645 ± 0.705
3.743AspThr: 3.743 ± 0.863
4.334AspVal: 4.334 ± 0.549
1.084AspTrp: 1.084 ± 0.331
2.364AspTyr: 2.364 ± 0.519
0.0AspXaa: 0.0 ± 0.0
Glu
5.713GluAla: 5.713 ± 0.733
0.296GluCys: 0.296 ± 0.169
2.955GluAsp: 2.955 ± 0.591
5.812GluGlu: 5.812 ± 1.013
2.955GluPhe: 2.955 ± 0.502
3.251GluGly: 3.251 ± 0.546
1.182GluHis: 1.182 ± 0.399
6.206GluIle: 6.206 ± 0.852
6.797GluLys: 6.797 ± 0.979
7.683GluLeu: 7.683 ± 1.113
2.266GluMet: 2.266 ± 0.535
3.546GluAsn: 3.546 ± 0.744
1.773GluPro: 1.773 ± 0.393
3.448GluGln: 3.448 ± 0.587
2.266GluArg: 2.266 ± 0.476
3.743GluSer: 3.743 ± 0.544
4.827GluThr: 4.827 ± 0.797
4.236GluVal: 4.236 ± 0.757
0.985GluTrp: 0.985 ± 0.303
2.561GluTyr: 2.561 ± 0.566
0.0GluXaa: 0.0 ± 0.0
Phe
2.955PheAla: 2.955 ± 0.553
0.296PheCys: 0.296 ± 0.196
3.152PheAsp: 3.152 ± 0.471
3.645PheGlu: 3.645 ± 0.729
1.576PhePhe: 1.576 ± 0.359
2.955PheGly: 2.955 ± 0.52
0.493PheHis: 0.493 ± 0.193
1.576PheIle: 1.576 ± 0.397
3.251PheLys: 3.251 ± 0.569
2.758PheLeu: 2.758 ± 0.54
1.182PheMet: 1.182 ± 0.366
2.069PheAsn: 2.069 ± 0.381
0.69PhePro: 0.69 ± 0.278
1.478PheGln: 1.478 ± 0.388
2.266PheArg: 2.266 ± 0.479
2.561PheSer: 2.561 ± 0.676
2.266PheThr: 2.266 ± 0.43
3.251PheVal: 3.251 ± 0.573
0.197PheTrp: 0.197 ± 0.134
2.069PheTyr: 2.069 ± 0.44
0.0PheXaa: 0.0 ± 0.0
Gly
4.334GlyAla: 4.334 ± 0.857
0.197GlyCys: 0.197 ± 0.124
4.433GlyAsp: 4.433 ± 0.823
3.743GlyGlu: 3.743 ± 0.936
3.349GlyPhe: 3.349 ± 0.503
3.94GlyGly: 3.94 ± 0.709
1.281GlyHis: 1.281 ± 0.371
5.713GlyIle: 5.713 ± 0.851
6.107GlyLys: 6.107 ± 0.755
4.433GlyLeu: 4.433 ± 0.955
2.069GlyMet: 2.069 ± 0.471
4.039GlyAsn: 4.039 ± 0.641
1.97GlyPro: 1.97 ± 1.468
2.857GlyGln: 2.857 ± 0.603
1.478GlyArg: 1.478 ± 0.408
3.645GlySer: 3.645 ± 0.597
3.842GlyThr: 3.842 ± 0.582
3.546GlyVal: 3.546 ± 0.683
1.281GlyTrp: 1.281 ± 0.439
3.94GlyTyr: 3.94 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
0.887HisAla: 0.887 ± 0.319
0.197HisCys: 0.197 ± 0.12
0.296HisAsp: 0.296 ± 0.205
1.182HisGlu: 1.182 ± 0.388
0.394HisPhe: 0.394 ± 0.181
1.084HisGly: 1.084 ± 0.282
0.296HisHis: 0.296 ± 0.168
0.788HisIle: 0.788 ± 0.288
0.69HisLys: 0.69 ± 0.274
1.773HisLeu: 1.773 ± 0.504
0.296HisMet: 0.296 ± 0.171
0.887HisAsn: 0.887 ± 0.294
0.591HisPro: 0.591 ± 0.24
0.69HisGln: 0.69 ± 0.298
0.493HisArg: 0.493 ± 0.199
0.493HisSer: 0.493 ± 0.174
1.182HisThr: 1.182 ± 0.291
0.69HisVal: 0.69 ± 0.26
0.099HisTrp: 0.099 ± 0.099
0.296HisTyr: 0.296 ± 0.158
0.0HisXaa: 0.0 ± 0.0
Ile
4.728IleAla: 4.728 ± 0.734
0.493IleCys: 0.493 ± 0.267
6.797IleAsp: 6.797 ± 0.929
4.63IleGlu: 4.63 ± 0.728
1.675IlePhe: 1.675 ± 0.413
4.137IleGly: 4.137 ± 0.804
1.084IleHis: 1.084 ± 0.274
4.334IleIle: 4.334 ± 0.574
7.289IleLys: 7.289 ± 0.791
5.024IleLeu: 5.024 ± 0.719
1.675IleMet: 1.675 ± 0.445
4.334IleAsn: 4.334 ± 0.561
2.364IlePro: 2.364 ± 0.573
1.675IleGln: 1.675 ± 0.323
2.955IleArg: 2.955 ± 0.408
5.615IleSer: 5.615 ± 1.205
5.024IleThr: 5.024 ± 0.797
4.925IleVal: 4.925 ± 0.771
0.296IleTrp: 0.296 ± 0.147
1.182IleTyr: 1.182 ± 0.338
0.0IleXaa: 0.0 ± 0.0
Lys
6.009LysAla: 6.009 ± 1.024
0.394LysCys: 0.394 ± 0.205
4.925LysAsp: 4.925 ± 0.587
6.895LysGlu: 6.895 ± 1.11
2.561LysPhe: 2.561 ± 0.509
5.91LysGly: 5.91 ± 0.991
1.576LysHis: 1.576 ± 0.439
6.895LysIle: 6.895 ± 1.043
6.895LysLys: 6.895 ± 1.358
6.009LysLeu: 6.009 ± 0.485
2.758LysMet: 2.758 ± 0.571
4.728LysAsn: 4.728 ± 0.753
2.955LysPro: 2.955 ± 0.584
4.334LysGln: 4.334 ± 0.689
4.925LysArg: 4.925 ± 0.703
6.797LysSer: 6.797 ± 0.849
5.319LysThr: 5.319 ± 0.643
5.319LysVal: 5.319 ± 0.87
1.576LysTrp: 1.576 ± 0.381
3.054LysTyr: 3.054 ± 0.44
0.0LysXaa: 0.0 ± 0.0
Leu
6.797LeuAla: 6.797 ± 0.804
0.197LeuCys: 0.197 ± 0.152
6.895LeuAsp: 6.895 ± 0.844
5.319LeuGlu: 5.319 ± 0.639
3.645LeuPhe: 3.645 ± 0.509
4.433LeuGly: 4.433 ± 0.762
0.788LeuHis: 0.788 ± 0.31
4.63LeuIle: 4.63 ± 0.593
9.555LeuLys: 9.555 ± 1.102
5.91LeuLeu: 5.91 ± 0.782
1.281LeuMet: 1.281 ± 0.341
4.433LeuAsn: 4.433 ± 0.608
2.955LeuPro: 2.955 ± 0.651
2.857LeuGln: 2.857 ± 0.621
2.66LeuArg: 2.66 ± 0.514
6.895LeuSer: 6.895 ± 0.99
4.137LeuThr: 4.137 ± 0.578
4.433LeuVal: 4.433 ± 0.693
0.69LeuTrp: 0.69 ± 0.279
3.349LeuTyr: 3.349 ± 0.56
0.0LeuXaa: 0.0 ± 0.0
Met
2.561MetAla: 2.561 ± 0.611
0.099MetCys: 0.099 ± 0.099
1.379MetAsp: 1.379 ± 0.45
1.182MetGlu: 1.182 ± 0.312
0.591MetPhe: 0.591 ± 0.227
1.281MetGly: 1.281 ± 0.544
0.296MetHis: 0.296 ± 0.143
1.773MetIle: 1.773 ± 0.39
1.675MetLys: 1.675 ± 0.435
1.379MetLeu: 1.379 ± 0.359
0.493MetMet: 0.493 ± 0.222
1.379MetAsn: 1.379 ± 0.382
0.69MetPro: 0.69 ± 0.276
1.182MetGln: 1.182 ± 0.34
0.493MetArg: 0.493 ± 0.283
1.97MetSer: 1.97 ± 0.402
2.463MetThr: 2.463 ± 0.605
1.281MetVal: 1.281 ± 0.405
0.197MetTrp: 0.197 ± 0.112
0.985MetTyr: 0.985 ± 0.333
0.0MetXaa: 0.0 ± 0.0
Asn
3.546AsnAla: 3.546 ± 0.623
0.493AsnCys: 0.493 ± 0.247
3.251AsnAsp: 3.251 ± 0.484
3.251AsnGlu: 3.251 ± 0.486
2.561AsnPhe: 2.561 ± 0.602
3.743AsnGly: 3.743 ± 0.558
0.985AsnHis: 0.985 ± 0.33
3.645AsnIle: 3.645 ± 0.547
4.433AsnLys: 4.433 ± 0.707
4.137AsnLeu: 4.137 ± 0.481
1.379AsnMet: 1.379 ± 0.325
3.251AsnAsn: 3.251 ± 0.609
3.054AsnPro: 3.054 ± 0.653
2.463AsnGln: 2.463 ± 0.479
2.758AsnArg: 2.758 ± 0.54
3.251AsnSer: 3.251 ± 0.623
3.152AsnThr: 3.152 ± 0.504
3.054AsnVal: 3.054 ± 0.559
0.985AsnTrp: 0.985 ± 0.327
2.167AsnTyr: 2.167 ± 0.398
0.0AsnXaa: 0.0 ± 0.0
Pro
1.872ProAla: 1.872 ± 0.481
0.099ProCys: 0.099 ± 0.103
1.576ProAsp: 1.576 ± 0.425
1.773ProGlu: 1.773 ± 0.528
1.182ProPhe: 1.182 ± 0.338
1.084ProGly: 1.084 ± 0.406
0.197ProHis: 0.197 ± 0.145
1.675ProIle: 1.675 ± 0.371
3.349ProLys: 3.349 ± 0.62
2.167ProLeu: 2.167 ± 0.469
0.985ProMet: 0.985 ± 0.221
1.379ProAsn: 1.379 ± 0.353
1.182ProPro: 1.182 ± 0.4
1.872ProGln: 1.872 ± 0.473
1.281ProArg: 1.281 ± 0.355
2.364ProSer: 2.364 ± 0.495
1.97ProThr: 1.97 ± 0.577
1.675ProVal: 1.675 ± 0.456
0.296ProTrp: 0.296 ± 0.177
1.182ProTyr: 1.182 ± 0.303
0.0ProXaa: 0.0 ± 0.0
Gln
3.152GlnAla: 3.152 ± 0.619
0.296GlnCys: 0.296 ± 0.164
1.576GlnAsp: 1.576 ± 0.374
1.97GlnGlu: 1.97 ± 0.77
1.576GlnPhe: 1.576 ± 0.394
3.054GlnGly: 3.054 ± 0.631
0.69GlnHis: 0.69 ± 0.22
2.758GlnIle: 2.758 ± 0.815
3.546GlnLys: 3.546 ± 0.656
2.463GlnLeu: 2.463 ± 0.498
0.69GlnMet: 0.69 ± 0.178
2.463GlnAsn: 2.463 ± 0.443
0.985GlnPro: 0.985 ± 0.34
1.576GlnGln: 1.576 ± 0.392
1.675GlnArg: 1.675 ± 0.49
3.251GlnSer: 3.251 ± 0.606
2.66GlnThr: 2.66 ± 0.677
2.857GlnVal: 2.857 ± 0.54
0.591GlnTrp: 0.591 ± 0.257
0.887GlnTyr: 0.887 ± 0.273
0.0GlnXaa: 0.0 ± 0.0
Arg
2.364ArgAla: 2.364 ± 0.599
0.197ArgCys: 0.197 ± 0.149
2.463ArgAsp: 2.463 ± 0.494
2.955ArgGlu: 2.955 ± 0.579
1.773ArgPhe: 1.773 ± 0.381
1.576ArgGly: 1.576 ± 0.707
0.591ArgHis: 0.591 ± 0.24
2.66ArgIle: 2.66 ± 0.677
3.152ArgLys: 3.152 ± 0.725
4.531ArgLeu: 4.531 ± 0.818
0.887ArgMet: 0.887 ± 0.316
1.872ArgAsn: 1.872 ± 0.369
0.887ArgPro: 0.887 ± 0.242
1.478ArgGln: 1.478 ± 0.315
2.069ArgArg: 2.069 ± 0.442
1.675ArgSer: 1.675 ± 0.514
2.463ArgThr: 2.463 ± 0.471
2.66ArgVal: 2.66 ± 0.561
0.493ArgTrp: 0.493 ± 0.195
2.069ArgTyr: 2.069 ± 0.43
0.0ArgXaa: 0.0 ± 0.0
Ser
5.024SerAla: 5.024 ± 1.335
0.591SerCys: 0.591 ± 0.254
4.039SerAsp: 4.039 ± 0.544
4.137SerGlu: 4.137 ± 0.654
4.137SerPhe: 4.137 ± 0.691
5.812SerGly: 5.812 ± 0.687
0.394SerHis: 0.394 ± 0.326
4.039SerIle: 4.039 ± 0.645
4.63SerLys: 4.63 ± 0.592
6.304SerLeu: 6.304 ± 0.879
1.084SerMet: 1.084 ± 0.259
3.645SerAsn: 3.645 ± 0.587
1.576SerPro: 1.576 ± 0.402
2.66SerGln: 2.66 ± 0.527
2.266SerArg: 2.266 ± 0.721
4.039SerSer: 4.039 ± 0.69
3.645SerThr: 3.645 ± 0.462
3.546SerVal: 3.546 ± 0.835
0.69SerTrp: 0.69 ± 0.286
2.069SerTyr: 2.069 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
4.827ThrAla: 4.827 ± 0.655
0.099ThrCys: 0.099 ± 0.095
4.039ThrAsp: 4.039 ± 0.461
4.531ThrGlu: 4.531 ± 0.662
2.561ThrPhe: 2.561 ± 0.46
5.812ThrGly: 5.812 ± 1.28
0.591ThrHis: 0.591 ± 0.3
4.137ThrIle: 4.137 ± 0.651
5.516ThrLys: 5.516 ± 0.671
5.812ThrLeu: 5.812 ± 0.777
1.281ThrMet: 1.281 ± 0.313
3.054ThrAsn: 3.054 ± 0.605
1.182ThrPro: 1.182 ± 0.31
2.463ThrGln: 2.463 ± 0.532
1.576ThrArg: 1.576 ± 0.352
3.645ThrSer: 3.645 ± 0.663
3.842ThrThr: 3.842 ± 0.6
4.728ThrVal: 4.728 ± 0.866
0.493ThrTrp: 0.493 ± 0.189
2.069ThrTyr: 2.069 ± 0.428
0.0ThrXaa: 0.0 ± 0.0
Val
4.334ValAla: 4.334 ± 0.66
0.296ValCys: 0.296 ± 0.169
4.728ValAsp: 4.728 ± 0.826
6.6ValGlu: 6.6 ± 0.96
2.364ValPhe: 2.364 ± 0.421
3.448ValGly: 3.448 ± 0.585
0.394ValHis: 0.394 ± 0.192
4.433ValIle: 4.433 ± 0.667
4.827ValLys: 4.827 ± 0.756
4.827ValLeu: 4.827 ± 0.733
1.478ValMet: 1.478 ± 0.342
2.463ValAsn: 2.463 ± 0.568
1.281ValPro: 1.281 ± 0.361
2.167ValGln: 2.167 ± 0.5
2.66ValArg: 2.66 ± 0.557
4.137ValSer: 4.137 ± 0.612
4.925ValThr: 4.925 ± 0.67
4.236ValVal: 4.236 ± 0.677
0.296ValTrp: 0.296 ± 0.18
2.758ValTyr: 2.758 ± 0.638
0.0ValXaa: 0.0 ± 0.0
Trp
0.788TrpAla: 0.788 ± 0.344
0.197TrpCys: 0.197 ± 0.137
0.493TrpAsp: 0.493 ± 0.16
0.985TrpGlu: 0.985 ± 0.329
0.591TrpPhe: 0.591 ± 0.241
1.281TrpGly: 1.281 ± 0.383
0.197TrpHis: 0.197 ± 0.185
0.985TrpIle: 0.985 ± 0.298
0.099TrpLys: 0.099 ± 0.107
0.591TrpLeu: 0.591 ± 0.241
0.296TrpMet: 0.296 ± 0.152
0.493TrpAsn: 0.493 ± 0.225
0.296TrpPro: 0.296 ± 0.166
0.591TrpGln: 0.591 ± 0.211
0.788TrpArg: 0.788 ± 0.321
1.084TrpSer: 1.084 ± 0.324
0.887TrpThr: 0.887 ± 0.361
0.394TrpVal: 0.394 ± 0.21
0.099TrpTrp: 0.099 ± 0.086
0.394TrpTyr: 0.394 ± 0.163
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.152TyrAla: 3.152 ± 0.603
0.493TyrCys: 0.493 ± 0.21
2.167TyrAsp: 2.167 ± 0.493
2.955TyrGlu: 2.955 ± 0.519
1.182TyrPhe: 1.182 ± 0.337
2.364TyrGly: 2.364 ± 0.349
0.69TyrHis: 0.69 ± 0.315
2.463TyrIle: 2.463 ± 0.438
4.236TyrLys: 4.236 ± 0.748
2.561TyrLeu: 2.561 ± 0.457
0.394TyrMet: 0.394 ± 0.197
2.561TyrAsn: 2.561 ± 0.722
1.478TyrPro: 1.478 ± 0.327
1.379TyrGln: 1.379 ± 0.319
1.576TyrArg: 1.576 ± 0.37
1.576TyrSer: 1.576 ± 0.489
1.872TyrThr: 1.872 ± 0.418
2.266TyrVal: 2.266 ± 0.426
0.394TyrTrp: 0.394 ± 0.158
1.379TyrTyr: 1.379 ± 0.414
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (10153 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski