Amino acid dipeptide frequency for Streptococcus phage Javan371

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
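As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to normalise by the total number of pairs are assumptions made for this example; the page does not state the exact normalisation used by the database.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Pairs are counted inside each protein only, never across protein boundaries.
    Assumption: frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKELLKE", "MAKE"]
freqs = dipeptide_per_mille(toy)

print(len(freqs))        # number of possible pairs, including Xaa
print(freqs["KE"])       # observed frequency of Lys-Glu in per mille
print(1000.0 / 400)      # uniform expectation for 400 standard pairs, in per mille
```

With only nine pairs in the toy input, the observed values are far from the uniform expectation; on a real proteome the same comparison gives the over- and underrepresentation that the table highlights.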

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
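The unit conversion and the comparison against the uniform expectation can be sketched as follows. The three values are copied from the table on this page; the helper names and the simple greater-than or less-than rule are assumptions for illustration, since the page does not spell out the exact rule behind the red and blue marking.

```python
# Uniform expectation for 400 equally likely dipeptides: 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation.

    Assumption: a plain comparison, ignoring the reported error.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table (in per mille).
examples = {"GluLeu": 8.846, "AlaAla": 3.998, "CysCys": 0.085}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```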

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, listed as dipeptide (first residue, second residue): value ± error
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.998 ± 1.025
AlaCys: 0.0 ± 0.0
AlaAsp: 5.699 ± 0.613
AlaGlu: 3.828 ± 0.722
AlaPhe: 2.637 ± 0.452
AlaGly: 4.253 ± 0.668
AlaHis: 1.106 ± 0.329
AlaIle: 5.359 ± 0.846
AlaLys: 5.529 ± 0.823
AlaLeu: 6.124 ± 0.773
AlaMet: 2.637 ± 0.666
AlaAsn: 4.678 ± 0.558
AlaPro: 1.786 ± 0.541
AlaGln: 2.977 ± 0.585
AlaArg: 2.467 ± 0.549
AlaSer: 3.913 ± 0.588
AlaThr: 4.423 ± 0.814
AlaVal: 5.273 ± 0.765
AlaTrp: 0.595 ± 0.238
AlaTyr: 1.956 ± 0.443
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.34 ± 0.195
CysCys: 0.085 ± 0.102
CysAsp: 0.255 ± 0.133
CysGlu: 0.34 ± 0.178
CysPhe: 0.425 ± 0.236
CysGly: 0.255 ± 0.161
CysHis: 0.0 ± 0.0
CysIle: 0.17 ± 0.113
CysLys: 0.51 ± 0.209
CysLeu: 0.255 ± 0.134
CysMet: 0.255 ± 0.132
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.34 ± 0.149
CysArg: 0.34 ± 0.152
CysSer: 0.34 ± 0.23
CysThr: 0.17 ± 0.132
CysVal: 0.425 ± 0.193
CysTrp: 0.0 ± 0.0
CysTyr: 0.34 ± 0.187
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.572 ± 0.543
AspCys: 0.68 ± 0.274
AspAsp: 3.572 ± 0.662
AspGlu: 4.933 ± 0.729
AspPhe: 3.913 ± 0.599
AspGly: 5.018 ± 0.673
AspHis: 1.106 ± 0.315
AspIle: 4.083 ± 0.632
AspLys: 6.209 ± 0.657
AspLeu: 6.379 ± 0.805
AspMet: 1.701 ± 0.369
AspAsn: 3.062 ± 0.502
AspPro: 2.041 ± 0.421
AspGln: 2.041 ± 0.451
AspArg: 2.041 ± 0.367
AspSer: 3.572 ± 0.506
AspThr: 1.956 ± 0.469
AspVal: 4.933 ± 0.55
AspTrp: 1.191 ± 0.308
AspTyr: 2.892 ± 0.52
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.273 ± 0.498
GluCys: 0.595 ± 0.266
GluAsp: 5.614 ± 0.79
GluGlu: 7.57 ± 0.907
GluPhe: 3.913 ± 0.663
GluGly: 3.657 ± 0.463
GluHis: 0.766 ± 0.259
GluIle: 6.039 ± 0.877
GluLys: 8.165 ± 1.007
GluLeu: 8.846 ± 0.976
GluMet: 2.297 ± 0.463
GluAsn: 5.784 ± 0.871
GluPro: 2.041 ± 0.345
GluGln: 3.487 ± 0.6
GluArg: 2.467 ± 0.481
GluSer: 4.678 ± 0.597
GluThr: 3.742 ± 0.546
GluVal: 5.103 ± 0.644
GluTrp: 1.021 ± 0.275
GluTyr: 2.126 ± 0.408
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.317 ± 0.623
PheCys: 0.34 ± 0.182
PheAsp: 4.083 ± 0.623
PheGlu: 4.763 ± 0.839
PhePhe: 1.531 ± 0.372
PheGly: 3.742 ± 0.664
PheHis: 0.34 ± 0.132
PheIle: 3.317 ± 0.568
PheLys: 4.083 ± 0.599
PheLeu: 2.382 ± 0.538
PheMet: 1.191 ± 0.32
PheAsn: 2.722 ± 0.441
PhePro: 1.191 ± 0.295
PheGln: 1.701 ± 0.35
PheArg: 1.871 ± 0.431
PheSer: 2.892 ± 0.555
PheThr: 2.382 ± 0.546
PheVal: 2.297 ± 0.503
PheTrp: 0.255 ± 0.135
PheTyr: 1.446 ± 0.344
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.913 ± 0.674
GlyCys: 0.085 ± 0.083
GlyAsp: 4.678 ± 0.502
GlyGlu: 3.828 ± 0.617
GlyPhe: 3.828 ± 0.591
GlyGly: 4.508 ± 0.732
GlyHis: 0.766 ± 0.22
GlyIle: 4.083 ± 0.701
GlyLys: 6.124 ± 0.597
GlyLeu: 5.444 ± 0.736
GlyMet: 1.701 ± 0.387
GlyAsn: 3.317 ± 0.524
GlyPro: 0.595 ± 0.213
GlyGln: 3.062 ± 0.563
GlyArg: 2.297 ± 0.406
GlySer: 4.763 ± 0.683
GlyThr: 3.572 ± 0.638
GlyVal: 2.977 ± 0.566
GlyTrp: 1.021 ± 0.255
GlyTyr: 2.126 ± 0.423
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.021 ± 0.254
HisCys: 0.085 ± 0.091
HisAsp: 0.766 ± 0.32
HisGlu: 1.106 ± 0.246
HisPhe: 0.766 ± 0.272
HisGly: 0.68 ± 0.222
HisHis: 0.34 ± 0.163
HisIle: 1.021 ± 0.39
HisLys: 0.595 ± 0.188
HisLeu: 0.936 ± 0.269
HisMet: 0.34 ± 0.175
HisAsn: 0.595 ± 0.262
HisPro: 0.766 ± 0.264
HisGln: 0.51 ± 0.187
HisArg: 0.51 ± 0.192
HisSer: 0.851 ± 0.281
HisThr: 0.936 ± 0.243
HisVal: 0.851 ± 0.188
HisTrp: 0.17 ± 0.117
HisTyr: 0.425 ± 0.285
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.763 ± 0.694
IleCys: 0.34 ± 0.18
IleAsp: 4.168 ± 0.476
IleGlu: 7.315 ± 0.898
IlePhe: 3.062 ± 0.618
IleGly: 3.828 ± 0.681
IleHis: 0.766 ± 0.249
IleIle: 2.977 ± 0.483
IleLys: 6.804 ± 0.965
IleLeu: 5.103 ± 0.713
IleMet: 1.106 ± 0.319
IleAsn: 3.998 ± 0.528
IlePro: 2.467 ± 0.478
IleGln: 2.807 ± 0.468
IleArg: 2.382 ± 0.441
IleSer: 3.147 ± 0.783
IleThr: 4.678 ± 0.453
IleVal: 2.977 ± 0.52
IleTrp: 0.425 ± 0.185
IleTyr: 1.616 ± 0.306
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.23 ± 0.73
LysCys: 0.085 ± 0.093
LysAsp: 4.253 ± 0.774
LysGlu: 8.591 ± 1.182
LysPhe: 2.467 ± 0.346
LysGly: 3.572 ± 0.493
LysHis: 1.276 ± 0.361
LysIle: 6.464 ± 0.702
LysLys: 7.995 ± 0.856
LysLeu: 6.89 ± 0.855
LysMet: 3.572 ± 0.448
LysAsn: 5.954 ± 0.655
LysPro: 2.892 ± 0.672
LysGln: 4.678 ± 0.59
LysArg: 4.338 ± 0.704
LysSer: 5.699 ± 0.515
LysThr: 5.188 ± 0.788
LysVal: 4.678 ± 0.585
LysTrp: 1.531 ± 0.343
LysTyr: 2.552 ± 0.397
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.614 ± 0.757
LeuCys: 0.68 ± 0.332
LeuAsp: 5.954 ± 0.713
LeuGlu: 8.335 ± 0.808
LeuPhe: 3.828 ± 0.795
LeuGly: 5.188 ± 1.001
LeuHis: 1.106 ± 0.329
LeuIle: 4.848 ± 0.693
LeuLys: 7.995 ± 0.865
LeuLeu: 6.039 ± 0.826
LeuMet: 1.956 ± 0.377
LeuAsn: 5.018 ± 0.693
LeuPro: 2.211 ± 0.447
LeuGln: 3.402 ± 0.709
LeuArg: 3.657 ± 0.598
LeuSer: 5.273 ± 0.68
LeuThr: 4.763 ± 0.63
LeuVal: 5.699 ± 0.708
LeuTrp: 0.51 ± 0.202
LeuTyr: 2.807 ± 0.5
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.701 ± 0.534
MetCys: 0.085 ± 0.078
MetAsp: 1.616 ± 0.442
MetGlu: 2.041 ± 0.383
MetPhe: 0.936 ± 0.269
MetGly: 0.766 ± 0.245
MetHis: 0.255 ± 0.141
MetIle: 2.467 ± 0.388
MetLys: 2.467 ± 0.4
MetLeu: 1.956 ± 0.585
MetMet: 0.34 ± 0.151
MetAsn: 2.382 ± 0.406
MetPro: 0.851 ± 0.251
MetGln: 0.936 ± 0.353
MetArg: 0.936 ± 0.344
MetSer: 2.467 ± 0.462
MetThr: 2.297 ± 0.381
MetVal: 1.361 ± 0.31
MetTrp: 0.085 ± 0.096
MetTyr: 0.255 ± 0.132
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.083 ± 0.95
AsnCys: 0.51 ± 0.267
AsnAsp: 4.083 ± 0.682
AsnGlu: 3.998 ± 0.669
AsnPhe: 2.637 ± 0.448
AsnGly: 6.294 ± 0.864
AsnHis: 0.936 ± 0.255
AsnIle: 2.977 ± 0.544
AsnLys: 5.529 ± 0.659
AsnLeu: 4.423 ± 0.707
AsnMet: 1.531 ± 0.338
AsnAsn: 2.892 ± 0.432
AsnPro: 2.467 ± 0.491
AsnGln: 3.572 ± 0.516
AsnArg: 1.531 ± 0.392
AsnSer: 3.657 ± 0.524
AsnThr: 3.232 ± 0.569
AsnVal: 3.317 ± 0.408
AsnTrp: 0.766 ± 0.218
AsnTyr: 1.276 ± 0.342
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.276 ± 0.318
ProCys: 0.0 ± 0.0
ProAsp: 1.446 ± 0.351
ProGlu: 2.977 ± 0.606
ProPhe: 1.616 ± 0.366
ProGly: 1.361 ± 0.344
ProHis: 0.34 ± 0.179
ProIle: 1.701 ± 0.398
ProLys: 2.126 ± 0.602
ProLeu: 2.552 ± 0.544
ProMet: 0.595 ± 0.215
ProAsn: 1.191 ± 0.282
ProPro: 0.68 ± 0.271
ProGln: 1.701 ± 0.406
ProArg: 1.021 ± 0.248
ProSer: 1.956 ± 0.425
ProThr: 2.977 ± 0.437
ProVal: 2.467 ± 0.425
ProTrp: 0.425 ± 0.149
ProTyr: 0.766 ± 0.33
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.998 ± 0.811
GlnCys: 0.17 ± 0.128
GlnAsp: 1.361 ± 0.394
GlnGlu: 3.062 ± 0.398
GlnPhe: 2.552 ± 0.429
GlnGly: 3.487 ± 0.527
GlnHis: 0.51 ± 0.163
GlnIle: 2.467 ± 0.459
GlnLys: 4.508 ± 0.633
GlnLeu: 5.018 ± 0.707
GlnMet: 1.021 ± 0.297
GlnAsn: 2.637 ± 0.537
GlnPro: 1.276 ± 0.468
GlnGln: 1.786 ± 0.497
GlnArg: 1.786 ± 0.388
GlnSer: 2.722 ± 0.496
GlnThr: 2.552 ± 0.554
GlnVal: 1.276 ± 0.298
GlnTrp: 0.255 ± 0.137
GlnTyr: 1.361 ± 0.337
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.041 ± 0.42
ArgCys: 0.085 ± 0.089
ArgAsp: 2.977 ± 0.61
ArgGlu: 3.572 ± 0.61
ArgPhe: 1.531 ± 0.428
ArgGly: 1.956 ± 0.421
ArgHis: 1.021 ± 0.281
ArgIle: 2.297 ± 0.489
ArgLys: 3.062 ± 0.497
ArgLeu: 3.572 ± 0.563
ArgMet: 1.276 ± 0.35
ArgAsn: 2.041 ± 0.355
ArgPro: 0.851 ± 0.237
ArgGln: 1.276 ± 0.361
ArgArg: 1.361 ± 0.38
ArgSer: 1.871 ± 0.497
ArgThr: 2.297 ± 0.423
ArgVal: 2.126 ± 0.557
ArgTrp: 0.425 ± 0.163
ArgTyr: 2.297 ± 0.328
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.253 ± 0.917
SerCys: 0.17 ± 0.113
SerAsp: 3.998 ± 0.731
SerGlu: 4.933 ± 0.628
SerPhe: 3.062 ± 0.462
SerGly: 3.572 ± 0.579
SerHis: 0.51 ± 0.184
SerIle: 4.253 ± 0.714
SerLys: 4.848 ± 0.65
SerLeu: 4.848 ± 0.512
SerMet: 1.701 ± 0.418
SerAsn: 3.487 ± 0.54
SerPro: 1.701 ± 0.322
SerGln: 1.956 ± 0.38
SerArg: 2.297 ± 0.366
SerSer: 3.062 ± 0.582
SerThr: 3.487 ± 0.611
SerVal: 4.423 ± 0.616
SerTrp: 0.34 ± 0.13
SerTyr: 2.211 ± 0.362
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.529 ± 1.138
ThrCys: 0.085 ± 0.074
ThrAsp: 2.892 ± 0.435
ThrGlu: 3.402 ± 0.558
ThrPhe: 2.977 ± 0.57
ThrGly: 3.998 ± 0.53
ThrHis: 0.51 ± 0.207
ThrIle: 4.338 ± 0.614
ThrLys: 4.338 ± 0.598
ThrLeu: 5.614 ± 0.659
ThrMet: 1.191 ± 0.3
ThrAsn: 3.402 ± 0.558
ThrPro: 2.807 ± 0.423
ThrGln: 2.552 ± 0.489
ThrArg: 2.467 ± 0.39
ThrSer: 2.892 ± 0.432
ThrThr: 3.657 ± 0.511
ThrVal: 3.998 ± 0.629
ThrTrp: 0.68 ± 0.234
ThrTyr: 1.106 ± 0.318
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.763 ± 0.593
ValCys: 0.595 ± 0.202
ValAsp: 4.253 ± 0.54
ValGlu: 4.933 ± 0.774
ValPhe: 2.041 ± 0.376
ValGly: 4.508 ± 0.523
ValHis: 0.936 ± 0.311
ValIle: 3.402 ± 0.511
ValLys: 5.699 ± 0.653
ValLeu: 3.998 ± 0.646
ValMet: 1.276 ± 0.307
ValAsn: 3.572 ± 0.563
ValPro: 1.531 ± 0.352
ValGln: 2.722 ± 0.456
ValArg: 2.892 ± 0.481
ValSer: 3.487 ± 0.605
ValThr: 4.083 ± 0.802
ValVal: 4.763 ± 0.642
ValTrp: 0.255 ± 0.134
ValTyr: 1.701 ± 0.299
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.595 ± 0.268
TrpCys: 0.085 ± 0.076
TrpAsp: 0.51 ± 0.199
TrpGlu: 0.425 ± 0.156
TrpPhe: 0.595 ± 0.223
TrpGly: 0.936 ± 0.231
TrpHis: 0.17 ± 0.13
TrpIle: 0.851 ± 0.272
TrpLys: 0.766 ± 0.225
TrpLeu: 0.936 ± 0.3
TrpMet: 0.085 ± 0.08
TrpAsn: 1.021 ± 0.298
TrpPro: 0.17 ± 0.115
TrpGln: 0.68 ± 0.192
TrpArg: 0.34 ± 0.171
TrpSer: 0.34 ± 0.207
TrpThr: 0.68 ± 0.248
TrpVal: 0.766 ± 0.251
TrpTrp: 0.085 ± 0.074
TrpTyr: 0.34 ± 0.181
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.956 ± 0.317
TyrCys: 0.085 ± 0.082
TyrAsp: 2.722 ± 0.501
TyrGlu: 3.232 ± 0.473
TyrPhe: 1.531 ± 0.352
TyrGly: 1.106 ± 0.335
TyrHis: 0.51 ± 0.231
TyrIle: 1.616 ± 0.493
TyrLys: 2.552 ± 0.382
TyrLeu: 3.657 ± 0.585
TyrMet: 0.34 ± 0.17
TyrAsn: 1.956 ± 0.362
TyrPro: 0.851 ± 0.315
TyrGln: 1.616 ± 0.397
TyrArg: 0.936 ± 0.303
TyrSer: 1.531 ± 0.308
TyrThr: 1.361 ± 0.325
TyrVal: 1.701 ± 0.38
TyrTrp: 0.34 ± 0.14
TyrTyr: 0.766 ± 0.251
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (11758 amino acids)
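Each table entry above carries a dipeptide name, a value, and an error in the form "GluLeu: 8.846 ± 0.976". For readers who want to work with the listing as text, here is a minimal Python sketch that parses such lines into a dictionary. The function name `parse_entries` is an assumption for this example, and the sample lines are copied from the table; a leading number fused to the dipeptide name, if present, is simply ignored.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_entries(lines):
    """Map (first residue, second residue) to (value, error), both in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:  # row labels such as "Ala" do not match and are skipped
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


# Sample lines copied from the table; the last one keeps a fused leading value.
sample = [
    "Ala",
    "AlaAla: 3.998 ± 1.025",
    "CysCys: 0.085 ± 0.102",
    "GluLeu: 8.846 ± 0.976",
    "8.165GluLys: 8.165 ± 1.007",
]
table = parse_entries(sample)

# Most frequent dipeptide among the parsed sample lines.
top = max(table, key=lambda pair: table[pair][0])
print(top, table[top])
```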

Note: The error was estimated by bootstrapping (×100) at the protein level.
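A protein-level bootstrap of this kind can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates serves as the error. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are assumptions for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of one
    dipeptide over n_rounds resamples of whole proteins.

    Assumption: the error is the standard deviation of the bootstrap estimates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKELLKE", "MAKE", "KEKEKE", "MLLAG"]
mean, error = bootstrap_error(toy, "KE")
print(f"KE: {mean:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins in the proteome.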

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski