Amino acid dipeptide frequency for Streptococcus phage Javan318

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
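As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function names, the toy sequences, and the choice to normalise by the total number of dipeptide positions are assumptions made for illustration; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter
from itertools import product

# 20 standard amino acids (one-letter code); any other symbol is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein (never across protein boundaries)."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return a dict with all 441 dipeptides and their frequency in per mille.

    Assumption: the denominator is the total number of dipeptide positions.
    """
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy proteome, not taken from Streptococcus phage Javan318.
toy_proteome = ["MAAK", "MKAA"]
freqs = dipeptide_per_mille(toy_proteome)
print(f"AA: {freqs['AA']:.3f} per mille")
```

Keys use one-letter codes, so "AA" corresponds to AlaAla in the table.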

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
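The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the function names are hypothetical, and the page does not state whether the colouring uses the 400 or the 441 baseline or whether the bootstrap error is taken into account, so a plain comparison against the 400-dipeptide baseline is assumed here.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected_per_mille):
    """Label a dipeptide as over- or underrepresented relative to the expectation."""
    if value_per_mille > expected_per_mille:
        return "over"   # would be shown in red
    if value_per_mille < expected_per_mille:
        return "under"  # would be shown in blue
    return "as expected"


expected = uniform_expectation_per_mille(20)  # 400 dipeptides

# Values copied from the table for this proteome (per mille).
examples = {"AlaGlu": 7.277, "AlaCys": 0.342, "GlyPro": 0.171}
labels = {name: classify(value, expected) for name, value in examples.items()}
print(expected, labels)
```

With the 21-letter alphabet the baseline drops to 1000/441, roughly 2.27‰, which would change the label only for values between the two thresholds.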

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
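A short sketch of the unit conversion is given below. The helper names are hypothetical. The count estimate additionally assumes that the frequencies are normalised by the 11681 amino acids reported for this proteome, which the page does not state explicitly.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value_per_mille * 0.1


def approximate_count(value_per_mille, denominator):
    """Estimate the raw number of occurrences for an assumed denominator."""
    return round(per_mille_to_fraction(value_per_mille) * denominator)


ala_ala = 3.938  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
# Assumption: the denominator is the 11681 amino acids of this proteome.
print(approximate_count(ala_ala, 11681))
```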

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.938 ± 1.348
AlaCys: 0.342 ± 0.172
AlaAsp: 3.51 ± 0.503
AlaGlu: 7.277 ± 1.134
AlaPhe: 2.74 ± 0.526
AlaGly: 4.709 ± 0.744
AlaHis: 0.856 ± 0.233
AlaIle: 5.051 ± 0.896
AlaLys: 5.651 ± 0.707
AlaLeu: 6.849 ± 0.877
AlaMet: 1.969 ± 0.667
AlaAsn: 4.195 ± 0.7
AlaPro: 1.884 ± 0.442
AlaGln: 2.483 ± 0.519
AlaArg: 2.825 ± 0.503
AlaSer: 5.651 ± 0.821
AlaThr: 4.452 ± 0.676
AlaVal: 3.596 ± 0.497
AlaTrp: 0.771 ± 0.214
AlaTyr: 2.825 ± 0.536
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.428 ± 0.19
CysCys: 0.086 ± 0.094
CysAsp: 0.086 ± 0.094
CysGlu: 0.514 ± 0.165
CysPhe: 0.257 ± 0.137
CysGly: 0.342 ± 0.132
CysHis: 0.0 ± 0.0
CysIle: 0.086 ± 0.071
CysLys: 0.257 ± 0.138
CysLeu: 0.599 ± 0.218
CysMet: 0.086 ± 0.098
CysAsn: 0.0 ± 0.0
CysPro: 0.171 ± 0.143
CysGln: 0.257 ± 0.126
CysArg: 0.0 ± 0.0
CysSer: 0.342 ± 0.18
CysThr: 0.171 ± 0.111
CysVal: 0.342 ± 0.17
CysTrp: 0.171 ± 0.118
CysTyr: 0.257 ± 0.153
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.137 ± 0.5
AspCys: 0.257 ± 0.137
AspAsp: 4.281 ± 0.597
AspGlu: 4.024 ± 0.596
AspPhe: 3.253 ± 0.544
AspGly: 6.25 ± 0.695
AspHis: 0.342 ± 0.147
AspIle: 4.281 ± 0.489
AspLys: 5.308 ± 0.672
AspLeu: 4.452 ± 0.678
AspMet: 1.627 ± 0.336
AspAsn: 4.024 ± 0.625
AspPro: 0.942 ± 0.33
AspGln: 1.284 ± 0.329
AspArg: 2.825 ± 0.409
AspSer: 3.253 ± 0.547
AspThr: 3.767 ± 0.48
AspVal: 3.425 ± 0.599
AspTrp: 0.257 ± 0.146
AspTyr: 3.253 ± 0.689
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.195 ± 0.62
GluCys: 0.257 ± 0.152
GluAsp: 3.767 ± 0.582
GluGlu: 6.764 ± 1.036
GluPhe: 2.74 ± 0.488
GluGly: 2.568 ± 0.484
GluHis: 0.685 ± 0.214
GluIle: 6.25 ± 0.891
GluLys: 5.908 ± 0.655
GluLeu: 7.962 ± 1.044
GluMet: 1.969 ± 0.38
GluAsn: 5.479 ± 0.674
GluPro: 1.455 ± 0.534
GluGln: 4.709 ± 0.666
GluArg: 3.682 ± 0.647
GluSer: 3.425 ± 0.581
GluThr: 3.938 ± 0.496
GluVal: 5.137 ± 1.091
GluTrp: 1.027 ± 0.251
GluTyr: 3.168 ± 0.648
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.483 ± 0.468
PheCys: 0.257 ± 0.125
PheAsp: 4.11 ± 0.489
PheGlu: 4.024 ± 0.696
PhePhe: 1.455 ± 0.392
PheGly: 2.483 ± 0.511
PheHis: 0.685 ± 0.25
PheIle: 2.911 ± 0.487
PheLys: 2.825 ± 0.481
PheLeu: 2.226 ± 0.409
PheMet: 0.856 ± 0.283
PheAsn: 2.74 ± 0.416
PhePro: 1.113 ± 0.278
PheGln: 1.798 ± 0.467
PheArg: 1.37 ± 0.282
PheSer: 2.911 ± 0.642
PheThr: 2.312 ± 0.506
PheVal: 2.483 ± 0.499
PheTrp: 0.428 ± 0.202
PheTyr: 1.627 ± 0.398
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.682 ± 0.817
GlyCys: 0.428 ± 0.215
GlyAsp: 2.483 ± 0.49
GlyGlu: 3.767 ± 0.374
GlyPhe: 2.397 ± 0.578
GlyGly: 4.281 ± 0.543
GlyHis: 1.37 ± 0.314
GlyIle: 5.308 ± 0.749
GlyLys: 4.623 ± 0.59
GlyLeu: 4.88 ± 0.58
GlyMet: 2.14 ± 0.384
GlyAsn: 5.137 ± 1.613
GlyPro: 0.171 ± 0.157
GlyGln: 2.911 ± 0.505
GlyArg: 2.911 ± 0.491
GlySer: 4.366 ± 0.696
GlyThr: 4.281 ± 0.677
GlyVal: 4.538 ± 0.749
GlyTrp: 1.199 ± 0.539
GlyTyr: 2.654 ± 0.445
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.856 ± 0.269
HisCys: 0.171 ± 0.11
HisAsp: 0.942 ± 0.347
HisGlu: 0.942 ± 0.265
HisPhe: 0.771 ± 0.292
HisGly: 0.685 ± 0.252
HisHis: 0.428 ± 0.203
HisIle: 0.856 ± 0.312
HisLys: 1.541 ± 0.384
HisLeu: 1.027 ± 0.293
HisMet: 0.342 ± 0.166
HisAsn: 0.942 ± 0.289
HisPro: 0.514 ± 0.303
HisGln: 0.685 ± 0.163
HisArg: 0.599 ± 0.258
HisSer: 1.199 ± 0.339
HisThr: 0.942 ± 0.338
HisVal: 0.514 ± 0.237
HisTrp: 0.171 ± 0.106
HisTyr: 0.942 ± 0.364
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.137 ± 0.718
IleCys: 0.171 ± 0.125
IleAsp: 4.966 ± 0.674
IleGlu: 6.421 ± 0.769
IlePhe: 1.798 ± 0.452
IleGly: 4.195 ± 0.509
IleHis: 1.37 ± 0.267
IleIle: 3.596 ± 0.569
IleLys: 6.678 ± 0.778
IleLeu: 4.024 ± 0.667
IleMet: 1.199 ± 0.296
IleAsn: 4.11 ± 0.5
IlePro: 2.397 ± 0.618
IleGln: 2.997 ± 0.562
IleArg: 2.055 ± 0.502
IleSer: 4.366 ± 0.44
IleThr: 4.11 ± 0.54
IleVal: 4.024 ± 0.53
IleTrp: 0.685 ± 0.3
IleTyr: 2.055 ± 0.432
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.25 ± 0.84
LysCys: 0.428 ± 0.201
LysAsp: 3.938 ± 0.54
LysGlu: 6.935 ± 0.955
LysPhe: 2.483 ± 0.458
LysGly: 4.795 ± 0.518
LysHis: 1.541 ± 0.425
LysIle: 4.452 ± 0.569
LysLys: 7.705 ± 0.94
LysLeu: 5.822 ± 0.732
LysMet: 2.055 ± 0.479
LysAsn: 5.993 ± 0.848
LysPro: 2.055 ± 0.451
LysGln: 3.168 ± 0.537
LysArg: 3.168 ± 0.499
LysSer: 5.651 ± 0.795
LysThr: 5.565 ± 0.735
LysVal: 5.736 ± 0.709
LysTrp: 1.027 ± 0.269
LysTyr: 2.74 ± 0.51
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.849 ± 0.708
LeuCys: 0.086 ± 0.086
LeuAsp: 6.079 ± 0.662
LeuGlu: 6.849 ± 1.077
LeuPhe: 2.825 ± 0.459
LeuGly: 4.623 ± 0.71
LeuHis: 0.942 ± 0.269
LeuIle: 3.853 ± 0.661
LeuLys: 7.877 ± 0.906
LeuLeu: 5.308 ± 0.659
LeuMet: 1.541 ± 0.357
LeuAsn: 5.565 ± 0.868
LeuPro: 2.312 ± 0.396
LeuGln: 3.682 ± 0.561
LeuArg: 2.226 ± 0.483
LeuSer: 6.079 ± 0.62
LeuThr: 6.164 ± 0.712
LeuVal: 4.195 ± 0.6
LeuTrp: 0.428 ± 0.205
LeuTyr: 2.312 ± 0.297
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.627 ± 0.39
MetCys: 0.171 ± 0.133
MetAsp: 1.284 ± 0.322
MetGlu: 1.712 ± 0.396
MetPhe: 0.856 ± 0.27
MetGly: 1.027 ± 0.25
MetHis: 0.342 ± 0.153
MetIle: 1.969 ± 0.368
MetLys: 2.226 ± 0.34
MetLeu: 1.712 ± 0.317
MetMet: 0.428 ± 0.16
MetAsn: 0.685 ± 0.266
MetPro: 0.942 ± 0.291
MetGln: 1.627 ± 0.396
MetArg: 1.798 ± 0.432
MetSer: 1.37 ± 0.272
MetThr: 1.884 ± 0.348
MetVal: 1.541 ± 0.433
MetTrp: 0.257 ± 0.145
MetTyr: 0.685 ± 0.217
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.993 ± 1.413
AsnCys: 0.086 ± 0.076
AsnAsp: 3.51 ± 0.648
AsnGlu: 4.452 ± 0.791
AsnPhe: 2.312 ± 0.522
AsnGly: 4.88 ± 0.857
AsnHis: 1.284 ± 0.322
AsnIle: 3.168 ± 0.676
AsnLys: 4.709 ± 0.691
AsnLeu: 5.479 ± 0.755
AsnMet: 1.455 ± 0.364
AsnAsn: 3.682 ± 0.839
AsnPro: 2.397 ± 0.522
AsnGln: 2.312 ± 0.397
AsnArg: 2.226 ± 0.381
AsnSer: 4.195 ± 0.661
AsnThr: 3.082 ± 0.412
AsnVal: 3.938 ± 0.665
AsnTrp: 0.599 ± 0.21
AsnTyr: 1.969 ± 0.444
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.798 ± 0.388
ProCys: 0.171 ± 0.141
ProAsp: 1.884 ± 0.406
ProGlu: 1.627 ± 0.35
ProPhe: 1.541 ± 0.397
ProGly: 0.685 ± 0.232
ProHis: 0.171 ± 0.116
ProIle: 1.798 ± 0.35
ProLys: 2.226 ± 0.535
ProLeu: 1.798 ± 0.445
ProMet: 0.514 ± 0.238
ProAsn: 1.113 ± 0.28
ProPro: 0.428 ± 0.168
ProGln: 1.113 ± 0.299
ProArg: 1.027 ± 0.308
ProSer: 1.884 ± 0.341
ProThr: 1.798 ± 0.374
ProVal: 1.798 ± 0.352
ProTrp: 0.257 ± 0.148
ProTyr: 1.627 ± 0.371
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.767 ± 0.536
GlnCys: 0.257 ± 0.136
GlnAsp: 1.113 ± 0.31
GlnGlu: 3.168 ± 0.405
GlnPhe: 1.37 ± 0.283
GlnGly: 3.082 ± 0.693
GlnHis: 0.599 ± 0.209
GlnIle: 2.911 ± 0.476
GlnLys: 3.767 ± 0.566
GlnLeu: 3.425 ± 0.513
GlnMet: 1.37 ± 0.345
GlnAsn: 2.654 ± 0.524
GlnPro: 0.771 ± 0.233
GlnGln: 2.654 ± 0.663
GlnArg: 2.226 ± 0.419
GlnSer: 2.14 ± 0.511
GlnThr: 1.969 ± 0.525
GlnVal: 2.74 ± 0.463
GlnTrp: 0.514 ± 0.193
GlnTyr: 2.055 ± 0.425
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.654 ± 0.426
ArgCys: 0.171 ± 0.119
ArgAsp: 2.14 ± 0.417
ArgGlu: 2.14 ± 0.409
ArgPhe: 2.055 ± 0.39
ArgGly: 2.911 ± 0.503
ArgHis: 0.771 ± 0.212
ArgIle: 2.483 ± 0.484
ArgLys: 2.911 ± 0.469
ArgLeu: 4.11 ± 0.591
ArgMet: 0.856 ± 0.268
ArgAsn: 3.082 ± 0.54
ArgPro: 1.712 ± 0.363
ArgGln: 1.627 ± 0.349
ArgArg: 1.798 ± 0.467
ArgSer: 1.199 ± 0.301
ArgThr: 1.969 ± 0.395
ArgVal: 3.168 ± 0.512
ArgTrp: 1.027 ± 0.313
ArgTyr: 2.055 ± 0.487
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.11 ± 0.756
SerCys: 0.342 ± 0.167
SerAsp: 5.651 ± 0.759
SerGlu: 3.767 ± 0.711
SerPhe: 3.339 ± 0.501
SerGly: 2.997 ± 0.972
SerHis: 1.455 ± 0.397
SerIle: 3.596 ± 0.676
SerLys: 3.682 ± 0.532
SerLeu: 6.421 ± 0.777
SerMet: 1.455 ± 0.407
SerAsn: 3.767 ± 0.565
SerPro: 1.969 ± 0.371
SerGln: 2.568 ± 0.447
SerArg: 2.568 ± 0.384
SerSer: 3.853 ± 0.714
SerThr: 4.709 ± 0.589
SerVal: 4.623 ± 0.698
SerTrp: 1.027 ± 0.275
SerTyr: 1.969 ± 0.371
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.88 ± 1.1
ThrCys: 0.342 ± 0.156
ThrAsp: 3.339 ± 0.658
ThrGlu: 4.281 ± 0.575
ThrPhe: 3.425 ± 0.445
ThrGly: 4.795 ± 0.633
ThrHis: 0.342 ± 0.182
ThrIle: 6.079 ± 0.703
ThrLys: 4.452 ± 0.688
ThrLeu: 5.308 ± 0.762
ThrMet: 1.627 ± 0.394
ThrAsn: 3.168 ± 0.396
ThrPro: 1.969 ± 0.358
ThrGln: 2.397 ± 0.404
ThrArg: 2.568 ± 0.551
ThrSer: 2.397 ± 0.454
ThrThr: 3.682 ± 0.449
ThrVal: 4.795 ± 0.669
ThrTrp: 1.113 ± 0.337
ThrTyr: 2.397 ± 0.376
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.966 ± 0.636
ValCys: 0.086 ± 0.095
ValAsp: 5.565 ± 0.715
ValGlu: 3.596 ± 0.768
ValPhe: 2.911 ± 0.458
ValGly: 3.767 ± 0.467
ValHis: 0.685 ± 0.253
ValIle: 4.795 ± 0.576
ValLys: 4.966 ± 0.646
ValLeu: 3.339 ± 0.492
ValMet: 1.455 ± 0.416
ValAsn: 2.654 ± 0.506
ValPro: 0.856 ± 0.236
ValGln: 2.14 ± 0.465
ValArg: 2.14 ± 0.314
ValSer: 5.565 ± 0.783
ValThr: 5.736 ± 0.705
ValVal: 4.538 ± 0.77
ValTrp: 1.712 ± 0.857
ValTyr: 2.483 ± 0.551
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.199 ± 0.224
TrpCys: 0.0 ± 0.0
TrpAsp: 0.771 ± 0.28
TrpGlu: 0.771 ± 0.222
TrpPhe: 0.685 ± 0.331
TrpGly: 1.541 ± 0.922
TrpHis: 0.171 ± 0.128
TrpIle: 0.514 ± 0.217
TrpLys: 0.856 ± 0.233
TrpLeu: 0.942 ± 0.285
TrpMet: 0.342 ± 0.165
TrpAsn: 0.599 ± 0.166
TrpPro: 0.171 ± 0.142
TrpGln: 0.685 ± 0.245
TrpArg: 0.599 ± 0.212
TrpSer: 0.856 ± 0.286
TrpThr: 0.342 ± 0.203
TrpVal: 0.771 ± 0.192
TrpTrp: 0.257 ± 0.181
TrpTyr: 1.027 ± 0.459
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.627 ± 0.352
TyrCys: 0.342 ± 0.228
TyrAsp: 2.825 ± 0.594
TyrGlu: 2.14 ± 0.399
TyrPhe: 1.884 ± 0.526
TyrGly: 2.911 ± 0.535
TyrHis: 1.027 ± 0.334
TyrIle: 2.397 ± 0.422
TyrLys: 3.339 ± 0.663
TyrLeu: 4.11 ± 0.737
TyrMet: 0.771 ± 0.272
TyrAsn: 2.14 ± 0.399
TyrPro: 1.027 ± 0.339
TyrGln: 1.455 ± 0.361
TyrArg: 2.226 ± 0.496
TyrSer: 3.168 ± 0.472
TyrThr: 2.483 ± 0.564
TyrVal: 2.055 ± 0.358
TyrTrp: 0.171 ± 0.112
TyrTyr: 2.397 ± 0.485
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11681 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
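One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are assumptions; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes), counted within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(resample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the 56 proteins of this phage.
toy_proteome = ["MAAK", "MKAA", "MGGG", "MAKG"]
value = per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {value:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides are never formed across protein boundaries.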

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski