Amino acid dipepetide frequency for Streptococcus phage Javan374

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.656AlaAla: 3.656 ± 0.996
0.249AlaCys: 0.249 ± 0.14
4.737AlaAsp: 4.737 ± 0.753
4.986AlaGlu: 4.986 ± 0.966
2.659AlaPhe: 2.659 ± 0.488
3.906AlaGly: 3.906 ± 0.723
1.33AlaHis: 1.33 ± 0.313
5.401AlaIle: 5.401 ± 0.926
5.484AlaLys: 5.484 ± 0.956
6.399AlaLeu: 6.399 ± 0.773
2.576AlaMet: 2.576 ± 0.678
3.989AlaAsn: 3.989 ± 0.532
1.496AlaPro: 1.496 ± 0.387
3.158AlaGln: 3.158 ± 0.591
2.41AlaArg: 2.41 ± 0.571
4.321AlaSer: 4.321 ± 0.723
3.656AlaThr: 3.656 ± 0.756
5.152AlaVal: 5.152 ± 0.908
0.748AlaTrp: 0.748 ± 0.276
1.496AlaTyr: 1.496 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
0.166CysAla: 0.166 ± 0.113
0.0CysCys: 0.0 ± 0.0
0.249CysAsp: 0.249 ± 0.122
0.415CysGlu: 0.415 ± 0.253
0.249CysPhe: 0.249 ± 0.143
0.665CysGly: 0.665 ± 0.261
0.083CysHis: 0.083 ± 0.1
0.166CysIle: 0.166 ± 0.104
0.748CysLys: 0.748 ± 0.237
0.166CysLeu: 0.166 ± 0.112
0.083CysMet: 0.083 ± 0.083
0.332CysAsn: 0.332 ± 0.213
0.083CysPro: 0.083 ± 0.098
0.415CysGln: 0.415 ± 0.205
0.499CysArg: 0.499 ± 0.184
0.166CysSer: 0.166 ± 0.111
0.332CysThr: 0.332 ± 0.171
0.499CysVal: 0.499 ± 0.184
0.083CysTrp: 0.083 ± 0.079
0.332CysTyr: 0.332 ± 0.168
0.0CysXaa: 0.0 ± 0.0
Asp
3.075AspAla: 3.075 ± 0.561
0.249AspCys: 0.249 ± 0.151
3.158AspAsp: 3.158 ± 0.568
4.737AspGlu: 4.737 ± 0.58
3.656AspPhe: 3.656 ± 0.511
4.653AspGly: 4.653 ± 0.575
1.08AspHis: 1.08 ± 0.342
4.82AspIle: 4.82 ± 0.813
6.149AspLys: 6.149 ± 0.612
5.817AspLeu: 5.817 ± 0.733
1.413AspMet: 1.413 ± 0.353
2.659AspAsn: 2.659 ± 0.457
2.161AspPro: 2.161 ± 0.465
2.659AspGln: 2.659 ± 0.389
2.244AspArg: 2.244 ± 0.565
3.324AspSer: 3.324 ± 0.599
2.244AspThr: 2.244 ± 0.525
4.82AspVal: 4.82 ± 0.626
0.831AspTrp: 0.831 ± 0.277
2.493AspTyr: 2.493 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
5.152GluAla: 5.152 ± 0.658
0.997GluCys: 0.997 ± 0.383
4.653GluAsp: 4.653 ± 0.685
6.565GluGlu: 6.565 ± 0.821
3.407GluPhe: 3.407 ± 0.648
3.075GluGly: 3.075 ± 0.394
0.831GluHis: 0.831 ± 0.222
6.232GluIle: 6.232 ± 0.834
6.232GluLys: 6.232 ± 0.683
8.476GluLeu: 8.476 ± 0.928
2.077GluMet: 2.077 ± 0.623
5.401GluAsn: 5.401 ± 0.659
1.496GluPro: 1.496 ± 0.436
4.072GluGln: 4.072 ± 0.613
2.908GluArg: 2.908 ± 0.54
4.737GluSer: 4.737 ± 0.883
3.324GluThr: 3.324 ± 0.598
4.653GluVal: 4.653 ± 0.511
1.662GluTrp: 1.662 ± 0.373
3.075GluTyr: 3.075 ± 0.592
0.0GluXaa: 0.0 ± 0.0
Phe
3.158PheAla: 3.158 ± 0.512
0.415PheCys: 0.415 ± 0.172
3.407PheAsp: 3.407 ± 0.504
4.404PheGlu: 4.404 ± 0.703
1.994PhePhe: 1.994 ± 0.46
4.155PheGly: 4.155 ± 0.703
0.499PheHis: 0.499 ± 0.184
2.659PheIle: 2.659 ± 0.597
3.656PheLys: 3.656 ± 0.5
3.241PheLeu: 3.241 ± 0.689
1.08PheMet: 1.08 ± 0.311
3.241PheAsn: 3.241 ± 0.632
1.413PhePro: 1.413 ± 0.422
1.662PheGln: 1.662 ± 0.348
1.828PheArg: 1.828 ± 0.445
2.908PheSer: 2.908 ± 0.474
2.659PheThr: 2.659 ± 0.587
2.41PheVal: 2.41 ± 0.682
0.332PheTrp: 0.332 ± 0.166
1.33PheTyr: 1.33 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
3.324GlyAla: 3.324 ± 0.628
0.166GlyCys: 0.166 ± 0.123
4.487GlyAsp: 4.487 ± 0.497
3.739GlyGlu: 3.739 ± 0.584
4.155GlyPhe: 4.155 ± 0.714
4.238GlyGly: 4.238 ± 0.726
0.748GlyHis: 0.748 ± 0.282
4.903GlyIle: 4.903 ± 0.731
6.149GlyLys: 6.149 ± 0.726
5.817GlyLeu: 5.817 ± 0.751
1.745GlyMet: 1.745 ± 0.452
3.656GlyAsn: 3.656 ± 0.509
1.163GlyPro: 1.163 ± 0.257
2.576GlyGln: 2.576 ± 0.445
2.493GlyArg: 2.493 ± 0.325
4.57GlySer: 4.57 ± 0.617
3.739GlyThr: 3.739 ± 0.638
3.158GlyVal: 3.158 ± 0.756
0.997GlyTrp: 0.997 ± 0.288
2.327GlyTyr: 2.327 ± 0.478
0.0GlyXaa: 0.0 ± 0.0
His
0.997HisAla: 0.997 ± 0.259
0.166HisCys: 0.166 ± 0.121
0.997HisAsp: 0.997 ± 0.396
1.246HisGlu: 1.246 ± 0.3
1.163HisPhe: 1.163 ± 0.353
0.997HisGly: 0.997 ± 0.338
0.415HisHis: 0.415 ± 0.194
1.163HisIle: 1.163 ± 0.306
0.665HisLys: 0.665 ± 0.202
0.748HisLeu: 0.748 ± 0.27
0.332HisMet: 0.332 ± 0.222
0.415HisAsn: 0.415 ± 0.234
0.582HisPro: 0.582 ± 0.225
0.249HisGln: 0.249 ± 0.148
0.748HisArg: 0.748 ± 0.26
0.831HisSer: 0.831 ± 0.29
0.914HisThr: 0.914 ± 0.234
0.748HisVal: 0.748 ± 0.236
0.0HisTrp: 0.0 ± 0.0
0.499HisTyr: 0.499 ± 0.271
0.0HisXaa: 0.0 ± 0.0
Ile
5.568IleAla: 5.568 ± 0.77
0.748IleCys: 0.748 ± 0.276
4.986IleAsp: 4.986 ± 0.728
6.482IleGlu: 6.482 ± 0.824
3.407IlePhe: 3.407 ± 0.711
4.737IleGly: 4.737 ± 0.688
0.831IleHis: 0.831 ± 0.247
3.739IleIle: 3.739 ± 0.692
6.232IleLys: 6.232 ± 0.84
4.903IleLeu: 4.903 ± 0.687
1.662IleMet: 1.662 ± 0.361
3.49IleAsn: 3.49 ± 0.542
1.911IlePro: 1.911 ± 0.323
2.825IleGln: 2.825 ± 0.531
2.825IleArg: 2.825 ± 0.497
3.823IleSer: 3.823 ± 0.695
4.238IleThr: 4.238 ± 0.589
3.075IleVal: 3.075 ± 0.373
0.582IleTrp: 0.582 ± 0.246
2.825IleTyr: 2.825 ± 0.574
0.0IleXaa: 0.0 ± 0.0
Lys
6.149LysAla: 6.149 ± 0.851
0.083LysCys: 0.083 ± 0.089
4.487LysAsp: 4.487 ± 0.609
7.645LysGlu: 7.645 ± 0.917
3.075LysPhe: 3.075 ± 0.503
4.487LysGly: 4.487 ± 0.676
1.246LysHis: 1.246 ± 0.327
6.731LysIle: 6.731 ± 0.906
6.565LysLys: 6.565 ± 0.871
6.232LysLeu: 6.232 ± 0.764
3.324LysMet: 3.324 ± 0.484
6.149LysAsn: 6.149 ± 0.609
2.41LysPro: 2.41 ± 0.461
3.823LysGln: 3.823 ± 0.478
4.155LysArg: 4.155 ± 0.589
5.651LysSer: 5.651 ± 0.586
5.318LysThr: 5.318 ± 0.766
3.906LysVal: 3.906 ± 0.546
1.163LysTrp: 1.163 ± 0.299
2.825LysTyr: 2.825 ± 0.445
0.0LysXaa: 0.0 ± 0.0
Leu
5.9LeuAla: 5.9 ± 0.699
0.582LeuCys: 0.582 ± 0.281
6.066LeuAsp: 6.066 ± 0.818
6.814LeuGlu: 6.814 ± 0.826
3.906LeuPhe: 3.906 ± 0.784
5.235LeuGly: 5.235 ± 1.014
1.246LeuHis: 1.246 ± 0.328
4.653LeuIle: 4.653 ± 0.745
7.977LeuLys: 7.977 ± 0.862
5.9LeuLeu: 5.9 ± 0.891
1.994LeuMet: 1.994 ± 0.346
4.903LeuAsn: 4.903 ± 0.76
1.828LeuPro: 1.828 ± 0.358
3.739LeuGln: 3.739 ± 0.72
3.324LeuArg: 3.324 ± 0.585
6.315LeuSer: 6.315 ± 1.173
5.401LeuThr: 5.401 ± 0.589
5.069LeuVal: 5.069 ± 0.865
0.415LeuTrp: 0.415 ± 0.163
2.742LeuTyr: 2.742 ± 0.509
0.0LeuXaa: 0.0 ± 0.0
Met
1.828MetAla: 1.828 ± 0.544
0.0MetCys: 0.0 ± 0.0
1.163MetAsp: 1.163 ± 0.337
2.244MetGlu: 2.244 ± 0.408
0.748MetPhe: 0.748 ± 0.21
0.665MetGly: 0.665 ± 0.252
0.249MetHis: 0.249 ± 0.142
2.41MetIle: 2.41 ± 0.545
2.493MetLys: 2.493 ± 0.427
2.327MetLeu: 2.327 ± 0.524
0.415MetMet: 0.415 ± 0.166
2.244MetAsn: 2.244 ± 0.413
0.914MetPro: 0.914 ± 0.251
0.831MetGln: 0.831 ± 0.242
0.831MetArg: 0.831 ± 0.291
1.745MetSer: 1.745 ± 0.415
2.41MetThr: 2.41 ± 0.351
1.745MetVal: 1.745 ± 0.346
0.083MetTrp: 0.083 ± 0.082
0.166MetTyr: 0.166 ± 0.114
0.0MetXaa: 0.0 ± 0.0
Asn
3.241AsnAla: 3.241 ± 0.833
0.332AsnCys: 0.332 ± 0.159
3.158AsnAsp: 3.158 ± 0.538
3.823AsnGlu: 3.823 ± 0.883
2.077AsnPhe: 2.077 ± 0.355
6.232AsnGly: 6.232 ± 0.843
0.997AsnHis: 0.997 ± 0.218
3.407AsnIle: 3.407 ± 0.7
6.149AsnLys: 6.149 ± 0.704
4.903AsnLeu: 4.903 ± 0.627
1.662AsnMet: 1.662 ± 0.36
3.573AsnAsn: 3.573 ± 0.729
2.327AsnPro: 2.327 ± 0.447
3.241AsnGln: 3.241 ± 0.492
2.077AsnArg: 2.077 ± 0.417
4.072AsnSer: 4.072 ± 0.537
3.989AsnThr: 3.989 ± 0.492
3.324AsnVal: 3.324 ± 0.424
0.665AsnTrp: 0.665 ± 0.215
1.828AsnTyr: 1.828 ± 0.444
0.0AsnXaa: 0.0 ± 0.0
Pro
1.413ProAla: 1.413 ± 0.384
0.166ProCys: 0.166 ± 0.096
1.413ProAsp: 1.413 ± 0.36
3.241ProGlu: 3.241 ± 0.48
1.33ProPhe: 1.33 ± 0.346
1.33ProGly: 1.33 ± 0.359
0.249ProHis: 0.249 ± 0.133
2.161ProIle: 2.161 ± 0.42
1.745ProLys: 1.745 ± 0.373
2.244ProLeu: 2.244 ± 0.525
0.499ProMet: 0.499 ± 0.191
1.911ProAsn: 1.911 ± 0.36
0.582ProPro: 0.582 ± 0.233
1.08ProGln: 1.08 ± 0.372
0.997ProArg: 0.997 ± 0.264
1.994ProSer: 1.994 ± 0.443
2.493ProThr: 2.493 ± 0.559
1.662ProVal: 1.662 ± 0.336
0.415ProTrp: 0.415 ± 0.173
0.831ProTyr: 0.831 ± 0.349
0.0ProXaa: 0.0 ± 0.0
Gln
4.155GlnAla: 4.155 ± 0.753
0.415GlnCys: 0.415 ± 0.228
0.914GlnAsp: 0.914 ± 0.275
3.324GlnGlu: 3.324 ± 0.628
2.576GlnPhe: 2.576 ± 0.409
3.823GlnGly: 3.823 ± 0.558
0.582GlnHis: 0.582 ± 0.173
2.659GlnIle: 2.659 ± 0.441
4.321GlnLys: 4.321 ± 0.715
4.238GlnLeu: 4.238 ± 0.62
0.997GlnMet: 0.997 ± 0.261
2.327GlnAsn: 2.327 ± 0.441
1.08GlnPro: 1.08 ± 0.465
2.161GlnGln: 2.161 ± 0.548
1.911GlnArg: 1.911 ± 0.42
1.911GlnSer: 1.911 ± 0.42
2.825GlnThr: 2.825 ± 0.542
1.579GlnVal: 1.579 ± 0.36
0.332GlnTrp: 0.332 ± 0.166
1.163GlnTyr: 1.163 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
1.828ArgAla: 1.828 ± 0.329
0.083ArgCys: 0.083 ± 0.094
2.908ArgAsp: 2.908 ± 0.649
3.49ArgGlu: 3.49 ± 0.54
1.994ArgPhe: 1.994 ± 0.435
2.327ArgGly: 2.327 ± 0.548
0.997ArgHis: 0.997 ± 0.286
2.825ArgIle: 2.825 ± 0.625
3.241ArgLys: 3.241 ± 0.517
3.823ArgLeu: 3.823 ± 0.595
1.496ArgMet: 1.496 ± 0.404
2.576ArgAsn: 2.576 ± 0.501
1.33ArgPro: 1.33 ± 0.416
1.911ArgGln: 1.911 ± 0.388
1.994ArgArg: 1.994 ± 0.435
2.327ArgSer: 2.327 ± 0.459
2.161ArgThr: 2.161 ± 0.385
1.994ArgVal: 1.994 ± 0.462
0.499ArgTrp: 0.499 ± 0.174
1.662ArgTyr: 1.662 ± 0.32
0.0ArgXaa: 0.0 ± 0.0
Ser
4.903SerAla: 4.903 ± 1.036
0.415SerCys: 0.415 ± 0.185
3.989SerAsp: 3.989 ± 0.507
4.072SerGlu: 4.072 ± 0.51
2.742SerPhe: 2.742 ± 0.458
3.823SerGly: 3.823 ± 0.622
0.748SerHis: 0.748 ± 0.264
4.737SerIle: 4.737 ± 0.779
5.235SerLys: 5.235 ± 0.68
4.737SerLeu: 4.737 ± 0.603
1.828SerMet: 1.828 ± 0.446
4.072SerAsn: 4.072 ± 0.634
1.662SerPro: 1.662 ± 0.36
1.994SerGln: 1.994 ± 0.387
2.576SerArg: 2.576 ± 0.481
2.992SerSer: 2.992 ± 0.726
3.739SerThr: 3.739 ± 0.482
4.155SerVal: 4.155 ± 0.558
0.415SerTrp: 0.415 ± 0.173
2.742SerTyr: 2.742 ± 0.43
0.0SerXaa: 0.0 ± 0.0
Thr
5.9ThrAla: 5.9 ± 1.144
0.0ThrCys: 0.0 ± 0.0
3.573ThrAsp: 3.573 ± 0.464
4.072ThrGlu: 4.072 ± 0.493
3.075ThrPhe: 3.075 ± 0.459
3.739ThrGly: 3.739 ± 0.429
0.499ThrHis: 0.499 ± 0.225
3.739ThrIle: 3.739 ± 0.823
4.238ThrLys: 4.238 ± 0.571
5.152ThrLeu: 5.152 ± 0.737
0.831ThrMet: 0.831 ± 0.224
3.656ThrAsn: 3.656 ± 0.524
2.493ThrPro: 2.493 ± 0.609
2.161ThrGln: 2.161 ± 0.389
2.659ThrArg: 2.659 ± 0.457
3.906ThrSer: 3.906 ± 0.446
3.989ThrThr: 3.989 ± 0.543
4.155ThrVal: 4.155 ± 0.686
0.582ThrTrp: 0.582 ± 0.239
1.08ThrTyr: 1.08 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
4.653ValAla: 4.653 ± 0.683
0.332ValCys: 0.332 ± 0.149
4.487ValAsp: 4.487 ± 0.694
4.653ValGlu: 4.653 ± 0.638
1.911ValPhe: 1.911 ± 0.356
4.072ValGly: 4.072 ± 0.496
0.582ValHis: 0.582 ± 0.262
3.656ValIle: 3.656 ± 0.656
4.57ValLys: 4.57 ± 0.567
4.238ValLeu: 4.238 ± 0.752
0.997ValMet: 0.997 ± 0.298
3.407ValAsn: 3.407 ± 0.395
1.745ValPro: 1.745 ± 0.378
2.493ValGln: 2.493 ± 0.391
2.825ValArg: 2.825 ± 0.484
3.823ValSer: 3.823 ± 0.479
3.989ValThr: 3.989 ± 0.561
4.238ValVal: 4.238 ± 0.762
0.332ValTrp: 0.332 ± 0.228
1.745ValTyr: 1.745 ± 0.324
0.0ValXaa: 0.0 ± 0.0
Trp
0.665TrpAla: 0.665 ± 0.235
0.166TrpCys: 0.166 ± 0.114
0.332TrpAsp: 0.332 ± 0.146
0.582TrpGlu: 0.582 ± 0.196
0.499TrpPhe: 0.499 ± 0.243
0.914TrpGly: 0.914 ± 0.24
0.166TrpHis: 0.166 ± 0.14
0.914TrpIle: 0.914 ± 0.315
0.665TrpLys: 0.665 ± 0.206
0.914TrpLeu: 0.914 ± 0.242
0.083TrpMet: 0.083 ± 0.095
0.914TrpAsn: 0.914 ± 0.303
0.166TrpPro: 0.166 ± 0.121
0.582TrpGln: 0.582 ± 0.196
0.831TrpArg: 0.831 ± 0.271
0.415TrpSer: 0.415 ± 0.167
0.665TrpThr: 0.665 ± 0.246
0.582TrpVal: 0.582 ± 0.166
0.166TrpTrp: 0.166 ± 0.13
0.499TrpTyr: 0.499 ± 0.24
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.077TyrAla: 2.077 ± 0.319
0.166TyrCys: 0.166 ± 0.118
3.075TyrAsp: 3.075 ± 0.535
2.493TyrGlu: 2.493 ± 0.412
1.828TyrPhe: 1.828 ± 0.331
1.08TyrGly: 1.08 ± 0.328
0.499TyrHis: 0.499 ± 0.236
1.911TyrIle: 1.911 ± 0.445
2.992TyrLys: 2.992 ± 0.508
3.739TyrLeu: 3.739 ± 0.807
0.332TyrMet: 0.332 ± 0.202
2.161TyrAsn: 2.161 ± 0.314
0.914TyrPro: 0.914 ± 0.352
1.662TyrGln: 1.662 ± 0.417
1.33TyrArg: 1.33 ± 0.344
1.745TyrSer: 1.745 ± 0.34
1.413TyrThr: 1.413 ± 0.399
1.828TyrVal: 1.828 ± 0.492
0.415TyrTrp: 0.415 ± 0.172
1.08TyrTyr: 1.08 ± 0.345
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12035 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski