Amino acid dipepetide frequency for Streptococcus phage Javan392

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.242AlaAla: 1.242 ± 0.488
0.331AlaCys: 0.331 ± 0.145
4.638AlaAsp: 4.638 ± 0.54
4.473AlaGlu: 4.473 ± 0.654
2.651AlaPhe: 2.651 ± 0.41
4.473AlaGly: 4.473 ± 0.926
0.58AlaHis: 0.58 ± 0.28
5.053AlaIle: 5.053 ± 0.613
6.958AlaLys: 6.958 ± 0.845
6.378AlaLeu: 6.378 ± 0.781
1.739AlaMet: 1.739 ± 0.457
4.556AlaAsn: 4.556 ± 0.577
1.242AlaPro: 1.242 ± 0.337
3.065AlaGln: 3.065 ± 0.626
2.236AlaArg: 2.236 ± 0.353
5.55AlaSer: 5.55 ± 1.13
4.059AlaThr: 4.059 ± 0.698
4.556AlaVal: 4.556 ± 0.711
0.745AlaTrp: 0.745 ± 0.276
2.982AlaTyr: 2.982 ± 0.491
0.0AlaXaa: 0.0 ± 0.0
Cys
0.248CysAla: 0.248 ± 0.136
0.0CysCys: 0.0 ± 0.0
0.497CysAsp: 0.497 ± 0.216
0.248CysGlu: 0.248 ± 0.147
0.0CysPhe: 0.0 ± 0.0
0.745CysGly: 0.745 ± 0.411
0.166CysHis: 0.166 ± 0.118
0.331CysIle: 0.331 ± 0.19
0.994CysLys: 0.994 ± 0.534
0.331CysLeu: 0.331 ± 0.167
0.083CysMet: 0.083 ± 0.091
0.083CysAsn: 0.083 ± 0.078
0.414CysPro: 0.414 ± 0.169
0.248CysGln: 0.248 ± 0.146
0.497CysArg: 0.497 ± 0.313
0.58CysSer: 0.58 ± 0.252
0.497CysThr: 0.497 ± 0.201
0.497CysVal: 0.497 ± 0.208
0.166CysTrp: 0.166 ± 0.111
0.248CysTyr: 0.248 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
3.396AspAla: 3.396 ± 0.651
0.331AspCys: 0.331 ± 0.184
4.141AspAsp: 4.141 ± 0.529
4.887AspGlu: 4.887 ± 0.841
4.141AspPhe: 4.141 ± 0.574
4.638AspGly: 4.638 ± 0.564
0.745AspHis: 0.745 ± 0.271
5.301AspIle: 5.301 ± 0.815
4.638AspLys: 4.638 ± 0.652
4.97AspLeu: 4.97 ± 0.64
1.822AspMet: 1.822 ± 0.33
2.733AspAsn: 2.733 ± 0.473
2.071AspPro: 2.071 ± 0.41
1.822AspGln: 1.822 ± 0.405
2.568AspArg: 2.568 ± 0.549
3.976AspSer: 3.976 ± 0.495
3.727AspThr: 3.727 ± 0.6
4.638AspVal: 4.638 ± 0.573
0.911AspTrp: 0.911 ± 0.272
3.562AspTyr: 3.562 ± 0.639
0.0AspXaa: 0.0 ± 0.0
Glu
4.804GluAla: 4.804 ± 0.819
0.497GluCys: 0.497 ± 0.209
3.313GluAsp: 3.313 ± 0.641
5.384GluGlu: 5.384 ± 0.897
2.485GluPhe: 2.485 ± 0.467
3.313GluGly: 3.313 ± 0.553
0.994GluHis: 0.994 ± 0.279
6.129GluIle: 6.129 ± 0.867
5.715GluLys: 5.715 ± 1.022
8.034GluLeu: 8.034 ± 1.059
1.822GluMet: 1.822 ± 0.452
3.644GluAsn: 3.644 ± 0.413
1.657GluPro: 1.657 ± 0.304
2.733GluGln: 2.733 ± 0.448
2.816GluArg: 2.816 ± 0.585
3.562GluSer: 3.562 ± 0.487
4.473GluThr: 4.473 ± 0.537
4.887GluVal: 4.887 ± 0.723
1.16GluTrp: 1.16 ± 0.277
3.562GluTyr: 3.562 ± 0.648
0.0GluXaa: 0.0 ± 0.0
Phe
2.568PheAla: 2.568 ± 0.589
0.331PheCys: 0.331 ± 0.16
4.059PheAsp: 4.059 ± 0.554
2.485PheGlu: 2.485 ± 0.403
1.16PhePhe: 1.16 ± 0.311
3.562PheGly: 3.562 ± 0.478
0.414PheHis: 0.414 ± 0.151
2.733PheIle: 2.733 ± 0.466
3.479PheLys: 3.479 ± 0.545
2.899PheLeu: 2.899 ± 0.562
0.663PheMet: 0.663 ± 0.227
2.568PheAsn: 2.568 ± 0.452
0.58PhePro: 0.58 ± 0.201
0.58PheGln: 0.58 ± 0.221
1.16PheArg: 1.16 ± 0.252
3.23PheSer: 3.23 ± 0.531
2.568PheThr: 2.568 ± 0.547
1.988PheVal: 1.988 ± 0.332
0.58PheTrp: 0.58 ± 0.242
1.905PheTyr: 1.905 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
4.059GlyAla: 4.059 ± 0.788
0.414GlyCys: 0.414 ± 0.269
3.81GlyAsp: 3.81 ± 0.517
3.065GlyGlu: 3.065 ± 0.407
3.313GlyPhe: 3.313 ± 0.528
4.804GlyGly: 4.804 ± 0.946
0.994GlyHis: 0.994 ± 0.297
5.715GlyIle: 5.715 ± 0.781
6.461GlyLys: 6.461 ± 0.674
6.129GlyLeu: 6.129 ± 0.75
1.905GlyMet: 1.905 ± 0.489
3.644GlyAsn: 3.644 ± 0.79
0.663GlyPro: 0.663 ± 0.202
1.822GlyGln: 1.822 ± 0.417
2.071GlyArg: 2.071 ± 0.452
3.893GlySer: 3.893 ± 0.704
4.473GlyThr: 4.473 ± 0.68
3.479GlyVal: 3.479 ± 0.539
0.911GlyTrp: 0.911 ± 0.237
3.148GlyTyr: 3.148 ± 0.395
0.0GlyXaa: 0.0 ± 0.0
His
1.077HisAla: 1.077 ± 0.314
0.166HisCys: 0.166 ± 0.113
1.408HisAsp: 1.408 ± 0.402
0.911HisGlu: 0.911 ± 0.245
0.663HisPhe: 0.663 ± 0.231
0.911HisGly: 0.911 ± 0.256
0.248HisHis: 0.248 ± 0.141
1.16HisIle: 1.16 ± 0.241
1.574HisLys: 1.574 ± 0.4
0.745HisLeu: 0.745 ± 0.261
0.331HisMet: 0.331 ± 0.147
0.497HisAsn: 0.497 ± 0.189
0.58HisPro: 0.58 ± 0.255
0.414HisGln: 0.414 ± 0.171
0.663HisArg: 0.663 ± 0.259
0.663HisSer: 0.663 ± 0.239
0.994HisThr: 0.994 ± 0.206
1.242HisVal: 1.242 ± 0.383
0.166HisTrp: 0.166 ± 0.128
0.414HisTyr: 0.414 ± 0.167
0.0HisXaa: 0.0 ± 0.0
Ile
4.804IleAla: 4.804 ± 0.625
0.58IleCys: 0.58 ± 0.194
4.97IleAsp: 4.97 ± 0.769
5.881IleGlu: 5.881 ± 0.936
2.402IlePhe: 2.402 ± 0.504
3.727IleGly: 3.727 ± 0.504
1.242IleHis: 1.242 ± 0.26
4.556IleIle: 4.556 ± 0.718
7.372IleLys: 7.372 ± 0.753
4.473IleLeu: 4.473 ± 0.463
1.739IleMet: 1.739 ± 0.37
4.804IleAsn: 4.804 ± 0.629
3.313IlePro: 3.313 ± 0.532
2.568IleGln: 2.568 ± 0.304
2.651IleArg: 2.651 ± 0.627
4.141IleSer: 4.141 ± 0.652
4.307IleThr: 4.307 ± 0.714
3.976IleVal: 3.976 ± 0.657
0.745IleTrp: 0.745 ± 0.172
2.651IleTyr: 2.651 ± 0.532
0.0IleXaa: 0.0 ± 0.0
Lys
7.041LysAla: 7.041 ± 0.857
0.414LysCys: 0.414 ± 0.318
5.135LysAsp: 5.135 ± 0.738
7.206LysGlu: 7.206 ± 0.914
3.396LysPhe: 3.396 ± 0.658
4.473LysGly: 4.473 ± 0.695
1.491LysHis: 1.491 ± 0.292
6.626LysIle: 6.626 ± 0.945
7.62LysLys: 7.62 ± 0.848
6.792LysLeu: 6.792 ± 0.793
1.905LysMet: 1.905 ± 0.457
5.301LysAsn: 5.301 ± 0.905
1.988LysPro: 1.988 ± 0.397
3.727LysGln: 3.727 ± 0.371
3.065LysArg: 3.065 ± 0.517
4.473LysSer: 4.473 ± 0.62
6.129LysThr: 6.129 ± 0.748
5.384LysVal: 5.384 ± 0.773
1.739LysTrp: 1.739 ± 0.37
3.396LysTyr: 3.396 ± 0.529
0.0LysXaa: 0.0 ± 0.0
Leu
5.881LeuAla: 5.881 ± 0.813
0.497LeuCys: 0.497 ± 0.222
5.218LeuAsp: 5.218 ± 0.592
6.792LeuGlu: 6.792 ± 1.321
2.236LeuPhe: 2.236 ± 0.365
4.556LeuGly: 4.556 ± 0.67
1.408LeuHis: 1.408 ± 0.343
5.218LeuIle: 5.218 ± 0.667
7.703LeuLys: 7.703 ± 0.902
6.461LeuLeu: 6.461 ± 0.865
1.657LeuMet: 1.657 ± 0.316
5.964LeuAsn: 5.964 ± 0.767
2.899LeuPro: 2.899 ± 0.584
3.396LeuGln: 3.396 ± 0.584
2.402LeuArg: 2.402 ± 0.5
5.881LeuSer: 5.881 ± 0.593
6.295LeuThr: 6.295 ± 0.789
4.721LeuVal: 4.721 ± 0.507
0.745LeuTrp: 0.745 ± 0.22
2.154LeuTyr: 2.154 ± 0.403
0.0LeuXaa: 0.0 ± 0.0
Met
1.905MetAla: 1.905 ± 0.517
0.083MetCys: 0.083 ± 0.077
1.077MetAsp: 1.077 ± 0.338
1.905MetGlu: 1.905 ± 0.488
1.077MetPhe: 1.077 ± 0.216
1.491MetGly: 1.491 ± 0.363
0.083MetHis: 0.083 ± 0.085
1.408MetIle: 1.408 ± 0.307
2.319MetLys: 2.319 ± 0.406
2.319MetLeu: 2.319 ± 0.516
0.497MetMet: 0.497 ± 0.271
2.071MetAsn: 2.071 ± 0.432
0.828MetPro: 0.828 ± 0.286
0.497MetGln: 0.497 ± 0.167
1.077MetArg: 1.077 ± 0.224
1.657MetSer: 1.657 ± 0.35
1.739MetThr: 1.739 ± 0.307
1.242MetVal: 1.242 ± 0.305
0.248MetTrp: 0.248 ± 0.159
0.745MetTyr: 0.745 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
5.053AsnAla: 5.053 ± 0.787
0.331AsnCys: 0.331 ± 0.177
3.976AsnAsp: 3.976 ± 0.682
3.065AsnGlu: 3.065 ± 0.545
2.236AsnPhe: 2.236 ± 0.419
6.626AsnGly: 6.626 ± 1.081
1.242AsnHis: 1.242 ± 0.235
3.976AsnIle: 3.976 ± 0.545
4.638AsnLys: 4.638 ± 0.605
3.562AsnLeu: 3.562 ± 0.512
1.657AsnMet: 1.657 ± 0.324
3.644AsnAsn: 3.644 ± 0.822
1.988AsnPro: 1.988 ± 0.427
3.479AsnGln: 3.479 ± 0.628
1.822AsnArg: 1.822 ± 0.376
4.224AsnSer: 4.224 ± 0.51
3.479AsnThr: 3.479 ± 0.678
2.319AsnVal: 2.319 ± 0.494
0.911AsnTrp: 0.911 ± 0.29
2.071AsnTyr: 2.071 ± 0.438
0.0AsnXaa: 0.0 ± 0.0
Pro
1.905ProAla: 1.905 ± 0.367
0.166ProCys: 0.166 ± 0.104
1.988ProAsp: 1.988 ± 0.387
2.485ProGlu: 2.485 ± 0.494
0.828ProPhe: 0.828 ± 0.301
1.16ProGly: 1.16 ± 0.323
0.248ProHis: 0.248 ± 0.124
1.739ProIle: 1.739 ± 0.289
2.899ProLys: 2.899 ± 0.5
2.236ProLeu: 2.236 ± 0.507
0.828ProMet: 0.828 ± 0.283
1.574ProAsn: 1.574 ± 0.263
0.58ProPro: 0.58 ± 0.19
1.822ProGln: 1.822 ± 0.442
1.077ProArg: 1.077 ± 0.392
2.071ProSer: 2.071 ± 0.399
2.154ProThr: 2.154 ± 0.404
2.071ProVal: 2.071 ± 0.555
0.248ProTrp: 0.248 ± 0.123
0.828ProTyr: 0.828 ± 0.308
0.0ProXaa: 0.0 ± 0.0
Gln
4.141GlnAla: 4.141 ± 0.671
0.497GlnCys: 0.497 ± 0.261
1.739GlnAsp: 1.739 ± 0.365
2.568GlnGlu: 2.568 ± 0.578
1.242GlnPhe: 1.242 ± 0.324
2.651GlnGly: 2.651 ± 0.403
0.828GlnHis: 0.828 ± 0.271
2.485GlnIle: 2.485 ± 0.565
2.651GlnLys: 2.651 ± 0.444
2.982GlnLeu: 2.982 ± 0.428
1.242GlnMet: 1.242 ± 0.269
2.071GlnAsn: 2.071 ± 0.402
1.408GlnPro: 1.408 ± 0.274
1.739GlnGln: 1.739 ± 0.407
1.16GlnArg: 1.16 ± 0.322
2.319GlnSer: 2.319 ± 0.463
2.319GlnThr: 2.319 ± 0.369
2.071GlnVal: 2.071 ± 0.514
0.663GlnTrp: 0.663 ± 0.293
1.574GlnTyr: 1.574 ± 0.292
0.0GlnXaa: 0.0 ± 0.0
Arg
1.491ArgAla: 1.491 ± 0.373
0.414ArgCys: 0.414 ± 0.24
2.402ArgAsp: 2.402 ± 0.393
2.733ArgGlu: 2.733 ± 0.55
1.242ArgPhe: 1.242 ± 0.299
1.325ArgGly: 1.325 ± 0.313
0.58ArgHis: 0.58 ± 0.228
2.402ArgIle: 2.402 ± 0.46
2.982ArgLys: 2.982 ± 0.572
3.976ArgLeu: 3.976 ± 0.527
1.077ArgMet: 1.077 ± 0.305
2.236ArgAsn: 2.236 ± 0.457
0.828ArgPro: 0.828 ± 0.33
1.574ArgGln: 1.574 ± 0.345
1.16ArgArg: 1.16 ± 0.315
1.905ArgSer: 1.905 ± 0.456
1.408ArgThr: 1.408 ± 0.364
2.236ArgVal: 2.236 ± 0.42
0.663ArgTrp: 0.663 ± 0.239
1.491ArgTyr: 1.491 ± 0.287
0.0ArgXaa: 0.0 ± 0.0
Ser
4.638SerAla: 4.638 ± 0.648
0.331SerCys: 0.331 ± 0.165
4.059SerAsp: 4.059 ± 0.605
5.053SerGlu: 5.053 ± 0.573
2.651SerPhe: 2.651 ± 0.56
5.053SerGly: 5.053 ± 0.706
0.828SerHis: 0.828 ± 0.355
4.307SerIle: 4.307 ± 0.529
6.047SerLys: 6.047 ± 0.938
4.39SerLeu: 4.39 ± 0.531
1.574SerMet: 1.574 ± 0.367
4.638SerAsn: 4.638 ± 0.735
1.905SerPro: 1.905 ± 0.381
1.822SerGln: 1.822 ± 0.35
2.402SerArg: 2.402 ± 0.441
6.129SerSer: 6.129 ± 1.203
3.313SerThr: 3.313 ± 0.496
3.727SerVal: 3.727 ± 0.542
0.994SerTrp: 0.994 ± 0.233
2.319SerTyr: 2.319 ± 0.415
0.0SerXaa: 0.0 ± 0.0
Thr
5.55ThrAla: 5.55 ± 0.73
0.331ThrCys: 0.331 ± 0.197
3.396ThrAsp: 3.396 ± 0.473
3.313ThrGlu: 3.313 ± 0.432
2.816ThrPhe: 2.816 ± 0.412
4.721ThrGly: 4.721 ± 0.597
0.828ThrHis: 0.828 ± 0.236
5.218ThrIle: 5.218 ± 0.774
4.887ThrLys: 4.887 ± 0.764
4.97ThrLeu: 4.97 ± 0.722
1.408ThrMet: 1.408 ± 0.303
3.893ThrAsn: 3.893 ± 0.798
1.739ThrPro: 1.739 ± 0.409
2.651ThrGln: 2.651 ± 0.526
1.657ThrArg: 1.657 ± 0.383
4.141ThrSer: 4.141 ± 0.823
4.224ThrThr: 4.224 ± 0.771
4.307ThrVal: 4.307 ± 0.689
0.994ThrTrp: 0.994 ± 0.336
2.982ThrTyr: 2.982 ± 0.561
0.0ThrXaa: 0.0 ± 0.0
Val
4.307ValAla: 4.307 ± 0.62
0.248ValCys: 0.248 ± 0.155
4.39ValAsp: 4.39 ± 0.585
4.721ValGlu: 4.721 ± 0.811
2.236ValPhe: 2.236 ± 0.308
3.893ValGly: 3.893 ± 0.765
0.911ValHis: 0.911 ± 0.387
3.396ValIle: 3.396 ± 0.545
4.141ValLys: 4.141 ± 0.575
5.053ValLeu: 5.053 ± 0.533
1.242ValMet: 1.242 ± 0.276
3.893ValAsn: 3.893 ± 0.566
2.154ValPro: 2.154 ± 0.612
2.071ValGln: 2.071 ± 0.447
1.491ValArg: 1.491 ± 0.322
4.307ValSer: 4.307 ± 0.68
4.224ValThr: 4.224 ± 0.58
3.148ValVal: 3.148 ± 0.529
0.58ValTrp: 0.58 ± 0.203
2.568ValTyr: 2.568 ± 0.421
0.0ValXaa: 0.0 ± 0.0
Trp
0.663TrpAla: 0.663 ± 0.27
0.166TrpCys: 0.166 ± 0.103
1.077TrpAsp: 1.077 ± 0.318
0.911TrpGlu: 0.911 ± 0.216
0.745TrpPhe: 0.745 ± 0.255
0.745TrpGly: 0.745 ± 0.231
0.414TrpHis: 0.414 ± 0.158
0.497TrpIle: 0.497 ± 0.15
1.408TrpLys: 1.408 ± 0.365
1.574TrpLeu: 1.574 ± 0.41
0.083TrpMet: 0.083 ± 0.062
0.58TrpAsn: 0.58 ± 0.198
0.497TrpPro: 0.497 ± 0.239
0.663TrpGln: 0.663 ± 0.237
0.497TrpArg: 0.497 ± 0.267
0.994TrpSer: 0.994 ± 0.305
0.828TrpThr: 0.828 ± 0.262
0.911TrpVal: 0.911 ± 0.288
0.083TrpTrp: 0.083 ± 0.062
0.331TrpTyr: 0.331 ± 0.243
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.733TyrAla: 2.733 ± 0.398
0.745TyrCys: 0.745 ± 0.248
3.727TyrAsp: 3.727 ± 0.62
2.899TyrGlu: 2.899 ± 0.5
1.988TyrPhe: 1.988 ± 0.473
1.988TyrGly: 1.988 ± 0.415
0.58TyrHis: 0.58 ± 0.217
2.651TyrIle: 2.651 ± 0.551
2.816TyrLys: 2.816 ± 0.593
3.81TyrLeu: 3.81 ± 0.532
0.911TyrMet: 0.911 ± 0.232
1.905TyrAsn: 1.905 ± 0.411
1.574TyrPro: 1.574 ± 0.31
1.657TyrGln: 1.657 ± 0.405
1.657TyrArg: 1.657 ± 0.382
2.485TyrSer: 2.485 ± 0.464
2.816TyrThr: 2.816 ± 0.596
1.574TyrVal: 1.574 ± 0.316
0.414TyrTrp: 0.414 ± 0.177
2.485TyrTyr: 2.485 ± 0.56
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12074 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski