Amino acid dipepetide frequency for Wolbachia phage WO

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.597AlaAla: 5.597 ± 0.78
0.914AlaCys: 0.914 ± 0.313
1.942AlaAsp: 1.942 ± 0.474
3.427AlaGlu: 3.427 ± 0.574
2.513AlaPhe: 2.513 ± 0.541
3.541AlaGly: 3.541 ± 0.69
1.028AlaHis: 1.028 ± 0.332
7.082AlaIle: 7.082 ± 1.274
4.455AlaLys: 4.455 ± 0.607
6.853AlaLeu: 6.853 ± 1.047
1.599AlaMet: 1.599 ± 0.342
2.856AlaAsn: 2.856 ± 0.515
1.485AlaPro: 1.485 ± 0.455
1.371AlaGln: 1.371 ± 0.378
3.312AlaArg: 3.312 ± 0.574
3.198AlaSer: 3.198 ± 0.526
3.655AlaThr: 3.655 ± 0.755
4.112AlaVal: 4.112 ± 0.638
0.685AlaTrp: 0.685 ± 0.3
2.284AlaTyr: 2.284 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.8CysAla: 0.8 ± 0.321
0.0CysCys: 0.0 ± 0.0
0.8CysAsp: 0.8 ± 0.255
0.343CysGlu: 0.343 ± 0.194
1.256CysPhe: 1.256 ± 0.353
1.142CysGly: 1.142 ± 0.333
0.457CysHis: 0.457 ± 0.218
1.371CysIle: 1.371 ± 0.359
0.8CysLys: 0.8 ± 0.286
0.914CysLeu: 0.914 ± 0.273
0.114CysMet: 0.114 ± 0.118
0.914CysAsn: 0.914 ± 0.299
0.228CysPro: 0.228 ± 0.151
0.228CysGln: 0.228 ± 0.159
1.256CysArg: 1.256 ± 0.366
0.685CysSer: 0.685 ± 0.25
0.8CysThr: 0.8 ± 0.352
1.028CysVal: 1.028 ± 0.42
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.084AspAla: 3.084 ± 0.494
0.571AspCys: 0.571 ± 0.233
1.713AspAsp: 1.713 ± 0.478
3.883AspGlu: 3.883 ± 0.629
2.513AspPhe: 2.513 ± 0.697
3.084AspGly: 3.084 ± 0.718
1.485AspHis: 1.485 ± 0.372
3.541AspIle: 3.541 ± 0.693
3.084AspLys: 3.084 ± 0.55
3.541AspLeu: 3.541 ± 0.848
1.256AspMet: 1.256 ± 0.353
1.713AspAsn: 1.713 ± 0.487
2.056AspPro: 2.056 ± 0.531
1.371AspGln: 1.371 ± 0.438
2.856AspArg: 2.856 ± 0.518
2.627AspSer: 2.627 ± 0.623
2.056AspThr: 2.056 ± 0.43
3.541AspVal: 3.541 ± 0.484
0.343AspTrp: 0.343 ± 0.195
1.599AspTyr: 1.599 ± 0.392
0.0AspXaa: 0.0 ± 0.0
Glu
3.769GluAla: 3.769 ± 0.69
1.256GluCys: 1.256 ± 0.369
3.769GluAsp: 3.769 ± 0.791
7.196GluGlu: 7.196 ± 1.272
3.198GluPhe: 3.198 ± 0.563
3.541GluGly: 3.541 ± 0.695
2.056GluHis: 2.056 ± 0.426
6.396GluIle: 6.396 ± 1.026
7.424GluLys: 7.424 ± 1.399
8.452GluLeu: 8.452 ± 1.086
1.028GluMet: 1.028 ± 0.363
3.998GluAsn: 3.998 ± 0.649
1.942GluPro: 1.942 ± 0.45
3.541GluGln: 3.541 ± 0.618
2.627GluArg: 2.627 ± 0.681
4.911GluSer: 4.911 ± 0.808
3.198GluThr: 3.198 ± 0.522
5.597GluVal: 5.597 ± 0.958
0.457GluTrp: 0.457 ± 0.255
2.741GluTyr: 2.741 ± 0.554
0.0GluXaa: 0.0 ± 0.0
Phe
3.427PheAla: 3.427 ± 0.797
1.028PheCys: 1.028 ± 0.251
2.17PheAsp: 2.17 ± 0.555
2.399PheGlu: 2.399 ± 0.433
2.056PhePhe: 2.056 ± 0.662
2.741PheGly: 2.741 ± 0.616
0.8PheHis: 0.8 ± 0.294
3.998PheIle: 3.998 ± 0.683
2.056PheLys: 2.056 ± 0.522
3.769PheLeu: 3.769 ± 0.898
1.028PheMet: 1.028 ± 0.367
2.399PheAsn: 2.399 ± 0.341
1.028PhePro: 1.028 ± 0.37
1.028PheGln: 1.028 ± 0.347
2.17PheArg: 2.17 ± 0.544
2.97PheSer: 2.97 ± 0.569
2.17PheThr: 2.17 ± 0.572
3.084PheVal: 3.084 ± 0.615
0.685PheTrp: 0.685 ± 0.342
1.485PheTyr: 1.485 ± 0.401
0.0PheXaa: 0.0 ± 0.0
Gly
2.856GlyAla: 2.856 ± 0.581
1.142GlyCys: 1.142 ± 0.317
2.97GlyAsp: 2.97 ± 0.541
6.625GlyGlu: 6.625 ± 1.278
1.942GlyPhe: 1.942 ± 0.575
2.741GlyGly: 2.741 ± 0.541
0.685GlyHis: 0.685 ± 0.254
7.31GlyIle: 7.31 ± 0.767
5.597GlyLys: 5.597 ± 1.037
5.711GlyLeu: 5.711 ± 0.897
1.942GlyMet: 1.942 ± 0.508
2.513GlyAsn: 2.513 ± 0.472
1.485GlyPro: 1.485 ± 0.609
2.056GlyGln: 2.056 ± 0.405
2.97GlyArg: 2.97 ± 0.438
4.683GlySer: 4.683 ± 0.803
2.513GlyThr: 2.513 ± 0.629
6.511GlyVal: 6.511 ± 0.785
0.914GlyTrp: 0.914 ± 0.321
2.513GlyTyr: 2.513 ± 0.502
0.0GlyXaa: 0.0 ± 0.0
His
1.599HisAla: 1.599 ± 0.537
0.343HisCys: 0.343 ± 0.185
0.571HisAsp: 0.571 ± 0.253
0.914HisGlu: 0.914 ± 0.329
1.485HisPhe: 1.485 ± 0.354
0.457HisGly: 0.457 ± 0.235
0.685HisHis: 0.685 ± 0.312
1.371HisIle: 1.371 ± 0.431
1.028HisLys: 1.028 ± 0.31
2.056HisLeu: 2.056 ± 0.461
0.457HisMet: 0.457 ± 0.25
1.142HisAsn: 1.142 ± 0.328
0.8HisPro: 0.8 ± 0.276
0.685HisGln: 0.685 ± 0.36
0.914HisArg: 0.914 ± 0.314
1.713HisSer: 1.713 ± 0.427
0.571HisThr: 0.571 ± 0.235
1.599HisVal: 1.599 ± 0.52
0.343HisTrp: 0.343 ± 0.216
1.599HisTyr: 1.599 ± 0.462
0.0HisXaa: 0.0 ± 0.0
Ile
5.825IleAla: 5.825 ± 0.763
1.256IleCys: 1.256 ± 0.41
5.483IleAsp: 5.483 ± 0.96
6.967IleGlu: 6.967 ± 0.961
3.655IlePhe: 3.655 ± 0.939
7.881IleGly: 7.881 ± 1.05
1.485IleHis: 1.485 ± 0.413
7.31IleIle: 7.31 ± 0.935
6.739IleLys: 6.739 ± 0.82
7.196IleLeu: 7.196 ± 1.002
2.284IleMet: 2.284 ± 0.609
4.911IleAsn: 4.911 ± 0.68
2.17IlePro: 2.17 ± 0.546
1.371IleGln: 1.371 ± 0.481
3.541IleArg: 3.541 ± 0.539
5.939IleSer: 5.939 ± 0.815
5.825IleThr: 5.825 ± 1.092
4.34IleVal: 4.34 ± 0.688
0.457IleTrp: 0.457 ± 0.207
3.084IleTyr: 3.084 ± 0.699
0.0IleXaa: 0.0 ± 0.0
Lys
5.254LysAla: 5.254 ± 0.976
0.8LysCys: 0.8 ± 0.351
4.569LysAsp: 4.569 ± 0.72
5.825LysGlu: 5.825 ± 1.163
2.856LysPhe: 2.856 ± 0.702
4.569LysGly: 4.569 ± 0.861
1.828LysHis: 1.828 ± 0.54
5.254LysIle: 5.254 ± 0.701
5.483LysLys: 5.483 ± 0.962
7.539LysLeu: 7.539 ± 0.899
1.942LysMet: 1.942 ± 0.503
3.769LysAsn: 3.769 ± 0.641
1.371LysPro: 1.371 ± 0.519
4.112LysGln: 4.112 ± 0.743
5.254LysArg: 5.254 ± 0.803
4.797LysSer: 4.797 ± 0.573
2.97LysThr: 2.97 ± 0.605
4.683LysVal: 4.683 ± 0.782
1.713LysTrp: 1.713 ± 0.572
1.828LysTyr: 1.828 ± 0.449
0.0LysXaa: 0.0 ± 0.0
Leu
6.511LeuAla: 6.511 ± 0.865
0.914LeuCys: 0.914 ± 0.317
4.112LeuAsp: 4.112 ± 0.723
7.082LeuGlu: 7.082 ± 1.044
4.683LeuPhe: 4.683 ± 0.713
5.825LeuGly: 5.825 ± 0.716
2.856LeuHis: 2.856 ± 0.751
7.539LeuIle: 7.539 ± 1.242
8.11LeuLys: 8.11 ± 0.724
7.653LeuLeu: 7.653 ± 1.238
2.17LeuMet: 2.17 ± 0.624
3.769LeuAsn: 3.769 ± 0.749
3.541LeuPro: 3.541 ± 0.843
2.399LeuGln: 2.399 ± 0.458
5.026LeuArg: 5.026 ± 0.862
9.709LeuSer: 9.709 ± 1.015
4.112LeuThr: 4.112 ± 0.753
4.683LeuVal: 4.683 ± 0.581
1.142LeuTrp: 1.142 ± 0.499
3.427LeuTyr: 3.427 ± 0.662
0.0LeuXaa: 0.0 ± 0.0
Met
1.713MetAla: 1.713 ± 0.449
0.343MetCys: 0.343 ± 0.186
0.8MetAsp: 0.8 ± 0.309
1.713MetGlu: 1.713 ± 0.43
0.914MetPhe: 0.914 ± 0.306
1.828MetGly: 1.828 ± 0.404
0.114MetHis: 0.114 ± 0.12
1.256MetIle: 1.256 ± 0.506
2.17MetLys: 2.17 ± 0.475
3.427MetLeu: 3.427 ± 0.582
0.571MetMet: 0.571 ± 0.25
0.8MetAsn: 0.8 ± 0.263
0.685MetPro: 0.685 ± 0.281
1.028MetGln: 1.028 ± 0.326
1.371MetArg: 1.371 ± 0.472
1.713MetSer: 1.713 ± 0.491
1.028MetThr: 1.028 ± 0.401
1.256MetVal: 1.256 ± 0.396
0.343MetTrp: 0.343 ± 0.172
0.457MetTyr: 0.457 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
2.17AsnAla: 2.17 ± 0.537
0.228AsnCys: 0.228 ± 0.147
2.513AsnAsp: 2.513 ± 0.615
2.97AsnGlu: 2.97 ± 0.657
1.371AsnPhe: 1.371 ± 0.423
2.97AsnGly: 2.97 ± 0.542
1.142AsnHis: 1.142 ± 0.317
5.368AsnIle: 5.368 ± 0.795
4.569AsnLys: 4.569 ± 0.852
4.683AsnLeu: 4.683 ± 0.856
0.571AsnMet: 0.571 ± 0.232
2.97AsnAsn: 2.97 ± 0.941
1.942AsnPro: 1.942 ± 0.598
1.028AsnGln: 1.028 ± 0.332
2.513AsnArg: 2.513 ± 0.437
2.513AsnSer: 2.513 ± 0.453
2.97AsnThr: 2.97 ± 0.602
2.97AsnVal: 2.97 ± 0.608
0.685AsnTrp: 0.685 ± 0.394
1.371AsnTyr: 1.371 ± 0.356
0.0AsnXaa: 0.0 ± 0.0
Pro
2.17ProAla: 2.17 ± 0.439
0.457ProCys: 0.457 ± 0.272
1.371ProAsp: 1.371 ± 0.345
3.427ProGlu: 3.427 ± 0.471
1.256ProPhe: 1.256 ± 0.384
2.97ProGly: 2.97 ± 0.692
0.914ProHis: 0.914 ± 0.308
2.399ProIle: 2.399 ± 0.604
1.371ProLys: 1.371 ± 0.364
3.312ProLeu: 3.312 ± 0.69
1.256ProMet: 1.256 ± 0.381
1.485ProAsn: 1.485 ± 0.446
1.028ProPro: 1.028 ± 0.426
0.914ProGln: 0.914 ± 0.325
0.8ProArg: 0.8 ± 0.348
1.599ProSer: 1.599 ± 0.424
1.828ProThr: 1.828 ± 0.466
2.513ProVal: 2.513 ± 0.705
0.685ProTrp: 0.685 ± 0.288
0.685ProTyr: 0.685 ± 0.251
0.0ProXaa: 0.0 ± 0.0
Gln
2.284GlnAla: 2.284 ± 0.444
0.343GlnCys: 0.343 ± 0.189
0.571GlnAsp: 0.571 ± 0.263
2.513GlnGlu: 2.513 ± 0.55
0.685GlnPhe: 0.685 ± 0.271
1.256GlnGly: 1.256 ± 0.337
0.457GlnHis: 0.457 ± 0.236
3.541GlnIle: 3.541 ± 0.762
2.056GlnLys: 2.056 ± 0.53
2.97GlnLeu: 2.97 ± 0.84
0.685GlnMet: 0.685 ± 0.316
1.142GlnAsn: 1.142 ± 0.381
1.713GlnPro: 1.713 ± 0.428
1.942GlnGln: 1.942 ± 0.435
1.028GlnArg: 1.028 ± 0.381
1.713GlnSer: 1.713 ± 0.445
1.256GlnThr: 1.256 ± 0.306
2.17GlnVal: 2.17 ± 0.371
0.114GlnTrp: 0.114 ± 0.116
0.685GlnTyr: 0.685 ± 0.317
0.0GlnXaa: 0.0 ± 0.0
Arg
2.513ArgAla: 2.513 ± 0.546
0.457ArgCys: 0.457 ± 0.224
3.084ArgAsp: 3.084 ± 0.529
4.112ArgGlu: 4.112 ± 0.709
1.828ArgPhe: 1.828 ± 0.667
3.198ArgGly: 3.198 ± 0.685
0.457ArgHis: 0.457 ± 0.286
3.998ArgIle: 3.998 ± 0.691
5.026ArgLys: 5.026 ± 0.939
4.34ArgLeu: 4.34 ± 0.898
1.371ArgMet: 1.371 ± 0.421
3.198ArgAsn: 3.198 ± 0.557
2.513ArgPro: 2.513 ± 0.48
0.571ArgGln: 0.571 ± 0.243
2.97ArgArg: 2.97 ± 0.572
3.427ArgSer: 3.427 ± 0.597
2.513ArgThr: 2.513 ± 0.566
3.769ArgVal: 3.769 ± 0.735
1.028ArgTrp: 1.028 ± 0.379
1.142ArgTyr: 1.142 ± 0.317
0.0ArgXaa: 0.0 ± 0.0
Ser
3.427SerAla: 3.427 ± 0.677
0.8SerCys: 0.8 ± 0.4
2.056SerAsp: 2.056 ± 0.353
4.569SerGlu: 4.569 ± 0.732
3.541SerPhe: 3.541 ± 0.764
6.396SerGly: 6.396 ± 0.943
1.713SerHis: 1.713 ± 0.51
6.625SerIle: 6.625 ± 0.869
4.797SerLys: 4.797 ± 0.786
6.054SerLeu: 6.054 ± 0.774
1.256SerMet: 1.256 ± 0.343
2.97SerAsn: 2.97 ± 0.635
1.942SerPro: 1.942 ± 0.48
1.028SerGln: 1.028 ± 0.446
3.769SerArg: 3.769 ± 0.542
3.769SerSer: 3.769 ± 0.685
3.883SerThr: 3.883 ± 0.814
3.998SerVal: 3.998 ± 0.755
0.8SerTrp: 0.8 ± 0.227
3.084SerTyr: 3.084 ± 0.509
0.0SerXaa: 0.0 ± 0.0
Thr
3.883ThrAla: 3.883 ± 0.6
0.571ThrCys: 0.571 ± 0.23
1.942ThrAsp: 1.942 ± 0.566
3.998ThrGlu: 3.998 ± 0.619
1.828ThrPhe: 1.828 ± 0.545
3.998ThrGly: 3.998 ± 0.956
0.343ThrHis: 0.343 ± 0.196
3.998ThrIle: 3.998 ± 0.552
3.998ThrLys: 3.998 ± 0.883
4.797ThrLeu: 4.797 ± 0.84
0.571ThrMet: 0.571 ± 0.258
1.485ThrAsn: 1.485 ± 0.415
2.284ThrPro: 2.284 ± 0.469
1.713ThrGln: 1.713 ± 0.439
2.513ThrArg: 2.513 ± 0.611
2.97ThrSer: 2.97 ± 0.655
3.084ThrThr: 3.084 ± 0.604
2.741ThrVal: 2.741 ± 0.664
0.685ThrTrp: 0.685 ± 0.338
1.828ThrTyr: 1.828 ± 0.511
0.0ThrXaa: 0.0 ± 0.0
Val
2.741ValAla: 2.741 ± 0.584
0.8ValCys: 0.8 ± 0.343
3.998ValAsp: 3.998 ± 0.818
5.14ValGlu: 5.14 ± 0.895
2.513ValPhe: 2.513 ± 0.634
4.455ValGly: 4.455 ± 0.83
0.8ValHis: 0.8 ± 0.262
5.368ValIle: 5.368 ± 1.107
4.797ValLys: 4.797 ± 0.756
6.967ValLeu: 6.967 ± 0.782
1.713ValMet: 1.713 ± 0.493
3.198ValAsn: 3.198 ± 0.55
2.627ValPro: 2.627 ± 0.55
1.599ValGln: 1.599 ± 0.507
4.112ValArg: 4.112 ± 0.967
4.226ValSer: 4.226 ± 0.724
2.741ValThr: 2.741 ± 0.527
5.939ValVal: 5.939 ± 0.92
1.142ValTrp: 1.142 ± 0.437
2.513ValTyr: 2.513 ± 0.519
0.0ValXaa: 0.0 ± 0.0
Trp
0.228TrpAla: 0.228 ± 0.144
0.0TrpCys: 0.0 ± 0.0
0.571TrpAsp: 0.571 ± 0.264
0.571TrpGlu: 0.571 ± 0.292
0.343TrpPhe: 0.343 ± 0.198
0.571TrpGly: 0.571 ± 0.287
0.0TrpHis: 0.0 ± 0.0
1.828TrpIle: 1.828 ± 0.446
1.028TrpLys: 1.028 ± 0.437
1.942TrpLeu: 1.942 ± 0.502
0.457TrpMet: 0.457 ± 0.278
0.571TrpAsn: 0.571 ± 0.232
0.685TrpPro: 0.685 ± 0.374
0.685TrpGln: 0.685 ± 0.255
0.571TrpArg: 0.571 ± 0.252
0.914TrpSer: 0.914 ± 0.381
0.457TrpThr: 0.457 ± 0.287
0.685TrpVal: 0.685 ± 0.303
0.0TrpTrp: 0.0 ± 0.0
0.571TrpTyr: 0.571 ± 0.238
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.056TyrAla: 2.056 ± 0.461
0.914TyrCys: 0.914 ± 0.307
0.914TyrAsp: 0.914 ± 0.33
3.427TyrGlu: 3.427 ± 0.638
2.056TyrPhe: 2.056 ± 0.5
2.513TyrGly: 2.513 ± 0.443
1.028TyrHis: 1.028 ± 0.381
2.17TyrIle: 2.17 ± 0.458
1.828TyrLys: 1.828 ± 0.436
3.084TyrLeu: 3.084 ± 0.92
1.142TyrMet: 1.142 ± 0.375
1.713TyrAsn: 1.713 ± 0.458
0.914TyrPro: 0.914 ± 0.316
0.457TyrGln: 0.457 ± 0.237
1.942TyrArg: 1.942 ± 0.431
2.513TyrSer: 2.513 ± 0.462
1.599TyrThr: 1.599 ± 0.365
2.17TyrVal: 2.17 ± 0.489
0.457TyrTrp: 0.457 ± 0.3
1.256TyrTyr: 1.256 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 38 proteins (8756 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski