Amino acid dipeptide frequency for Streptococcus phage Javan191

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
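As a minimal sketch of what such a calculation could look like (not the Proteome-pI implementation), the Python function below counts overlapping adjacent residue pairs across a set of protein sequences and divides by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are hypothetical. Counting pairs only within each protein, never across protein boundaries, is an assumption.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for overlapping adjacent residue pairs.

    `proteins` is an iterable of one-letter amino acid sequences.
    Pairs are counted within each protein only (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, fraction in sorted(freqs.items()):
    print(pair, round(fraction, 3))
```

The result is a plain fraction per dipeptide; the fractions over all observed dipeptides sum to 1.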

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every combination equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
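To illustrate the units, here is a small Python sketch that converts a per mille value to a fraction and compares it with the uniform expectation of 0.25%, which equals 2.5‰. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and using a plain comparison against 2.5‰ (ignoring the error estimates) is an assumption about how the colouring could be derived, not a description of the site's actual rule.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Two values copied from the table: GluLys 9.819 and AlaTrp 0.228.
for name, value in [("GluLys", 9.819), ("AlaTrp", 0.228)]:
    print(name, per_mille_to_fraction(value), classify(value))
```

Under this assumed rule, GluLys would fall on the overrepresented side and AlaTrp on the underrepresented side.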

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.567AlaAla: 4.567 ± 0.835
0.913AlaCys: 0.913 ± 0.289
4.871AlaAsp: 4.871 ± 0.754
4.567AlaGlu: 4.567 ± 0.578
3.197AlaPhe: 3.197 ± 0.404
5.328AlaGly: 5.328 ± 0.779
0.913AlaHis: 0.913 ± 0.221
6.318AlaIle: 6.318 ± 1.167
6.089AlaLys: 6.089 ± 0.635
5.1AlaLeu: 5.1 ± 0.893
1.522AlaMet: 1.522 ± 0.44
3.882AlaAsn: 3.882 ± 0.613
1.598AlaPro: 1.598 ± 0.366
1.446AlaGln: 1.446 ± 0.402
3.197AlaArg: 3.197 ± 0.447
3.882AlaSer: 3.882 ± 0.598
3.121AlaThr: 3.121 ± 0.512
5.024AlaVal: 5.024 ± 0.605
0.228AlaTrp: 0.228 ± 0.127
2.892AlaTyr: 2.892 ± 0.459
0.0AlaXaa: 0.0 ± 0.0
Cys
0.533CysAla: 0.533 ± 0.193
0.152CysCys: 0.152 ± 0.105
0.837CysAsp: 0.837 ± 0.278
0.533CysGlu: 0.533 ± 0.266
0.685CysPhe: 0.685 ± 0.202
0.913CysGly: 0.913 ± 0.282
0.076CysHis: 0.076 ± 0.068
0.304CysIle: 0.304 ± 0.142
0.685CysLys: 0.685 ± 0.261
0.609CysLeu: 0.609 ± 0.208
0.152CysMet: 0.152 ± 0.101
0.685CysAsn: 0.685 ± 0.244
0.381CysPro: 0.381 ± 0.143
0.381CysGln: 0.381 ± 0.189
0.685CysArg: 0.685 ± 0.271
0.381CysSer: 0.381 ± 0.153
0.457CysThr: 0.457 ± 0.2
0.761CysVal: 0.761 ± 0.216
0.0CysTrp: 0.0 ± 0.0
0.457CysTyr: 0.457 ± 0.202
0.0CysXaa: 0.0 ± 0.0
Asp
5.1AspAla: 5.1 ± 0.558
0.381AspCys: 0.381 ± 0.194
3.654AspAsp: 3.654 ± 0.744
6.698AspGlu: 6.698 ± 0.908
2.283AspPhe: 2.283 ± 0.349
4.491AspGly: 4.491 ± 0.973
1.142AspHis: 1.142 ± 0.294
4.415AspIle: 4.415 ± 0.641
4.795AspLys: 4.795 ± 0.641
4.415AspLeu: 4.415 ± 0.647
2.283AspMet: 2.283 ± 0.361
2.664AspAsn: 2.664 ± 0.44
1.751AspPro: 1.751 ± 0.334
0.913AspGln: 0.913 ± 0.21
3.577AspArg: 3.577 ± 0.464
3.654AspSer: 3.654 ± 0.428
3.577AspThr: 3.577 ± 0.426
4.339AspVal: 4.339 ± 0.606
1.066AspTrp: 1.066 ± 0.279
2.436AspTyr: 2.436 ± 0.521
0.0AspXaa: 0.0 ± 0.0
Glu
3.73GluAla: 3.73 ± 0.522
0.913GluCys: 0.913 ± 0.194
5.024GluAsp: 5.024 ± 0.891
7.84GluGlu: 7.84 ± 0.859
3.045GluPhe: 3.045 ± 0.517
3.882GluGly: 3.882 ± 0.402
1.218GluHis: 1.218 ± 0.311
7.459GluIle: 7.459 ± 0.818
9.819GluLys: 9.819 ± 1.123
6.698GluLeu: 6.698 ± 0.703
2.207GluMet: 2.207 ± 0.449
4.034GluAsn: 4.034 ± 0.413
1.598GluPro: 1.598 ± 0.294
2.74GluGln: 2.74 ± 0.531
4.11GluArg: 4.11 ± 0.825
4.262GluSer: 4.262 ± 0.656
4.186GluThr: 4.186 ± 0.709
5.252GluVal: 5.252 ± 0.759
0.533GluTrp: 0.533 ± 0.181
3.197GluTyr: 3.197 ± 0.498
0.0GluXaa: 0.0 ± 0.0
Phe
2.436PheAla: 2.436 ± 0.438
0.076PheCys: 0.076 ± 0.076
3.501PheAsp: 3.501 ± 0.519
3.654PheGlu: 3.654 ± 0.638
1.294PhePhe: 1.294 ± 0.263
2.74PheGly: 2.74 ± 0.447
0.457PheHis: 0.457 ± 0.155
2.664PheIle: 2.664 ± 0.539
3.349PheLys: 3.349 ± 0.483
3.577PheLeu: 3.577 ± 0.569
0.533PheMet: 0.533 ± 0.204
1.827PheAsn: 1.827 ± 0.405
1.446PhePro: 1.446 ± 0.376
0.609PheGln: 0.609 ± 0.235
1.903PheArg: 1.903 ± 0.412
3.045PheSer: 3.045 ± 0.463
1.903PheThr: 1.903 ± 0.41
2.283PheVal: 2.283 ± 0.485
0.457PheTrp: 0.457 ± 0.179
1.827PheTyr: 1.827 ± 0.469
0.0PheXaa: 0.0 ± 0.0
Gly
4.262GlyAla: 4.262 ± 0.833
0.533GlyCys: 0.533 ± 0.184
4.186GlyAsp: 4.186 ± 0.544
3.654GlyGlu: 3.654 ± 0.55
3.349GlyPhe: 3.349 ± 0.619
5.024GlyGly: 5.024 ± 0.853
1.294GlyHis: 1.294 ± 0.325
5.176GlyIle: 5.176 ± 0.556
6.698GlyLys: 6.698 ± 0.703
6.089GlyLeu: 6.089 ± 0.971
1.522GlyMet: 1.522 ± 0.291
3.501GlyAsn: 3.501 ± 0.516
0.609GlyPro: 0.609 ± 0.271
1.294GlyGln: 1.294 ± 0.377
2.664GlyArg: 2.664 ± 0.511
3.806GlySer: 3.806 ± 0.676
3.654GlyThr: 3.654 ± 0.584
3.121GlyVal: 3.121 ± 0.508
0.685GlyTrp: 0.685 ± 0.218
3.045GlyTyr: 3.045 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
0.837HisAla: 0.837 ± 0.218
0.228HisCys: 0.228 ± 0.124
1.066HisAsp: 1.066 ± 0.272
0.685HisGlu: 0.685 ± 0.242
1.142HisPhe: 1.142 ± 0.271
1.294HisGly: 1.294 ± 0.323
0.533HisHis: 0.533 ± 0.192
1.37HisIle: 1.37 ± 0.312
0.989HisLys: 0.989 ± 0.277
1.598HisLeu: 1.598 ± 0.297
0.457HisMet: 0.457 ± 0.17
0.761HisAsn: 0.761 ± 0.222
0.761HisPro: 0.761 ± 0.226
0.228HisGln: 0.228 ± 0.133
0.913HisArg: 0.913 ± 0.249
1.066HisSer: 1.066 ± 0.354
0.761HisThr: 0.761 ± 0.275
0.609HisVal: 0.609 ± 0.193
0.152HisTrp: 0.152 ± 0.18
0.304HisTyr: 0.304 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
6.546IleAla: 6.546 ± 1.013
0.913IleCys: 0.913 ± 0.23
5.785IleAsp: 5.785 ± 0.47
5.709IleGlu: 5.709 ± 0.641
2.892IlePhe: 2.892 ± 0.486
3.806IleGly: 3.806 ± 0.491
1.522IleHis: 1.522 ± 0.314
5.556IleIle: 5.556 ± 0.612
5.861IleLys: 5.861 ± 1.381
5.328IleLeu: 5.328 ± 0.712
2.436IleMet: 2.436 ± 0.442
2.968IleAsn: 2.968 ± 0.629
3.577IlePro: 3.577 ± 0.576
1.827IleGln: 1.827 ± 0.444
3.197IleArg: 3.197 ± 0.498
5.633IleSer: 5.633 ± 0.781
5.024IleThr: 5.024 ± 0.781
3.273IleVal: 3.273 ± 0.507
1.294IleTrp: 1.294 ± 0.459
3.349IleTyr: 3.349 ± 0.534
0.0IleXaa: 0.0 ± 0.0
Lys
7.079LysAla: 7.079 ± 0.709
0.761LysCys: 0.761 ± 0.233
5.024LysAsp: 5.024 ± 0.637
7.612LysGlu: 7.612 ± 0.679
3.501LysPhe: 3.501 ± 0.404
4.186LysGly: 4.186 ± 0.685
1.142LysHis: 1.142 ± 0.315
6.546LysIle: 6.546 ± 0.653
6.546LysLys: 6.546 ± 0.78
6.241LysLeu: 6.241 ± 0.573
2.512LysMet: 2.512 ± 0.425
5.1LysAsn: 5.1 ± 0.897
3.197LysPro: 3.197 ± 0.517
2.968LysGln: 2.968 ± 0.412
4.491LysArg: 4.491 ± 0.919
5.176LysSer: 5.176 ± 0.747
3.654LysThr: 3.654 ± 0.562
5.48LysVal: 5.48 ± 0.655
0.913LysTrp: 0.913 ± 0.234
3.654LysTyr: 3.654 ± 0.592
0.0LysXaa: 0.0 ± 0.0
Leu
4.186LeuAla: 4.186 ± 0.571
0.913LeuCys: 0.913 ± 0.236
6.013LeuAsp: 6.013 ± 0.61
8.677LeuGlu: 8.677 ± 0.903
2.512LeuPhe: 2.512 ± 0.453
6.241LeuGly: 6.241 ± 0.886
1.142LeuHis: 1.142 ± 0.347
5.328LeuIle: 5.328 ± 0.572
6.926LeuLys: 6.926 ± 0.781
7.612LeuLeu: 7.612 ± 0.934
2.892LeuMet: 2.892 ± 0.666
2.816LeuAsn: 2.816 ± 0.457
2.74LeuPro: 2.74 ± 0.456
2.74LeuGln: 2.74 ± 0.627
2.512LeuArg: 2.512 ± 0.47
7.155LeuSer: 7.155 ± 0.749
3.882LeuThr: 3.882 ± 0.54
3.958LeuVal: 3.958 ± 0.465
0.685LeuTrp: 0.685 ± 0.19
3.045LeuTyr: 3.045 ± 0.666
0.0LeuXaa: 0.0 ± 0.0
Met
2.207MetAla: 2.207 ± 0.369
0.228MetCys: 0.228 ± 0.141
2.283MetAsp: 2.283 ± 0.455
2.36MetGlu: 2.36 ± 0.364
0.761MetPhe: 0.761 ± 0.238
1.37MetGly: 1.37 ± 0.296
0.228MetHis: 0.228 ± 0.132
1.446MetIle: 1.446 ± 0.422
2.36MetLys: 2.36 ± 0.389
1.675MetLeu: 1.675 ± 0.337
0.989MetMet: 0.989 ± 0.287
1.522MetAsn: 1.522 ± 0.366
1.066MetPro: 1.066 ± 0.272
0.837MetGln: 0.837 ± 0.21
1.37MetArg: 1.37 ± 0.312
2.055MetSer: 2.055 ± 0.334
1.751MetThr: 1.751 ± 0.378
1.979MetVal: 1.979 ± 0.323
0.304MetTrp: 0.304 ± 0.139
0.533MetTyr: 0.533 ± 0.2
0.0MetXaa: 0.0 ± 0.0
Asn
3.882AsnAla: 3.882 ± 0.851
0.533AsnCys: 0.533 ± 0.195
1.675AsnAsp: 1.675 ± 0.338
3.654AsnGlu: 3.654 ± 0.527
1.751AsnPhe: 1.751 ± 0.372
4.947AsnGly: 4.947 ± 0.816
1.066AsnHis: 1.066 ± 0.327
3.577AsnIle: 3.577 ± 0.623
3.121AsnLys: 3.121 ± 0.415
3.958AsnLeu: 3.958 ± 0.549
1.066AsnMet: 1.066 ± 0.247
1.598AsnAsn: 1.598 ± 0.244
2.055AsnPro: 2.055 ± 0.405
1.751AsnGln: 1.751 ± 0.317
2.816AsnArg: 2.816 ± 0.424
3.045AsnSer: 3.045 ± 0.563
2.664AsnThr: 2.664 ± 0.517
2.512AsnVal: 2.512 ± 0.506
0.761AsnTrp: 0.761 ± 0.252
1.979AsnTyr: 1.979 ± 0.604
0.0AsnXaa: 0.0 ± 0.0
Pro
1.598ProAla: 1.598 ± 0.359
0.381ProCys: 0.381 ± 0.139
1.446ProAsp: 1.446 ± 0.338
3.349ProGlu: 3.349 ± 0.562
1.522ProPhe: 1.522 ± 0.301
1.675ProGly: 1.675 ± 0.323
0.685ProHis: 0.685 ± 0.217
2.283ProIle: 2.283 ± 0.451
1.979ProLys: 1.979 ± 0.378
2.588ProLeu: 2.588 ± 0.473
1.218ProMet: 1.218 ± 0.233
1.066ProAsn: 1.066 ± 0.327
1.218ProPro: 1.218 ± 0.292
0.685ProGln: 0.685 ± 0.181
1.522ProArg: 1.522 ± 0.367
1.979ProSer: 1.979 ± 0.327
1.903ProThr: 1.903 ± 0.423
2.36ProVal: 2.36 ± 0.407
0.457ProTrp: 0.457 ± 0.182
1.522ProTyr: 1.522 ± 0.347
0.0ProXaa: 0.0 ± 0.0
Gln
2.664GlnAla: 2.664 ± 0.361
0.152GlnCys: 0.152 ± 0.1
1.218GlnAsp: 1.218 ± 0.379
1.675GlnGlu: 1.675 ± 0.487
0.913GlnPhe: 0.913 ± 0.205
1.37GlnGly: 1.37 ± 0.294
0.152GlnHis: 0.152 ± 0.115
2.436GlnIle: 2.436 ± 0.473
3.806GlnLys: 3.806 ± 0.499
1.979GlnLeu: 1.979 ± 0.437
0.685GlnMet: 0.685 ± 0.191
1.979GlnAsn: 1.979 ± 0.367
1.218GlnPro: 1.218 ± 0.288
1.37GlnGln: 1.37 ± 0.325
0.989GlnArg: 0.989 ± 0.318
2.055GlnSer: 2.055 ± 0.372
0.989GlnThr: 0.989 ± 0.27
1.294GlnVal: 1.294 ± 0.326
0.228GlnTrp: 0.228 ± 0.142
1.142GlnTyr: 1.142 ± 0.291
0.0GlnXaa: 0.0 ± 0.0
Arg
2.283ArgAla: 2.283 ± 0.492
1.142ArgCys: 1.142 ± 0.295
2.055ArgAsp: 2.055 ± 0.382
3.045ArgGlu: 3.045 ± 0.56
1.598ArgPhe: 1.598 ± 0.286
2.283ArgGly: 2.283 ± 0.428
0.609ArgHis: 0.609 ± 0.237
3.73ArgIle: 3.73 ± 0.54
4.415ArgLys: 4.415 ± 0.756
4.339ArgLeu: 4.339 ± 0.694
1.37ArgMet: 1.37 ± 0.313
2.588ArgAsn: 2.588 ± 0.521
0.989ArgPro: 0.989 ± 0.318
1.522ArgGln: 1.522 ± 0.475
2.588ArgArg: 2.588 ± 0.61
2.436ArgSer: 2.436 ± 0.493
2.588ArgThr: 2.588 ± 0.45
3.577ArgVal: 3.577 ± 0.534
1.066ArgTrp: 1.066 ± 0.364
1.522ArgTyr: 1.522 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
5.1SerAla: 5.1 ± 0.583
0.152SerCys: 0.152 ± 0.083
4.567SerAsp: 4.567 ± 0.519
5.252SerGlu: 5.252 ± 0.596
2.436SerPhe: 2.436 ± 0.508
3.958SerGly: 3.958 ± 0.606
0.913SerHis: 0.913 ± 0.31
5.252SerIle: 5.252 ± 0.751
5.176SerLys: 5.176 ± 0.585
5.937SerLeu: 5.937 ± 0.801
1.827SerMet: 1.827 ± 0.558
3.273SerAsn: 3.273 ± 0.424
1.903SerPro: 1.903 ± 0.318
2.055SerGln: 2.055 ± 0.56
2.588SerArg: 2.588 ± 0.504
6.013SerSer: 6.013 ± 1.159
3.197SerThr: 3.197 ± 0.641
4.719SerVal: 4.719 ± 0.693
0.761SerTrp: 0.761 ± 0.276
2.436SerTyr: 2.436 ± 0.468
0.0SerXaa: 0.0 ± 0.0
Thr
4.491ThrAla: 4.491 ± 0.775
0.0ThrCys: 0.0 ± 0.0
2.968ThrAsp: 2.968 ± 0.433
4.339ThrGlu: 4.339 ± 0.444
1.903ThrPhe: 1.903 ± 0.366
3.349ThrGly: 3.349 ± 0.51
1.066ThrHis: 1.066 ± 0.293
3.882ThrIle: 3.882 ± 0.674
4.11ThrLys: 4.11 ± 0.586
4.415ThrLeu: 4.415 ± 0.583
1.218ThrMet: 1.218 ± 0.254
2.283ThrAsn: 2.283 ± 0.323
1.751ThrPro: 1.751 ± 0.392
1.294ThrGln: 1.294 ± 0.355
1.294ThrArg: 1.294 ± 0.261
3.577ThrSer: 3.577 ± 0.628
3.197ThrThr: 3.197 ± 0.601
4.339ThrVal: 4.339 ± 1.022
0.533ThrTrp: 0.533 ± 0.202
1.598ThrTyr: 1.598 ± 0.314
0.0ThrXaa: 0.0 ± 0.0
Val
3.197ValAla: 3.197 ± 0.399
0.913ValCys: 0.913 ± 0.266
4.339ValAsp: 4.339 ± 0.546
3.806ValGlu: 3.806 ± 0.564
2.664ValPhe: 2.664 ± 0.512
3.425ValGly: 3.425 ± 0.817
0.837ValHis: 0.837 ± 0.28
5.1ValIle: 5.1 ± 1.083
5.709ValLys: 5.709 ± 0.75
5.328ValLeu: 5.328 ± 0.492
0.989ValMet: 0.989 ± 0.287
3.425ValAsn: 3.425 ± 0.544
2.131ValPro: 2.131 ± 0.374
1.751ValGln: 1.751 ± 0.356
2.892ValArg: 2.892 ± 0.495
4.339ValSer: 4.339 ± 0.578
2.968ValThr: 2.968 ± 0.57
4.186ValVal: 4.186 ± 0.578
0.685ValTrp: 0.685 ± 0.251
2.664ValTyr: 2.664 ± 0.469
0.0ValXaa: 0.0 ± 0.0
Trp
0.457TrpAla: 0.457 ± 0.179
0.228TrpCys: 0.228 ± 0.155
0.533TrpAsp: 0.533 ± 0.213
1.675TrpGlu: 1.675 ± 0.364
0.381TrpPhe: 0.381 ± 0.194
0.837TrpGly: 0.837 ± 0.246
0.304TrpHis: 0.304 ± 0.136
0.228TrpIle: 0.228 ± 0.126
0.761TrpLys: 0.761 ± 0.281
0.989TrpLeu: 0.989 ± 0.235
0.533TrpMet: 0.533 ± 0.218
0.761TrpAsn: 0.761 ± 0.317
0.228TrpPro: 0.228 ± 0.112
0.533TrpGln: 0.533 ± 0.18
0.457TrpArg: 0.457 ± 0.186
0.761TrpSer: 0.761 ± 0.241
0.381TrpThr: 0.381 ± 0.182
0.685TrpVal: 0.685 ± 0.283
0.076TrpTrp: 0.076 ± 0.09
0.381TrpTyr: 0.381 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.501TyrAla: 3.501 ± 0.665
0.152TyrCys: 0.152 ± 0.104
2.588TyrAsp: 2.588 ± 0.547
2.968TyrGlu: 2.968 ± 0.457
1.751TyrPhe: 1.751 ± 0.386
3.045TyrGly: 3.045 ± 0.671
0.457TyrHis: 0.457 ± 0.181
3.425TyrIle: 3.425 ± 0.43
2.588TyrLys: 2.588 ± 0.444
3.501TyrLeu: 3.501 ± 0.661
0.837TyrMet: 0.837 ± 0.343
1.675TyrAsn: 1.675 ± 0.475
1.218TyrPro: 1.218 ± 0.324
1.37TyrGln: 1.37 ± 0.298
1.979TyrArg: 1.979 ± 0.477
3.197TyrSer: 3.197 ± 0.627
1.751TyrThr: 1.751 ± 0.389
1.675TyrVal: 1.675 ± 0.373
0.304TyrTrp: 0.304 ± 0.13
1.675TyrTyr: 1.675 ± 0.495
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13139 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
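A protein-level bootstrap of this kind could be sketched in Python as follows. Whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported. The function names, the use of the population standard deviation as the spread measure, and the toy sequences are all assumptions made for illustration; the source states only that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the per mille frequency of `pair` over resampled proteomes.

    Proteins (not residues) are the resampling unit. The standard
    deviation used here is an assumed choice of spread measure.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "AKK", "MKLLE"]
print(bootstrap_error(toy, "KK"))
```

Resampling whole proteins keeps the residues of each protein together, so the estimate reflects variation between proteins rather than between individual positions.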

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski