Amino acid dipepetide frequency for Enterococcus phage Ec-ZZ2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.759AlaAla: 0.759 ± 0.337
0.337AlaCys: 0.337 ± 0.188
2.698AlaAsp: 2.698 ± 0.526
4.047AlaGlu: 4.047 ± 0.478
2.53AlaPhe: 2.53 ± 0.433
3.12AlaGly: 3.12 ± 0.575
0.927AlaHis: 0.927 ± 0.283
4.89AlaIle: 4.89 ± 0.855
5.818AlaLys: 5.818 ± 0.858
5.396AlaLeu: 5.396 ± 0.729
3.035AlaMet: 3.035 ± 0.612
4.132AlaAsn: 4.132 ± 0.603
1.855AlaPro: 1.855 ± 0.404
1.18AlaGln: 1.18 ± 0.287
1.433AlaArg: 1.433 ± 0.314
3.457AlaSer: 3.457 ± 0.556
4.553AlaThr: 4.553 ± 0.715
4.553AlaVal: 4.553 ± 0.698
0.506AlaTrp: 0.506 ± 0.208
2.53AlaTyr: 2.53 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.253CysAla: 0.253 ± 0.136
0.0CysCys: 0.0 ± 0.0
0.422CysAsp: 0.422 ± 0.195
0.59CysGlu: 0.59 ± 0.236
0.084CysPhe: 0.084 ± 0.098
0.337CysGly: 0.337 ± 0.187
0.169CysHis: 0.169 ± 0.126
0.253CysIle: 0.253 ± 0.15
0.422CysLys: 0.422 ± 0.216
0.59CysLeu: 0.59 ± 0.265
0.169CysMet: 0.169 ± 0.142
0.759CysAsn: 0.759 ± 0.307
0.0CysPro: 0.0 ± 0.0
0.084CysGln: 0.084 ± 0.085
0.253CysArg: 0.253 ± 0.132
0.337CysSer: 0.337 ± 0.176
0.422CysThr: 0.422 ± 0.211
0.337CysVal: 0.337 ± 0.163
0.169CysTrp: 0.169 ± 0.12
0.169CysTyr: 0.169 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
3.373AspAla: 3.373 ± 0.572
0.337AspCys: 0.337 ± 0.174
2.192AspAsp: 2.192 ± 0.51
3.963AspGlu: 3.963 ± 0.559
2.951AspPhe: 2.951 ± 0.581
4.89AspGly: 4.89 ± 0.524
0.675AspHis: 0.675 ± 0.271
4.469AspIle: 4.469 ± 0.768
6.914AspLys: 6.914 ± 0.741
4.806AspLeu: 4.806 ± 0.61
1.686AspMet: 1.686 ± 0.39
4.722AspAsn: 4.722 ± 0.701
2.024AspPro: 2.024 ± 0.429
1.265AspGln: 1.265 ± 0.218
2.108AspArg: 2.108 ± 0.411
2.867AspSer: 2.867 ± 0.514
3.794AspThr: 3.794 ± 0.704
4.806AspVal: 4.806 ± 0.597
0.506AspTrp: 0.506 ± 0.245
3.035AspTyr: 3.035 ± 0.581
0.0AspXaa: 0.0 ± 0.0
Glu
4.384GluAla: 4.384 ± 0.644
0.506GluCys: 0.506 ± 0.236
4.975GluAsp: 4.975 ± 0.864
7.589GluGlu: 7.589 ± 1.684
2.782GluPhe: 2.782 ± 0.56
4.132GluGly: 4.132 ± 0.494
1.686GluHis: 1.686 ± 0.329
3.794GluIle: 3.794 ± 0.628
6.324GluLys: 6.324 ± 0.651
8.769GluLeu: 8.769 ± 1.015
2.361GluMet: 2.361 ± 0.545
3.963GluAsn: 3.963 ± 0.662
2.53GluPro: 2.53 ± 0.565
3.12GluGln: 3.12 ± 0.606
2.951GluArg: 2.951 ± 0.5
4.132GluSer: 4.132 ± 0.58
4.637GluThr: 4.637 ± 0.532
7.673GluVal: 7.673 ± 0.783
1.433GluTrp: 1.433 ± 0.29
3.288GluTyr: 3.288 ± 0.703
0.0GluXaa: 0.0 ± 0.0
Phe
2.024PheAla: 2.024 ± 0.316
0.253PheCys: 0.253 ± 0.148
3.12PheAsp: 3.12 ± 0.582
3.204PheGlu: 3.204 ± 0.739
0.927PhePhe: 0.927 ± 0.307
2.867PheGly: 2.867 ± 0.444
0.253PheHis: 0.253 ± 0.152
3.879PheIle: 3.879 ± 0.658
3.963PheLys: 3.963 ± 0.547
2.445PheLeu: 2.445 ± 0.456
1.265PheMet: 1.265 ± 0.374
2.951PheAsn: 2.951 ± 0.439
0.759PhePro: 0.759 ± 0.237
1.602PheGln: 1.602 ± 0.394
1.771PheArg: 1.771 ± 0.344
2.192PheSer: 2.192 ± 0.41
3.541PheThr: 3.541 ± 0.664
2.614PheVal: 2.614 ± 0.449
0.422PheTrp: 0.422 ± 0.194
1.096PheTyr: 1.096 ± 0.262
0.0PheXaa: 0.0 ± 0.0
Gly
4.384GlyAla: 4.384 ± 1.283
0.337GlyCys: 0.337 ± 0.214
3.541GlyAsp: 3.541 ± 0.72
3.12GlyGlu: 3.12 ± 0.463
3.457GlyPhe: 3.457 ± 0.392
4.384GlyGly: 4.384 ± 1.114
0.843GlyHis: 0.843 ± 0.275
5.396GlyIle: 5.396 ± 0.903
6.155GlyLys: 6.155 ± 0.775
5.818GlyLeu: 5.818 ± 0.757
1.686GlyMet: 1.686 ± 0.385
3.457GlyAsn: 3.457 ± 0.495
0.927GlyPro: 0.927 ± 0.291
2.024GlyGln: 2.024 ± 0.406
2.277GlyArg: 2.277 ± 0.351
3.794GlySer: 3.794 ± 0.687
4.3GlyThr: 4.3 ± 0.745
4.132GlyVal: 4.132 ± 0.646
1.18GlyTrp: 1.18 ± 0.286
2.782GlyTyr: 2.782 ± 0.55
0.0GlyXaa: 0.0 ± 0.0
His
0.843HisAla: 0.843 ± 0.296
0.337HisCys: 0.337 ± 0.15
0.843HisAsp: 0.843 ± 0.247
0.927HisGlu: 0.927 ± 0.322
0.759HisPhe: 0.759 ± 0.265
0.759HisGly: 0.759 ± 0.263
0.253HisHis: 0.253 ± 0.139
0.675HisIle: 0.675 ± 0.215
1.602HisLys: 1.602 ± 0.334
0.927HisLeu: 0.927 ± 0.28
0.337HisMet: 0.337 ± 0.178
1.349HisAsn: 1.349 ± 0.327
0.169HisPro: 0.169 ± 0.124
0.506HisGln: 0.506 ± 0.177
1.012HisArg: 1.012 ± 0.333
0.422HisSer: 0.422 ± 0.223
0.927HisThr: 0.927 ± 0.38
1.096HisVal: 1.096 ± 0.329
0.084HisTrp: 0.084 ± 0.084
0.843HisTyr: 0.843 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
4.3IleAla: 4.3 ± 0.589
0.59IleCys: 0.59 ± 0.276
5.481IleAsp: 5.481 ± 0.629
6.83IleGlu: 6.83 ± 0.94
2.108IlePhe: 2.108 ± 0.491
4.384IleGly: 4.384 ± 0.768
0.843IleHis: 0.843 ± 0.281
3.963IleIle: 3.963 ± 0.552
6.324IleLys: 6.324 ± 0.729
5.649IleLeu: 5.649 ± 0.797
1.602IleMet: 1.602 ± 0.387
5.059IleAsn: 5.059 ± 0.77
2.445IlePro: 2.445 ± 0.438
3.204IleGln: 3.204 ± 0.471
2.108IleArg: 2.108 ± 0.469
3.541IleSer: 3.541 ± 0.458
3.879IleThr: 3.879 ± 0.523
4.3IleVal: 4.3 ± 0.634
0.422IleTrp: 0.422 ± 0.191
1.686IleTyr: 1.686 ± 0.392
0.0IleXaa: 0.0 ± 0.0
Lys
6.998LysAla: 6.998 ± 0.93
0.169LysCys: 0.169 ± 0.105
5.059LysAsp: 5.059 ± 0.757
8.432LysGlu: 8.432 ± 0.895
3.879LysPhe: 3.879 ± 0.469
4.806LysGly: 4.806 ± 0.766
1.096LysHis: 1.096 ± 0.354
5.481LysIle: 5.481 ± 0.717
5.481LysLys: 5.481 ± 0.97
7.083LysLeu: 7.083 ± 0.958
3.204LysMet: 3.204 ± 0.624
5.987LysAsn: 5.987 ± 0.855
3.12LysPro: 3.12 ± 0.516
4.384LysGln: 4.384 ± 0.58
3.963LysArg: 3.963 ± 0.542
3.963LysSer: 3.963 ± 0.728
5.059LysThr: 5.059 ± 0.511
5.987LysVal: 5.987 ± 0.713
1.096LysTrp: 1.096 ± 0.303
3.204LysTyr: 3.204 ± 0.483
0.0LysXaa: 0.0 ± 0.0
Leu
3.963LeuAla: 3.963 ± 0.691
0.337LeuCys: 0.337 ± 0.224
6.155LeuAsp: 6.155 ± 0.769
8.685LeuGlu: 8.685 ± 0.951
3.457LeuPhe: 3.457 ± 0.548
5.312LeuGly: 5.312 ± 0.717
0.759LeuHis: 0.759 ± 0.232
4.806LeuIle: 4.806 ± 0.745
6.324LeuLys: 6.324 ± 0.966
6.155LeuLeu: 6.155 ± 0.831
1.602LeuMet: 1.602 ± 0.336
5.734LeuAsn: 5.734 ± 0.597
2.951LeuPro: 2.951 ± 0.572
3.963LeuGln: 3.963 ± 0.526
2.698LeuArg: 2.698 ± 0.473
4.975LeuSer: 4.975 ± 0.678
4.637LeuThr: 4.637 ± 0.475
5.396LeuVal: 5.396 ± 0.879
1.096LeuTrp: 1.096 ± 0.327
2.698LeuTyr: 2.698 ± 0.552
0.0LeuXaa: 0.0 ± 0.0
Met
1.686MetAla: 1.686 ± 0.418
0.253MetCys: 0.253 ± 0.149
1.602MetAsp: 1.602 ± 0.403
2.698MetGlu: 2.698 ± 0.524
1.265MetPhe: 1.265 ± 0.4
1.686MetGly: 1.686 ± 0.335
0.253MetHis: 0.253 ± 0.144
1.602MetIle: 1.602 ± 0.321
2.698MetLys: 2.698 ± 0.402
2.277MetLeu: 2.277 ± 0.398
0.675MetMet: 0.675 ± 0.284
2.024MetAsn: 2.024 ± 0.455
0.843MetPro: 0.843 ± 0.292
1.349MetGln: 1.349 ± 0.329
1.518MetArg: 1.518 ± 0.362
1.518MetSer: 1.518 ± 0.349
1.686MetThr: 1.686 ± 0.395
1.855MetVal: 1.855 ± 0.561
0.759MetTrp: 0.759 ± 0.265
1.18MetTyr: 1.18 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
4.3AsnAla: 4.3 ± 0.794
0.169AsnCys: 0.169 ± 0.133
3.288AsnAsp: 3.288 ± 0.578
5.818AsnGlu: 5.818 ± 0.796
1.855AsnPhe: 1.855 ± 0.449
6.83AsnGly: 6.83 ± 0.798
1.096AsnHis: 1.096 ± 0.34
4.047AsnIle: 4.047 ± 0.797
6.155AsnLys: 6.155 ± 0.848
4.637AsnLeu: 4.637 ± 0.549
2.192AsnMet: 2.192 ± 0.398
3.963AsnAsn: 3.963 ± 0.494
1.855AsnPro: 1.855 ± 0.379
1.518AsnGln: 1.518 ± 0.365
2.108AsnArg: 2.108 ± 0.372
3.541AsnSer: 3.541 ± 0.486
5.059AsnThr: 5.059 ± 0.735
3.541AsnVal: 3.541 ± 0.56
0.843AsnTrp: 0.843 ± 0.238
2.867AsnTyr: 2.867 ± 0.507
0.0AsnXaa: 0.0 ± 0.0
Pro
1.518ProAla: 1.518 ± 0.39
0.084ProCys: 0.084 ± 0.094
2.361ProAsp: 2.361 ± 0.506
2.867ProGlu: 2.867 ± 0.498
1.265ProPhe: 1.265 ± 0.271
0.253ProGly: 0.253 ± 0.11
0.337ProHis: 0.337 ± 0.168
1.855ProIle: 1.855 ± 0.463
2.614ProLys: 2.614 ± 0.592
3.035ProLeu: 3.035 ± 0.558
1.18ProMet: 1.18 ± 0.262
1.939ProAsn: 1.939 ± 0.411
0.506ProPro: 0.506 ± 0.206
1.18ProGln: 1.18 ± 0.36
0.59ProArg: 0.59 ± 0.19
1.433ProSer: 1.433 ± 0.388
2.108ProThr: 2.108 ± 0.563
1.939ProVal: 1.939 ± 0.442
0.253ProTrp: 0.253 ± 0.154
1.855ProTyr: 1.855 ± 0.486
0.0ProXaa: 0.0 ± 0.0
Gln
1.939GlnAla: 1.939 ± 0.377
0.506GlnCys: 0.506 ± 0.255
2.108GlnAsp: 2.108 ± 0.316
2.108GlnGlu: 2.108 ± 0.383
1.686GlnPhe: 1.686 ± 0.398
1.939GlnGly: 1.939 ± 0.448
0.675GlnHis: 0.675 ± 0.227
3.035GlnIle: 3.035 ± 0.662
2.445GlnLys: 2.445 ± 0.506
2.867GlnLeu: 2.867 ± 0.429
0.927GlnMet: 0.927 ± 0.317
1.686GlnAsn: 1.686 ± 0.328
1.433GlnPro: 1.433 ± 0.291
1.855GlnGln: 1.855 ± 0.341
1.855GlnArg: 1.855 ± 0.472
2.192GlnSer: 2.192 ± 0.423
1.939GlnThr: 1.939 ± 0.298
2.445GlnVal: 2.445 ± 0.449
0.506GlnTrp: 0.506 ± 0.259
2.53GlnTyr: 2.53 ± 0.415
0.0GlnXaa: 0.0 ± 0.0
Arg
1.602ArgAla: 1.602 ± 0.36
0.422ArgCys: 0.422 ± 0.178
2.53ArgAsp: 2.53 ± 0.529
1.433ArgGlu: 1.433 ± 0.383
1.602ArgPhe: 1.602 ± 0.359
2.108ArgGly: 2.108 ± 0.457
0.843ArgHis: 0.843 ± 0.328
2.108ArgIle: 2.108 ± 0.486
3.373ArgLys: 3.373 ± 0.597
3.204ArgLeu: 3.204 ± 0.608
1.012ArgMet: 1.012 ± 0.264
2.361ArgAsn: 2.361 ± 0.475
1.18ArgPro: 1.18 ± 0.29
1.602ArgGln: 1.602 ± 0.49
1.518ArgArg: 1.518 ± 0.364
1.686ArgSer: 1.686 ± 0.408
1.771ArgThr: 1.771 ± 0.438
2.614ArgVal: 2.614 ± 0.461
0.337ArgTrp: 0.337 ± 0.192
1.518ArgTyr: 1.518 ± 0.4
0.0ArgXaa: 0.0 ± 0.0
Ser
3.035SerAla: 3.035 ± 0.616
0.0SerCys: 0.0 ± 0.0
2.782SerAsp: 2.782 ± 0.406
4.132SerGlu: 4.132 ± 0.421
2.192SerPhe: 2.192 ± 0.341
4.89SerGly: 4.89 ± 0.799
1.433SerHis: 1.433 ± 0.334
3.963SerIle: 3.963 ± 0.571
5.143SerLys: 5.143 ± 0.824
3.541SerLeu: 3.541 ± 0.639
1.939SerMet: 1.939 ± 0.368
3.288SerAsn: 3.288 ± 0.519
1.012SerPro: 1.012 ± 0.291
2.53SerGln: 2.53 ± 0.579
1.096SerArg: 1.096 ± 0.339
2.698SerSer: 2.698 ± 0.496
3.71SerThr: 3.71 ± 0.708
2.951SerVal: 2.951 ± 0.623
0.759SerTrp: 0.759 ± 0.262
2.614SerTyr: 2.614 ± 0.645
0.0SerXaa: 0.0 ± 0.0
Thr
3.794ThrAla: 3.794 ± 0.59
0.084ThrCys: 0.084 ± 0.079
3.626ThrAsp: 3.626 ± 0.764
4.384ThrGlu: 4.384 ± 0.566
2.361ThrPhe: 2.361 ± 0.457
4.469ThrGly: 4.469 ± 0.687
1.096ThrHis: 1.096 ± 0.271
5.228ThrIle: 5.228 ± 0.734
6.914ThrLys: 6.914 ± 0.704
5.481ThrLeu: 5.481 ± 0.897
1.855ThrMet: 1.855 ± 0.438
3.457ThrAsn: 3.457 ± 0.574
2.445ThrPro: 2.445 ± 0.387
2.445ThrGln: 2.445 ± 0.468
1.939ThrArg: 1.939 ± 0.385
2.53ThrSer: 2.53 ± 0.409
4.89ThrThr: 4.89 ± 1.049
3.794ThrVal: 3.794 ± 0.434
0.59ThrTrp: 0.59 ± 0.267
2.782ThrTyr: 2.782 ± 0.521
0.0ThrXaa: 0.0 ± 0.0
Val
5.481ValAla: 5.481 ± 0.694
0.169ValCys: 0.169 ± 0.117
4.806ValAsp: 4.806 ± 0.683
5.143ValGlu: 5.143 ± 0.743
3.373ValPhe: 3.373 ± 0.433
4.722ValGly: 4.722 ± 0.775
0.759ValHis: 0.759 ± 0.3
4.722ValIle: 4.722 ± 0.726
5.059ValLys: 5.059 ± 0.906
4.553ValLeu: 4.553 ± 0.569
1.518ValMet: 1.518 ± 0.34
4.637ValAsn: 4.637 ± 0.665
2.024ValPro: 2.024 ± 0.375
1.855ValGln: 1.855 ± 0.521
2.024ValArg: 2.024 ± 0.404
5.987ValSer: 5.987 ± 0.941
3.879ValThr: 3.879 ± 0.723
4.3ValVal: 4.3 ± 0.61
0.759ValTrp: 0.759 ± 0.319
2.361ValTyr: 2.361 ± 0.518
0.0ValXaa: 0.0 ± 0.0
Trp
0.59TrpAla: 0.59 ± 0.211
0.169TrpCys: 0.169 ± 0.126
0.759TrpAsp: 0.759 ± 0.28
1.518TrpGlu: 1.518 ± 0.332
0.927TrpPhe: 0.927 ± 0.259
0.843TrpGly: 0.843 ± 0.313
0.084TrpHis: 0.084 ± 0.072
0.759TrpIle: 0.759 ± 0.326
0.506TrpLys: 0.506 ± 0.215
1.012TrpLeu: 1.012 ± 0.285
0.084TrpMet: 0.084 ± 0.083
0.843TrpAsn: 0.843 ± 0.272
0.0TrpPro: 0.0 ± 0.0
0.253TrpGln: 0.253 ± 0.117
0.506TrpArg: 0.506 ± 0.223
0.59TrpSer: 0.59 ± 0.208
0.675TrpThr: 0.675 ± 0.242
1.18TrpVal: 1.18 ± 0.29
0.337TrpTrp: 0.337 ± 0.192
0.506TrpTyr: 0.506 ± 0.219
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.53TyrAla: 2.53 ± 0.444
0.759TyrCys: 0.759 ± 0.317
3.035TyrAsp: 3.035 ± 0.557
3.541TyrGlu: 3.541 ± 0.555
1.686TyrPhe: 1.686 ± 0.31
1.265TyrGly: 1.265 ± 0.319
0.59TyrHis: 0.59 ± 0.225
3.879TyrIle: 3.879 ± 0.721
4.216TyrLys: 4.216 ± 0.72
3.457TyrLeu: 3.457 ± 0.638
1.012TyrMet: 1.012 ± 0.248
3.373TyrAsn: 3.373 ± 0.525
1.096TyrPro: 1.096 ± 0.328
0.759TyrGln: 0.759 ± 0.29
1.096TyrArg: 1.096 ± 0.352
1.855TyrSer: 1.855 ± 0.451
2.698TyrThr: 2.698 ± 0.622
2.614TyrVal: 2.614 ± 0.432
0.084TyrTrp: 0.084 ± 0.072
1.855TyrTyr: 1.855 ± 0.484
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11861 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski