Amino acid dipeptide frequency for Streptococcus phage IPP44

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
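As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It is not the code used by Proteome-pI. It assumes that dipeptides are counted as overlapping pairs within each protein, that pairs never span two proteins, that any residue outside the 20 standard ones is mapped to X (Xaa), and that frequencies are normalised by the total number of dipeptides. The function name dipeptide_frequencies and the example sequences are made up for illustration, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two toy sequences (hypothetical, not from the phage proteome).
freqs = dipeptide_frequencies(["MKKLA", "MKUA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The real database may normalise differently (for example by the total number of amino acids), so absolute values from this sketch are not guaranteed to match the table.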

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
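The uniform expectation and the per mille convention can be sketched in a few lines of Python. This is only an illustration: it assumes that "expected" means the uniform 1/400 baseline described above, which may not be exactly how the table colours were assigned. The names per_mille_to_fraction and classify are hypothetical helpers, and the three values are copied from the table below.

```python
N_STANDARD = 20  # standard amino acids, ignoring Xaa

# Uniform expectation: every ordered pair of standard residues equally likely.
expected_fraction = 1 / N_STANDARD**2      # 0.0025, i.e. 0.25 %
expected_per_mille = 1000 / N_STANDARD**2  # 2.5 per mille

def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3

def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

# A few values copied from the table (in per mille).
observed = {"GluLeu": 11.088, "AlaAla": 2.64, "CysCys": 0.0}
for name, value in observed.items():
    print(name, per_mille_to_fraction(value), classify(value))
```

A simple threshold like this ignores the bootstrap error reported with each value, so a dipeptide close to 2.5‰ should not be read as significantly over- or underrepresented.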

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.64AlaAla: 2.64 ± 0.911
0.634AlaCys: 0.634 ± 0.259
4.329AlaAsp: 4.329 ± 0.597
5.597AlaGlu: 5.597 ± 0.863
2.006AlaPhe: 2.006 ± 0.406
3.273AlaGly: 3.273 ± 0.938
0.95AlaHis: 0.95 ± 0.263
5.808AlaIle: 5.808 ± 1.383
5.597AlaLys: 5.597 ± 0.59
6.441AlaLeu: 6.441 ± 0.794
2.218AlaMet: 2.218 ± 0.481
3.59AlaAsn: 3.59 ± 0.582
1.162AlaPro: 1.162 ± 0.383
2.957AlaGln: 2.957 ± 0.65
2.851AlaArg: 2.851 ± 0.532
3.907AlaSer: 3.907 ± 0.486
3.907AlaThr: 3.907 ± 0.689
2.851AlaVal: 2.851 ± 0.753
0.95AlaTrp: 0.95 ± 0.52
2.534AlaTyr: 2.534 ± 0.454
0.0AlaXaa: 0.0 ± 0.0
Cys
0.106CysAla: 0.106 ± 0.105
0.0CysCys: 0.0 ± 0.0
0.211CysAsp: 0.211 ± 0.132
1.056CysGlu: 1.056 ± 0.353
0.211CysPhe: 0.211 ± 0.156
0.317CysGly: 0.317 ± 0.178
0.422CysHis: 0.422 ± 0.201
0.317CysIle: 0.317 ± 0.214
0.106CysLys: 0.106 ± 0.124
0.845CysLeu: 0.845 ± 0.334
0.0CysMet: 0.0 ± 0.0
0.211CysAsn: 0.211 ± 0.16
0.0CysPro: 0.0 ± 0.0
0.106CysGln: 0.106 ± 0.12
0.422CysArg: 0.422 ± 0.263
0.528CysSer: 0.528 ± 0.304
0.211CysThr: 0.211 ± 0.166
0.317CysVal: 0.317 ± 0.18
0.0CysTrp: 0.0 ± 0.0
0.422CysTyr: 0.422 ± 0.204
0.0CysXaa: 0.0 ± 0.0
Asp
3.696AspAla: 3.696 ± 0.925
0.211AspCys: 0.211 ± 0.159
3.379AspAsp: 3.379 ± 0.644
4.118AspGlu: 4.118 ± 0.769
2.957AspPhe: 2.957 ± 0.673
4.329AspGly: 4.329 ± 0.806
1.162AspHis: 1.162 ± 0.322
4.224AspIle: 4.224 ± 0.737
5.808AspLys: 5.808 ± 0.959
3.696AspLeu: 3.696 ± 0.639
1.69AspMet: 1.69 ± 0.44
2.64AspAsn: 2.64 ± 0.488
2.112AspPro: 2.112 ± 0.446
0.739AspGln: 0.739 ± 0.208
2.112AspArg: 2.112 ± 0.503
3.168AspSer: 3.168 ± 0.511
3.485AspThr: 3.485 ± 0.559
3.907AspVal: 3.907 ± 0.589
1.267AspTrp: 1.267 ± 0.326
3.59AspTyr: 3.59 ± 0.744
0.0AspXaa: 0.0 ± 0.0
Glu
4.963GluAla: 4.963 ± 0.991
0.528GluCys: 0.528 ± 0.27
3.696GluAsp: 3.696 ± 0.67
8.976GluGlu: 8.976 ± 1.293
3.062GluPhe: 3.062 ± 0.468
3.062GluGly: 3.062 ± 0.54
1.162GluHis: 1.162 ± 0.286
7.181GluIle: 7.181 ± 0.756
9.293GluLys: 9.293 ± 1.498
11.088GluLeu: 11.088 ± 1.321
2.006GluMet: 2.006 ± 0.654
4.224GluAsn: 4.224 ± 0.769
2.112GluPro: 2.112 ± 0.509
3.59GluGln: 3.59 ± 0.865
3.59GluArg: 3.59 ± 0.671
4.224GluSer: 4.224 ± 0.637
4.541GluThr: 4.541 ± 0.656
4.435GluVal: 4.435 ± 0.625
1.69GluTrp: 1.69 ± 0.381
2.534GluTyr: 2.534 ± 0.517
0.0GluXaa: 0.0 ± 0.0
Phe
2.429PheAla: 2.429 ± 0.612
0.211PheCys: 0.211 ± 0.136
3.907PheAsp: 3.907 ± 0.771
3.379PheGlu: 3.379 ± 0.51
1.373PhePhe: 1.373 ± 0.567
2.006PheGly: 2.006 ± 0.475
0.634PheHis: 0.634 ± 0.295
3.273PheIle: 3.273 ± 0.577
3.696PheLys: 3.696 ± 0.815
1.901PheLeu: 1.901 ± 0.554
1.795PheMet: 1.795 ± 0.597
2.746PheAsn: 2.746 ± 0.436
0.739PhePro: 0.739 ± 0.283
1.267PheGln: 1.267 ± 0.303
1.584PheArg: 1.584 ± 0.369
2.218PheSer: 2.218 ± 0.392
3.062PheThr: 3.062 ± 0.509
1.267PheVal: 1.267 ± 0.424
0.317PheTrp: 0.317 ± 0.157
1.584PheTyr: 1.584 ± 0.428
0.0PheXaa: 0.0 ± 0.0
Gly
3.59GlyAla: 3.59 ± 0.964
0.106GlyCys: 0.106 ± 0.103
2.64GlyAsp: 2.64 ± 0.494
4.013GlyGlu: 4.013 ± 0.497
2.64GlyPhe: 2.64 ± 0.424
3.485GlyGly: 3.485 ± 0.634
0.95GlyHis: 0.95 ± 0.339
5.174GlyIle: 5.174 ± 0.722
5.28GlyLys: 5.28 ± 0.47
5.28GlyLeu: 5.28 ± 0.983
1.373GlyMet: 1.373 ± 0.382
2.64GlyAsn: 2.64 ± 0.512
0.211GlyPro: 0.211 ± 0.129
2.957GlyGln: 2.957 ± 0.608
2.218GlyArg: 2.218 ± 0.544
4.224GlySer: 4.224 ± 0.679
3.696GlyThr: 3.696 ± 0.769
3.485GlyVal: 3.485 ± 0.551
1.478GlyTrp: 1.478 ± 0.838
2.429GlyTyr: 2.429 ± 0.562
0.0GlyXaa: 0.0 ± 0.0
His
1.056HisAla: 1.056 ± 0.318
0.0HisCys: 0.0 ± 0.0
0.317HisAsp: 0.317 ± 0.16
1.267HisGlu: 1.267 ± 0.411
1.056HisPhe: 1.056 ± 0.292
0.739HisGly: 0.739 ± 0.299
0.211HisHis: 0.211 ± 0.177
1.795HisIle: 1.795 ± 0.358
0.845HisLys: 0.845 ± 0.249
1.373HisLeu: 1.373 ± 0.422
0.422HisMet: 0.422 ± 0.191
1.056HisAsn: 1.056 ± 0.314
0.211HisPro: 0.211 ± 0.151
0.845HisGln: 0.845 ± 0.309
0.422HisArg: 0.422 ± 0.205
0.95HisSer: 0.95 ± 0.402
1.584HisThr: 1.584 ± 0.41
1.056HisVal: 1.056 ± 0.355
0.0HisTrp: 0.0 ± 0.0
0.95HisTyr: 0.95 ± 0.369
0.0HisXaa: 0.0 ± 0.0
Ile
5.069IleAla: 5.069 ± 0.973
0.845IleCys: 0.845 ± 0.364
5.385IleAsp: 5.385 ± 0.803
5.913IleGlu: 5.913 ± 0.986
2.534IlePhe: 2.534 ± 0.535
3.485IleGly: 3.485 ± 0.758
1.267IleHis: 1.267 ± 0.348
3.59IleIle: 3.59 ± 0.725
5.174IleLys: 5.174 ± 0.827
5.597IleLeu: 5.597 ± 0.894
1.267IleMet: 1.267 ± 0.433
5.28IleAsn: 5.28 ± 0.673
1.901IlePro: 1.901 ± 0.454
3.168IleGln: 3.168 ± 0.528
3.062IleArg: 3.062 ± 0.634
5.28IleSer: 5.28 ± 0.778
3.696IleThr: 3.696 ± 0.468
3.485IleVal: 3.485 ± 0.644
0.845IleTrp: 0.845 ± 0.339
2.429IleTyr: 2.429 ± 0.521
0.0IleXaa: 0.0 ± 0.0
Lys
6.23LysAla: 6.23 ± 0.84
0.211LysCys: 0.211 ± 0.159
5.28LysAsp: 5.28 ± 0.656
8.131LysGlu: 8.131 ± 1.351
2.112LysPhe: 2.112 ± 0.382
4.329LysGly: 4.329 ± 0.702
2.006LysHis: 2.006 ± 0.384
5.385LysIle: 5.385 ± 0.928
5.913LysLys: 5.913 ± 1.015
7.181LysLeu: 7.181 ± 0.871
2.218LysMet: 2.218 ± 0.56
4.752LysAsn: 4.752 ± 0.708
1.795LysPro: 1.795 ± 0.504
4.224LysGln: 4.224 ± 0.699
3.59LysArg: 3.59 ± 0.576
6.336LysSer: 6.336 ± 0.679
5.808LysThr: 5.808 ± 0.64
3.907LysVal: 3.907 ± 0.746
1.056LysTrp: 1.056 ± 0.305
3.379LysTyr: 3.379 ± 0.608
0.0LysXaa: 0.0 ± 0.0
Leu
6.758LeuAla: 6.758 ± 0.756
0.317LeuCys: 0.317 ± 0.208
6.336LeuAsp: 6.336 ± 0.597
7.286LeuGlu: 7.286 ± 1.004
3.273LeuPhe: 3.273 ± 0.439
5.913LeuGly: 5.913 ± 0.783
1.056LeuHis: 1.056 ± 0.362
6.125LeuIle: 6.125 ± 0.903
8.448LeuLys: 8.448 ± 0.854
6.864LeuLeu: 6.864 ± 1.03
2.429LeuMet: 2.429 ± 0.726
5.174LeuAsn: 5.174 ± 0.708
2.957LeuPro: 2.957 ± 0.615
2.746LeuGln: 2.746 ± 0.625
3.696LeuArg: 3.696 ± 0.571
7.075LeuSer: 7.075 ± 0.783
5.069LeuThr: 5.069 ± 1.083
4.541LeuVal: 4.541 ± 0.774
0.95LeuTrp: 0.95 ± 0.472
2.64LeuTyr: 2.64 ± 0.44
0.0LeuXaa: 0.0 ± 0.0
Met
1.69MetAla: 1.69 ± 0.436
0.422MetCys: 0.422 ± 0.22
0.95MetAsp: 0.95 ± 0.34
2.534MetGlu: 2.534 ± 0.763
0.634MetPhe: 0.634 ± 0.334
1.478MetGly: 1.478 ± 0.326
0.0MetHis: 0.0 ± 0.0
1.267MetIle: 1.267 ± 0.411
2.534MetLys: 2.534 ± 0.556
1.69MetLeu: 1.69 ± 0.338
0.739MetMet: 0.739 ± 0.31
2.112MetAsn: 2.112 ± 0.497
0.528MetPro: 0.528 ± 0.325
0.422MetGln: 0.422 ± 0.214
1.162MetArg: 1.162 ± 0.494
2.112MetSer: 2.112 ± 0.505
2.112MetThr: 2.112 ± 0.422
1.795MetVal: 1.795 ± 0.42
0.106MetTrp: 0.106 ± 0.095
0.106MetTyr: 0.106 ± 0.126
0.0MetXaa: 0.0 ± 0.0
Asn
3.59AsnAla: 3.59 ± 0.685
0.106AsnCys: 0.106 ± 0.124
3.379AsnAsp: 3.379 ± 0.671
5.174AsnGlu: 5.174 ± 0.503
2.112AsnPhe: 2.112 ± 0.369
4.329AsnGly: 4.329 ± 1.037
1.162AsnHis: 1.162 ± 0.35
3.907AsnIle: 3.907 ± 0.685
4.224AsnLys: 4.224 ± 0.575
5.808AsnLeu: 5.808 ± 0.828
0.845AsnMet: 0.845 ± 0.377
2.534AsnAsn: 2.534 ± 0.439
2.323AsnPro: 2.323 ± 0.543
2.64AsnGln: 2.64 ± 0.658
3.062AsnArg: 3.062 ± 0.462
3.485AsnSer: 3.485 ± 0.511
2.64AsnThr: 2.64 ± 0.446
4.013AsnVal: 4.013 ± 0.626
0.845AsnTrp: 0.845 ± 0.294
1.267AsnTyr: 1.267 ± 0.281
0.0AsnXaa: 0.0 ± 0.0
Pro
1.584ProAla: 1.584 ± 0.483
0.211ProCys: 0.211 ± 0.17
1.584ProAsp: 1.584 ± 0.525
1.901ProGlu: 1.901 ± 0.601
0.739ProPhe: 0.739 ± 0.277
0.845ProGly: 0.845 ± 0.353
0.422ProHis: 0.422 ± 0.215
1.267ProIle: 1.267 ± 0.403
2.112ProLys: 2.112 ± 0.469
2.323ProLeu: 2.323 ± 0.425
0.634ProMet: 0.634 ± 0.297
1.056ProAsn: 1.056 ± 0.417
0.528ProPro: 0.528 ± 0.266
1.795ProGln: 1.795 ± 0.341
1.267ProArg: 1.267 ± 0.328
1.478ProSer: 1.478 ± 0.443
1.584ProThr: 1.584 ± 0.374
1.795ProVal: 1.795 ± 0.335
0.528ProTrp: 0.528 ± 0.265
1.162ProTyr: 1.162 ± 0.476
0.0ProXaa: 0.0 ± 0.0
Gln
3.907GlnAla: 3.907 ± 0.647
0.0GlnCys: 0.0 ± 0.0
1.795GlnAsp: 1.795 ± 0.468
4.329GlnGlu: 4.329 ± 0.919
1.267GlnPhe: 1.267 ± 0.283
1.69GlnGly: 1.69 ± 0.398
0.317GlnHis: 0.317 ± 0.143
2.534GlnIle: 2.534 ± 0.524
2.746GlnLys: 2.746 ± 0.511
3.801GlnLeu: 3.801 ± 0.614
0.739GlnMet: 0.739 ± 0.253
2.534GlnAsn: 2.534 ± 0.547
1.267GlnPro: 1.267 ± 0.468
1.267GlnGln: 1.267 ± 0.249
2.112GlnArg: 2.112 ± 0.657
3.273GlnSer: 3.273 ± 0.716
2.218GlnThr: 2.218 ± 0.448
2.957GlnVal: 2.957 ± 0.582
0.106GlnTrp: 0.106 ± 0.085
1.056GlnTyr: 1.056 ± 0.338
0.0GlnXaa: 0.0 ± 0.0
Arg
2.534ArgAla: 2.534 ± 0.393
0.422ArgCys: 0.422 ± 0.227
1.373ArgAsp: 1.373 ± 0.324
3.273ArgGlu: 3.273 ± 0.541
1.478ArgPhe: 1.478 ± 0.365
2.218ArgGly: 2.218 ± 0.567
0.95ArgHis: 0.95 ± 0.459
3.273ArgIle: 3.273 ± 0.592
3.696ArgLys: 3.696 ± 0.805
4.646ArgLeu: 4.646 ± 0.585
1.056ArgMet: 1.056 ± 0.257
2.746ArgAsn: 2.746 ± 0.714
1.69ArgPro: 1.69 ± 0.429
2.112ArgGln: 2.112 ± 0.578
2.746ArgArg: 2.746 ± 0.602
2.218ArgSer: 2.218 ± 0.342
2.323ArgThr: 2.323 ± 0.835
3.062ArgVal: 3.062 ± 0.401
0.739ArgTrp: 0.739 ± 0.302
2.429ArgTyr: 2.429 ± 0.65
0.0ArgXaa: 0.0 ± 0.0
Ser
4.013SerAla: 4.013 ± 1.013
0.211SerCys: 0.211 ± 0.129
3.59SerAsp: 3.59 ± 0.818
5.174SerGlu: 5.174 ± 0.841
3.379SerPhe: 3.379 ± 0.613
4.752SerGly: 4.752 ± 0.719
0.845SerHis: 0.845 ± 0.444
4.435SerIle: 4.435 ± 0.67
5.069SerLys: 5.069 ± 0.714
5.491SerLeu: 5.491 ± 0.83
1.056SerMet: 1.056 ± 0.388
4.013SerAsn: 4.013 ± 0.664
1.267SerPro: 1.267 ± 0.279
2.534SerGln: 2.534 ± 0.628
2.746SerArg: 2.746 ± 0.544
4.118SerSer: 4.118 ± 0.989
4.857SerThr: 4.857 ± 0.792
4.224SerVal: 4.224 ± 1.122
1.056SerTrp: 1.056 ± 0.403
2.429SerTyr: 2.429 ± 0.447
0.0SerXaa: 0.0 ± 0.0
Thr
4.329ThrAla: 4.329 ± 0.819
0.422ThrCys: 0.422 ± 0.207
3.696ThrAsp: 3.696 ± 0.848
3.907ThrGlu: 3.907 ± 0.643
2.429ThrPhe: 2.429 ± 0.507
4.646ThrGly: 4.646 ± 0.736
1.056ThrHis: 1.056 ± 0.558
4.541ThrIle: 4.541 ± 0.622
4.857ThrLys: 4.857 ± 0.52
5.491ThrLeu: 5.491 ± 0.569
0.845ThrMet: 0.845 ± 0.242
3.379ThrAsn: 3.379 ± 0.572
1.267ThrPro: 1.267 ± 0.437
2.323ThrGln: 2.323 ± 0.483
2.851ThrArg: 2.851 ± 0.526
3.801ThrSer: 3.801 ± 0.764
4.541ThrThr: 4.541 ± 0.775
4.541ThrVal: 4.541 ± 1.117
1.373ThrTrp: 1.373 ± 0.356
1.901ThrTyr: 1.901 ± 0.451
0.0ThrXaa: 0.0 ± 0.0
Val
3.696ValAla: 3.696 ± 0.738
0.422ValCys: 0.422 ± 0.186
4.435ValAsp: 4.435 ± 0.585
6.23ValGlu: 6.23 ± 0.716
2.429ValPhe: 2.429 ± 0.775
3.59ValGly: 3.59 ± 0.757
0.528ValHis: 0.528 ± 0.247
2.323ValIle: 2.323 ± 0.543
4.752ValLys: 4.752 ± 0.654
4.435ValLeu: 4.435 ± 0.881
1.584ValMet: 1.584 ± 0.267
3.485ValAsn: 3.485 ± 0.66
1.478ValPro: 1.478 ± 0.336
1.901ValGln: 1.901 ± 0.717
2.957ValArg: 2.957 ± 0.663
2.64ValSer: 2.64 ± 0.55
4.541ValThr: 4.541 ± 0.593
4.224ValVal: 4.224 ± 0.783
0.845ValTrp: 0.845 ± 0.277
1.795ValTyr: 1.795 ± 0.497
0.0ValXaa: 0.0 ± 0.0
Trp
0.317TrpAla: 0.317 ± 0.156
0.106TrpCys: 0.106 ± 0.1
0.317TrpAsp: 0.317 ± 0.179
1.162TrpGlu: 1.162 ± 0.343
1.373TrpPhe: 1.373 ± 0.452
1.056TrpGly: 1.056 ± 0.327
0.317TrpHis: 0.317 ± 0.186
0.95TrpIle: 0.95 ± 0.391
0.95TrpLys: 0.95 ± 0.599
1.373TrpLeu: 1.373 ± 0.423
0.634TrpMet: 0.634 ± 0.254
1.162TrpAsn: 1.162 ± 0.326
0.106TrpPro: 0.106 ± 0.085
0.528TrpGln: 0.528 ± 0.214
0.528TrpArg: 0.528 ± 0.208
1.584TrpSer: 1.584 ± 0.568
0.845TrpThr: 0.845 ± 0.231
0.739TrpVal: 0.739 ± 0.234
0.211TrpTrp: 0.211 ± 0.277
0.634TrpTyr: 0.634 ± 0.548
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.006TyrAla: 2.006 ± 0.407
0.422TyrCys: 0.422 ± 0.304
1.69TyrAsp: 1.69 ± 0.428
2.534TyrGlu: 2.534 ± 0.587
2.323TyrPhe: 2.323 ± 0.625
2.323TyrGly: 2.323 ± 0.478
0.739TyrHis: 0.739 ± 0.243
1.901TyrIle: 1.901 ± 0.522
2.746TyrLys: 2.746 ± 0.578
4.329TyrLeu: 4.329 ± 0.92
0.739TyrMet: 0.739 ± 0.237
2.112TyrAsn: 2.112 ± 0.525
1.162TyrPro: 1.162 ± 0.333
1.901TyrGln: 1.901 ± 0.439
2.006TyrArg: 2.006 ± 0.697
2.64TyrSer: 2.64 ± 0.342
1.478TyrThr: 1.478 ± 0.33
1.69TyrVal: 1.69 ± 0.428
0.528TyrTrp: 0.528 ± 0.21
2.112TyrTyr: 2.112 ± 0.663
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (9471 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
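For readers unfamiliar with protein-level bootstrapping, the idea can be sketched as follows. This is an assumption-laden illustration rather than the database's actual procedure: it resamples whole proteins with replacement 100 times, recomputes the dipeptide frequency for each resample, and reports the standard deviation of those estimates as the error. The function names, the toy sequences, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille,
    estimated by resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # One bootstrap replicate: draw as many proteins as the original set has.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy sequences (hypothetical, not from the phage proteome).
mean, err = bootstrap_error(["MKKLA", "MKLA", "AAKK"], "KK")
print(f"KK: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.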

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski