Amino acid dipepetide frequency for Streptococcus satellite phage Javan653

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.317AlaCys: 0.317 ± 0.257
3.802AlaAsp: 3.802 ± 1.581
5.703AlaGlu: 5.703 ± 1.276
2.535AlaPhe: 2.535 ± 0.98
2.218AlaGly: 2.218 ± 0.901
2.535AlaHis: 2.535 ± 0.65
2.535AlaIle: 2.535 ± 0.869
3.485AlaLys: 3.485 ± 0.944
6.337AlaLeu: 6.337 ± 0.977
0.951AlaMet: 0.951 ± 0.786
3.169AlaAsn: 3.169 ± 0.865
2.218AlaPro: 2.218 ± 0.963
1.267AlaGln: 1.267 ± 0.494
2.535AlaArg: 2.535 ± 0.878
2.852AlaSer: 2.852 ± 0.731
1.901AlaThr: 1.901 ± 0.685
4.119AlaVal: 4.119 ± 1.178
0.317AlaTrp: 0.317 ± 0.258
3.485AlaTyr: 3.485 ± 0.999
0.0AlaXaa: 0.0 ± 0.0
Cys
0.634CysAla: 0.634 ± 0.444
0.0CysCys: 0.0 ± 0.0
0.951CysAsp: 0.951 ± 0.47
0.317CysGlu: 0.317 ± 0.309
0.634CysPhe: 0.634 ± 0.617
0.951CysGly: 0.951 ± 0.586
0.0CysHis: 0.0 ± 0.0
0.951CysIle: 0.951 ± 0.562
0.0CysLys: 0.0 ± 0.0
0.634CysLeu: 0.634 ± 0.626
0.0CysMet: 0.0 ± 0.0
0.317CysAsn: 0.317 ± 0.257
0.317CysPro: 0.317 ± 0.292
0.634CysGln: 0.634 ± 0.585
0.951CysArg: 0.951 ± 0.494
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.634CysTyr: 0.634 ± 0.425
0.0CysXaa: 0.0 ± 0.0
Asp
1.901AspAla: 1.901 ± 0.895
1.267AspCys: 1.267 ± 0.674
2.218AspAsp: 2.218 ± 0.932
4.119AspGlu: 4.119 ± 0.914
5.07AspPhe: 5.07 ± 0.814
3.485AspGly: 3.485 ± 0.934
0.0AspHis: 0.0 ± 0.0
6.971AspIle: 6.971 ± 1.127
5.387AspLys: 5.387 ± 0.877
6.971AspLeu: 6.971 ± 1.548
2.535AspMet: 2.535 ± 0.731
5.387AspAsn: 5.387 ± 0.793
1.584AspPro: 1.584 ± 0.416
0.634AspGln: 0.634 ± 0.33
1.901AspArg: 1.901 ± 0.585
2.535AspSer: 2.535 ± 0.761
3.485AspThr: 3.485 ± 0.871
1.267AspVal: 1.267 ± 0.527
0.0AspTrp: 0.0 ± 0.0
3.485AspTyr: 3.485 ± 1.081
0.0AspXaa: 0.0 ± 0.0
Glu
6.654GluAla: 6.654 ± 1.327
0.0GluCys: 0.0 ± 0.0
3.485GluAsp: 3.485 ± 1.097
4.119GluGlu: 4.119 ± 0.884
2.852GluPhe: 2.852 ± 1.045
1.901GluGly: 1.901 ± 0.568
1.584GluHis: 1.584 ± 0.617
5.703GluIle: 5.703 ± 1.324
4.436GluLys: 4.436 ± 1.279
9.506GluLeu: 9.506 ± 1.528
1.267GluMet: 1.267 ± 0.631
3.802GluAsn: 3.802 ± 1.419
1.267GluPro: 1.267 ± 0.598
2.852GluGln: 2.852 ± 0.55
2.535GluArg: 2.535 ± 1.114
4.436GluSer: 4.436 ± 1.153
3.169GluThr: 3.169 ± 0.69
4.436GluVal: 4.436 ± 1.186
1.267GluTrp: 1.267 ± 0.594
1.584GluTyr: 1.584 ± 0.644
0.0GluXaa: 0.0 ± 0.0
Phe
0.951PheAla: 0.951 ± 0.6
0.0PheCys: 0.0 ± 0.0
5.703PheAsp: 5.703 ± 1.434
1.901PheGlu: 1.901 ± 0.87
1.901PhePhe: 1.901 ± 0.708
2.852PheGly: 2.852 ± 1.294
0.634PheHis: 0.634 ± 0.372
5.07PheIle: 5.07 ± 1.388
3.802PheLys: 3.802 ± 1.063
3.802PheLeu: 3.802 ± 0.769
0.634PheMet: 0.634 ± 0.424
3.802PheAsn: 3.802 ± 0.79
1.584PhePro: 1.584 ± 0.802
1.584PheGln: 1.584 ± 0.863
1.901PheArg: 1.901 ± 0.737
3.802PheSer: 3.802 ± 1.112
1.584PheThr: 1.584 ± 0.544
2.218PheVal: 2.218 ± 0.887
0.0PheTrp: 0.0 ± 0.0
2.218PheTyr: 2.218 ± 0.846
0.0PheXaa: 0.0 ± 0.0
Gly
2.852GlyAla: 2.852 ± 1.421
0.634GlyCys: 0.634 ± 0.398
3.485GlyAsp: 3.485 ± 1.278
3.169GlyGlu: 3.169 ± 0.83
3.485GlyPhe: 3.485 ± 0.938
2.218GlyGly: 2.218 ± 0.911
0.317GlyHis: 0.317 ± 0.258
4.753GlyIle: 4.753 ± 0.949
3.485GlyLys: 3.485 ± 1.025
3.802GlyLeu: 3.802 ± 1.422
0.0GlyMet: 0.0 ± 0.0
2.218GlyAsn: 2.218 ± 0.788
0.0GlyPro: 0.0 ± 0.0
1.584GlyGln: 1.584 ± 0.566
3.169GlyArg: 3.169 ± 0.77
2.218GlySer: 2.218 ± 0.844
4.753GlyThr: 4.753 ± 1.26
3.169GlyVal: 3.169 ± 1.11
0.634GlyTrp: 0.634 ± 0.514
3.802GlyTyr: 3.802 ± 1.142
0.0GlyXaa: 0.0 ± 0.0
His
0.951HisAla: 0.951 ± 0.475
0.634HisCys: 0.634 ± 0.345
0.0HisAsp: 0.0 ± 0.0
0.951HisGlu: 0.951 ± 0.544
0.951HisPhe: 0.951 ± 0.774
1.901HisGly: 1.901 ± 0.625
0.634HisHis: 0.634 ± 0.444
1.901HisIle: 1.901 ± 0.739
1.901HisLys: 1.901 ± 0.724
1.901HisLeu: 1.901 ± 0.745
0.317HisMet: 0.317 ± 0.323
0.634HisAsn: 0.634 ± 0.617
0.317HisPro: 0.317 ± 0.277
0.634HisGln: 0.634 ± 0.402
1.267HisArg: 1.267 ± 0.639
0.634HisSer: 0.634 ± 0.398
1.901HisThr: 1.901 ± 0.622
0.634HisVal: 0.634 ± 0.344
0.0HisTrp: 0.0 ± 0.0
1.901HisTyr: 1.901 ± 0.618
0.0HisXaa: 0.0 ± 0.0
Ile
5.387IleAla: 5.387 ± 1.636
0.317IleCys: 0.317 ± 0.323
4.436IleAsp: 4.436 ± 1.437
5.387IleGlu: 5.387 ± 1.329
1.584IlePhe: 1.584 ± 0.732
2.535IleGly: 2.535 ± 0.732
1.901IleHis: 1.901 ± 0.539
6.02IleIle: 6.02 ± 1.523
6.654IleLys: 6.654 ± 1.453
5.703IleLeu: 5.703 ± 0.873
0.634IleMet: 0.634 ± 0.387
5.07IleAsn: 5.07 ± 1.667
4.436IlePro: 4.436 ± 1.078
2.535IleGln: 2.535 ± 0.755
2.218IleArg: 2.218 ± 0.703
4.119IleSer: 4.119 ± 1.006
6.02IleThr: 6.02 ± 0.695
3.485IleVal: 3.485 ± 0.824
0.317IleTrp: 0.317 ± 0.309
3.169IleTyr: 3.169 ± 0.839
0.0IleXaa: 0.0 ± 0.0
Lys
7.921LysAla: 7.921 ± 2.233
0.317LysCys: 0.317 ± 0.309
4.119LysAsp: 4.119 ± 0.794
9.823LysGlu: 9.823 ± 2.092
2.218LysPhe: 2.218 ± 0.652
5.387LysGly: 5.387 ± 1.575
2.852LysHis: 2.852 ± 0.877
6.654LysIle: 6.654 ± 1.247
8.872LysLys: 8.872 ± 1.639
6.971LysLeu: 6.971 ± 1.238
1.584LysMet: 1.584 ± 0.805
5.07LysAsn: 5.07 ± 1.551
3.485LysPro: 3.485 ± 0.83
3.485LysGln: 3.485 ± 0.785
4.753LysArg: 4.753 ± 1.123
6.337LysSer: 6.337 ± 1.532
6.337LysThr: 6.337 ± 1.522
4.436LysVal: 4.436 ± 0.89
0.0LysTrp: 0.0 ± 0.0
2.852LysTyr: 2.852 ± 0.832
0.0LysXaa: 0.0 ± 0.0
Leu
5.387LeuAla: 5.387 ± 1.441
0.951LeuCys: 0.951 ± 0.535
10.139LeuAsp: 10.139 ± 1.223
7.288LeuGlu: 7.288 ± 1.669
3.169LeuPhe: 3.169 ± 1.328
4.119LeuGly: 4.119 ± 1.045
1.267LeuHis: 1.267 ± 0.39
7.288LeuIle: 7.288 ± 1.506
9.823LeuLys: 9.823 ± 1.684
9.823LeuLeu: 9.823 ± 1.39
1.584LeuMet: 1.584 ± 0.564
7.921LeuAsn: 7.921 ± 1.047
3.485LeuPro: 3.485 ± 0.915
3.485LeuGln: 3.485 ± 0.85
3.485LeuArg: 3.485 ± 1.51
6.654LeuSer: 6.654 ± 1.543
5.703LeuThr: 5.703 ± 1.366
4.119LeuVal: 4.119 ± 1.547
0.951LeuTrp: 0.951 ± 0.562
4.436LeuTyr: 4.436 ± 1.222
0.0LeuXaa: 0.0 ± 0.0
Met
0.634MetAla: 0.634 ± 0.414
0.0MetCys: 0.0 ± 0.0
1.267MetAsp: 1.267 ± 0.619
0.951MetGlu: 0.951 ± 0.573
0.634MetPhe: 0.634 ± 0.477
0.634MetGly: 0.634 ± 0.446
0.0MetHis: 0.0 ± 0.0
0.951MetIle: 0.951 ± 0.541
2.535MetLys: 2.535 ± 0.724
2.852MetLeu: 2.852 ± 0.815
0.317MetMet: 0.317 ± 0.258
2.535MetAsn: 2.535 ± 0.845
0.634MetPro: 0.634 ± 0.43
0.317MetGln: 0.317 ± 0.257
1.267MetArg: 1.267 ± 0.483
0.317MetSer: 0.317 ± 0.257
1.267MetThr: 1.267 ± 0.844
1.901MetVal: 1.901 ± 0.817
0.317MetTrp: 0.317 ± 0.353
0.317MetTyr: 0.317 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
3.169AsnAla: 3.169 ± 0.934
0.317AsnCys: 0.317 ± 0.258
2.218AsnAsp: 2.218 ± 0.396
3.169AsnGlu: 3.169 ± 0.886
1.901AsnPhe: 1.901 ± 0.741
4.436AsnGly: 4.436 ± 1.335
0.634AsnHis: 0.634 ± 0.372
4.753AsnIle: 4.753 ± 1.451
7.605AsnLys: 7.605 ± 1.353
6.654AsnLeu: 6.654 ± 1.317
0.634AsnMet: 0.634 ± 0.425
4.436AsnAsn: 4.436 ± 1.265
3.169AsnPro: 3.169 ± 0.747
2.535AsnGln: 2.535 ± 0.913
4.436AsnArg: 4.436 ± 1.251
3.802AsnSer: 3.802 ± 1.397
4.119AsnThr: 4.119 ± 1.239
1.584AsnVal: 1.584 ± 0.758
0.634AsnTrp: 0.634 ± 0.452
3.169AsnTyr: 3.169 ± 1.068
0.0AsnXaa: 0.0 ± 0.0
Pro
0.317ProAla: 0.317 ± 0.258
0.0ProCys: 0.0 ± 0.0
2.852ProAsp: 2.852 ± 1.314
2.535ProGlu: 2.535 ± 0.687
2.852ProPhe: 2.852 ± 0.79
0.951ProGly: 0.951 ± 0.401
0.634ProHis: 0.634 ± 0.329
2.535ProIle: 2.535 ± 0.872
3.802ProLys: 3.802 ± 1.059
2.218ProLeu: 2.218 ± 0.694
0.951ProMet: 0.951 ± 0.453
2.218ProAsn: 2.218 ± 0.772
1.584ProPro: 1.584 ± 0.645
0.0ProGln: 0.0 ± 0.0
2.218ProArg: 2.218 ± 0.663
1.584ProSer: 1.584 ± 0.451
1.584ProThr: 1.584 ± 0.46
2.218ProVal: 2.218 ± 0.708
0.0ProTrp: 0.0 ± 0.0
1.901ProTyr: 1.901 ± 0.635
0.0ProXaa: 0.0 ± 0.0
Gln
2.535GlnAla: 2.535 ± 0.847
0.317GlnCys: 0.317 ± 0.309
0.634GlnAsp: 0.634 ± 0.445
3.169GlnGlu: 3.169 ± 0.786
1.901GlnPhe: 1.901 ± 0.621
1.901GlnGly: 1.901 ± 0.783
0.951GlnHis: 0.951 ± 0.551
0.951GlnIle: 0.951 ± 0.506
3.485GlnLys: 3.485 ± 0.873
4.436GlnLeu: 4.436 ± 1.119
0.0GlnMet: 0.0 ± 0.0
0.951GlnAsn: 0.951 ± 0.682
0.951GlnPro: 0.951 ± 0.47
2.218GlnGln: 2.218 ± 0.623
1.584GlnArg: 1.584 ± 0.514
3.485GlnSer: 3.485 ± 0.982
2.218GlnThr: 2.218 ± 0.879
2.218GlnVal: 2.218 ± 0.541
0.317GlnTrp: 0.317 ± 0.257
0.634GlnTyr: 0.634 ± 0.384
0.0GlnXaa: 0.0 ± 0.0
Arg
2.218ArgAla: 2.218 ± 0.954
0.317ArgCys: 0.317 ± 0.292
2.535ArgAsp: 2.535 ± 0.555
2.218ArgGlu: 2.218 ± 0.793
3.169ArgPhe: 3.169 ± 0.89
2.218ArgGly: 2.218 ± 1.044
2.218ArgHis: 2.218 ± 1.072
1.584ArgIle: 1.584 ± 0.746
6.971ArgLys: 6.971 ± 1.322
5.703ArgLeu: 5.703 ± 1.136
1.267ArgMet: 1.267 ± 0.558
3.802ArgAsn: 3.802 ± 0.791
1.584ArgPro: 1.584 ± 0.586
0.951ArgGln: 0.951 ± 0.595
1.901ArgArg: 1.901 ± 0.775
0.951ArgSer: 0.951 ± 0.421
3.485ArgThr: 3.485 ± 0.767
2.218ArgVal: 2.218 ± 0.955
0.634ArgTrp: 0.634 ± 0.473
3.169ArgTyr: 3.169 ± 1.146
0.0ArgXaa: 0.0 ± 0.0
Ser
1.901SerAla: 1.901 ± 0.628
0.317SerCys: 0.317 ± 0.292
3.802SerAsp: 3.802 ± 0.919
2.218SerGlu: 2.218 ± 0.944
3.802SerPhe: 3.802 ± 0.911
2.218SerGly: 2.218 ± 0.656
0.634SerHis: 0.634 ± 0.343
4.436SerIle: 4.436 ± 0.915
6.654SerLys: 6.654 ± 1.047
5.703SerLeu: 5.703 ± 1.088
2.852SerMet: 2.852 ± 0.945
2.218SerAsn: 2.218 ± 1.0
0.951SerPro: 0.951 ± 0.65
2.535SerGln: 2.535 ± 0.972
3.802SerArg: 3.802 ± 1.012
2.535SerSer: 2.535 ± 0.912
3.485SerThr: 3.485 ± 1.032
3.802SerVal: 3.802 ± 0.88
0.951SerTrp: 0.951 ± 0.523
3.485SerTyr: 3.485 ± 0.747
0.0SerXaa: 0.0 ± 0.0
Thr
3.802ThrAla: 3.802 ± 1.327
0.951ThrCys: 0.951 ± 0.516
2.535ThrAsp: 2.535 ± 0.93
3.485ThrGlu: 3.485 ± 1.226
1.901ThrPhe: 1.901 ± 0.758
5.07ThrGly: 5.07 ± 0.834
1.267ThrHis: 1.267 ± 0.66
3.169ThrIle: 3.169 ± 0.962
4.753ThrLys: 4.753 ± 1.688
6.02ThrLeu: 6.02 ± 1.151
2.218ThrMet: 2.218 ± 0.893
3.802ThrAsn: 3.802 ± 1.11
1.267ThrPro: 1.267 ± 0.709
2.852ThrGln: 2.852 ± 1.157
3.802ThrArg: 3.802 ± 1.093
4.436ThrSer: 4.436 ± 1.453
2.852ThrThr: 2.852 ± 0.697
4.753ThrVal: 4.753 ± 1.028
0.634ThrTrp: 0.634 ± 0.473
2.218ThrTyr: 2.218 ± 0.861
0.0ThrXaa: 0.0 ± 0.0
Val
4.753ValAla: 4.753 ± 1.067
0.951ValCys: 0.951 ± 0.635
2.852ValAsp: 2.852 ± 0.9
2.852ValGlu: 2.852 ± 1.027
1.267ValPhe: 1.267 ± 0.548
2.852ValGly: 2.852 ± 0.723
0.317ValHis: 0.317 ± 0.258
3.169ValIle: 3.169 ± 1.263
3.802ValLys: 3.802 ± 1.228
5.07ValLeu: 5.07 ± 1.089
1.267ValMet: 1.267 ± 0.476
3.169ValAsn: 3.169 ± 0.973
2.218ValPro: 2.218 ± 0.807
2.218ValGln: 2.218 ± 0.869
1.267ValArg: 1.267 ± 0.501
3.169ValSer: 3.169 ± 0.804
5.387ValThr: 5.387 ± 1.479
4.436ValVal: 4.436 ± 0.959
0.317ValTrp: 0.317 ± 0.257
2.218ValTyr: 2.218 ± 0.946
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.317TrpAsp: 0.317 ± 0.258
1.267TrpGlu: 1.267 ± 0.534
0.317TrpPhe: 0.317 ± 0.257
0.0TrpGly: 0.0 ± 0.0
0.317TrpHis: 0.317 ± 0.309
0.0TrpIle: 0.0 ± 0.0
0.317TrpLys: 0.317 ± 0.258
1.584TrpLeu: 1.584 ± 0.636
0.317TrpMet: 0.317 ± 0.353
0.317TrpAsn: 0.317 ± 0.309
0.0TrpPro: 0.0 ± 0.0
0.317TrpGln: 0.317 ± 0.257
0.634TrpArg: 0.634 ± 0.346
0.634TrpSer: 0.634 ± 0.383
0.0TrpThr: 0.0 ± 0.0
1.267TrpVal: 1.267 ± 0.491
0.317TrpTrp: 0.317 ± 0.258
0.317TrpTyr: 0.317 ± 0.257
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.634TyrAla: 0.634 ± 0.387
0.317TyrCys: 0.317 ± 0.313
2.852TyrAsp: 2.852 ± 0.868
2.218TyrGlu: 2.218 ± 0.78
3.802TyrPhe: 3.802 ± 1.22
1.901TyrGly: 1.901 ± 0.674
1.267TyrHis: 1.267 ± 0.55
2.535TyrIle: 2.535 ± 0.703
5.703TyrLys: 5.703 ± 1.664
5.387TyrLeu: 5.387 ± 1.058
0.0TyrMet: 0.0 ± 0.0
2.535TyrAsn: 2.535 ± 0.905
1.901TyrPro: 1.901 ± 0.789
2.218TyrGln: 2.218 ± 1.124
3.485TyrArg: 3.485 ± 0.983
3.802TyrSer: 3.802 ± 0.68
2.535TyrThr: 2.535 ± 0.784
1.267TyrVal: 1.267 ± 0.517
0.634TyrTrp: 0.634 ± 0.585
3.802TyrTyr: 3.802 ± 1.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (3157 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski