Amino acid dipeptide frequency for Streptococcus satellite phage Javan627

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
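The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify` are hypothetical, the denominator is assumed to be the total number of overlapping residue pairs, and the over/under threshold is assumed to be the uniform expectation of 2.5‰ mentioned in the text.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation for 400 dipeptides, in per mille (0.25% = 2.5 per mille).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: the denominator is the total number of overlapping pairs,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard symbol to X before counting.
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"


# Toy input, not the Javan627 proteome: pairs are MK, KK, KL, MK, KL.
toy = ["MKKL", "MKL"]
freqs = dipeptide_per_mille(toy)
print(freqs["MK"], classify(freqs["MK"]))  # 400.0 over
```

With real data, the input would be the list of protein sequences of the proteome, for example read from a FASTA file.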

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.293 ± 0.31
AlaCys: 0.88 ± 0.389
AlaAsp: 2.933 ± 0.89
AlaGlu: 4.692 ± 1.609
AlaPhe: 1.466 ± 0.573
AlaGly: 2.933 ± 0.884
AlaHis: 0.0 ± 0.0
AlaIle: 3.812 ± 1.076
AlaLys: 5.865 ± 1.83
AlaLeu: 4.692 ± 1.34
AlaMet: 1.173 ± 0.499
AlaAsn: 3.519 ± 1.129
AlaPro: 0.587 ± 0.469
AlaGln: 2.346 ± 0.69
AlaArg: 1.76 ± 0.735
AlaSer: 2.933 ± 0.832
AlaThr: 2.639 ± 0.752
AlaVal: 2.053 ± 0.54
AlaTrp: 0.587 ± 0.46
AlaTyr: 1.76 ± 0.721
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.293 ± 0.278
CysAsp: 0.0 ± 0.0
CysGlu: 0.293 ± 0.278
CysPhe: 0.587 ± 0.314
CysGly: 0.0 ± 0.0
CysHis: 0.293 ± 0.3
CysIle: 0.88 ± 0.69
CysLys: 0.587 ± 0.435
CysLeu: 1.466 ± 0.708
CysMet: 0.88 ± 0.501
CysAsn: 0.587 ± 0.378
CysPro: 0.0 ± 0.0
CysGln: 0.587 ± 0.327
CysArg: 0.587 ± 0.315
CysSer: 0.293 ± 0.336
CysThr: 0.0 ± 0.0
CysVal: 0.293 ± 0.23
CysTrp: 0.0 ± 0.0
CysTyr: 0.293 ± 0.23
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.76 ± 0.887
AspCys: 0.293 ± 0.278
AspAsp: 5.279 ± 1.477
AspGlu: 5.279 ± 1.503
AspPhe: 2.346 ± 0.676
AspGly: 1.466 ± 0.678
AspHis: 0.587 ± 0.372
AspIle: 5.865 ± 1.525
AspLys: 7.625 ± 2.03
AspLeu: 5.572 ± 1.477
AspMet: 1.466 ± 0.669
AspAsn: 5.279 ± 1.473
AspPro: 1.466 ± 0.54
AspGln: 1.466 ± 0.643
AspArg: 1.466 ± 0.493
AspSer: 3.812 ± 1.054
AspThr: 2.053 ± 0.499
AspVal: 4.106 ± 1.301
AspTrp: 0.587 ± 0.39
AspTyr: 3.226 ± 0.659
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.639 ± 0.969
GluCys: 0.88 ± 0.484
GluAsp: 5.865 ± 1.38
GluGlu: 6.452 ± 1.729
GluPhe: 4.399 ± 1.245
GluGly: 1.173 ± 0.482
GluHis: 0.587 ± 0.308
GluIle: 6.452 ± 1.494
GluLys: 8.504 ± 2.182
GluLeu: 12.023 ± 1.772
GluMet: 2.053 ± 0.757
GluAsn: 6.158 ± 1.579
GluPro: 2.053 ± 0.639
GluGln: 2.346 ± 0.851
GluArg: 4.399 ± 1.596
GluSer: 4.692 ± 0.896
GluThr: 4.399 ± 0.925
GluVal: 5.865 ± 1.112
GluTrp: 0.587 ± 0.454
GluTyr: 3.812 ± 0.821
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.76 ± 0.562
PheCys: 0.293 ± 0.278
PheAsp: 4.106 ± 0.9
PheGlu: 5.279 ± 1.181
PhePhe: 3.226 ± 0.792
PheGly: 3.226 ± 0.87
PheHis: 0.587 ± 0.396
PheIle: 3.519 ± 1.02
PheLys: 3.226 ± 0.873
PheLeu: 3.226 ± 0.991
PheMet: 0.293 ± 0.258
PheAsn: 1.466 ± 0.577
PhePro: 1.173 ± 0.682
PheGln: 0.293 ± 0.278
PheArg: 3.519 ± 1.372
PheSer: 2.346 ± 0.678
PheThr: 1.76 ± 0.55
PheVal: 2.933 ± 0.666
PheTrp: 0.293 ± 0.23
PheTyr: 0.88 ± 0.707
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.346 ± 0.556
GlyCys: 1.173 ± 0.477
GlyAsp: 1.76 ± 0.552
GlyGlu: 2.346 ± 0.775
GlyPhe: 1.76 ± 0.806
GlyGly: 1.466 ± 0.617
GlyHis: 0.293 ± 0.256
GlyIle: 3.519 ± 0.872
GlyLys: 2.639 ± 0.662
GlyLeu: 3.519 ± 0.829
GlyMet: 1.76 ± 0.59
GlyAsn: 2.639 ± 0.727
GlyPro: 0.0 ± 0.0
GlyGln: 0.88 ± 0.439
GlyArg: 0.587 ± 0.339
GlySer: 1.76 ± 0.585
GlyThr: 2.053 ± 0.953
GlyVal: 3.812 ± 0.884
GlyTrp: 0.587 ± 0.456
GlyTyr: 1.173 ± 0.425
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.587 ± 0.512
HisCys: 0.0 ± 0.0
HisAsp: 0.88 ± 0.415
HisGlu: 0.587 ± 0.361
HisPhe: 1.173 ± 0.635
HisGly: 0.293 ± 0.256
HisHis: 0.587 ± 0.402
HisIle: 1.466 ± 0.912
HisLys: 1.173 ± 0.491
HisLeu: 1.76 ± 0.521
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.587 ± 0.386
HisGln: 0.0 ± 0.0
HisArg: 0.88 ± 0.415
HisSer: 1.466 ± 0.651
HisThr: 1.466 ± 0.655
HisVal: 0.293 ± 0.29
HisTrp: 0.293 ± 0.321
HisTyr: 0.587 ± 0.349
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.692 ± 1.4
IleCys: 0.0 ± 0.0
IleAsp: 7.625 ± 1.509
IleGlu: 8.211 ± 1.913
IlePhe: 4.692 ± 0.943
IleGly: 2.933 ± 0.919
IleHis: 1.76 ± 0.689
IleIle: 4.692 ± 1.177
IleLys: 6.745 ± 1.139
IleLeu: 4.985 ± 1.4
IleMet: 1.173 ± 0.631
IleAsn: 6.452 ± 1.222
IlePro: 1.76 ± 0.655
IleGln: 4.399 ± 0.873
IleArg: 2.053 ± 0.559
IleSer: 7.625 ± 1.209
IleThr: 4.106 ± 1.068
IleVal: 2.639 ± 0.762
IleTrp: 0.587 ± 0.385
IleTyr: 2.933 ± 0.745
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.399 ± 1.213
LysCys: 0.293 ± 0.325
LysAsp: 2.346 ± 0.967
LysGlu: 11.73 ± 1.947
LysPhe: 2.053 ± 0.948
LysGly: 2.053 ± 0.809
LysHis: 1.76 ± 0.647
LysIle: 9.677 ± 1.527
LysLys: 9.677 ± 1.498
LysLeu: 9.677 ± 1.622
LysMet: 3.519 ± 0.78
LysAsn: 6.452 ± 1.414
LysPro: 2.346 ± 0.824
LysGln: 4.985 ± 1.07
LysArg: 6.158 ± 1.123
LysSer: 6.452 ± 1.282
LysThr: 6.452 ± 1.171
LysVal: 5.279 ± 0.919
LysTrp: 0.88 ± 0.495
LysTyr: 4.106 ± 1.179
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.038 ± 1.315
LeuCys: 0.587 ± 0.375
LeuAsp: 8.798 ± 1.497
LeuGlu: 9.091 ± 1.809
LeuPhe: 3.812 ± 1.285
LeuGly: 3.519 ± 0.775
LeuHis: 1.173 ± 0.504
LeuIle: 8.504 ± 1.516
LeuLys: 8.798 ± 1.198
LeuLeu: 11.73 ± 2.09
LeuMet: 2.053 ± 0.775
LeuAsn: 10.557 ± 1.698
LeuPro: 2.053 ± 0.659
LeuGln: 5.865 ± 1.714
LeuArg: 1.76 ± 0.662
LeuSer: 6.452 ± 1.793
LeuThr: 4.692 ± 1.254
LeuVal: 5.572 ± 1.581
LeuTrp: 0.293 ± 0.256
LeuTyr: 3.226 ± 0.744
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.88 ± 0.46
MetCys: 0.0 ± 0.0
MetAsp: 1.76 ± 0.775
MetGlu: 2.053 ± 0.986
MetPhe: 0.587 ± 0.322
MetGly: 0.293 ± 0.23
MetHis: 0.0 ± 0.0
MetIle: 1.173 ± 0.529
MetLys: 2.053 ± 0.593
MetLeu: 4.106 ± 0.902
MetMet: 0.293 ± 0.23
MetAsn: 2.053 ± 0.759
MetPro: 0.293 ± 0.336
MetGln: 1.173 ± 0.554
MetArg: 0.587 ± 0.375
MetSer: 1.466 ± 0.74
MetThr: 1.173 ± 0.571
MetVal: 1.466 ± 0.645
MetTrp: 0.0 ± 0.0
MetTyr: 1.173 ± 0.491
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.692 ± 0.989
AsnCys: 0.0 ± 0.0
AsnAsp: 2.933 ± 0.489
AsnGlu: 8.798 ± 1.477
AsnPhe: 2.933 ± 0.96
AsnGly: 4.692 ± 0.916
AsnHis: 1.173 ± 0.502
AsnIle: 4.985 ± 1.894
AsnLys: 9.677 ± 1.544
AsnLeu: 7.625 ± 2.03
AsnMet: 0.88 ± 0.655
AsnAsn: 5.865 ± 1.308
AsnPro: 1.76 ± 0.807
AsnGln: 2.346 ± 0.91
AsnArg: 2.346 ± 0.595
AsnSer: 3.226 ± 0.73
AsnThr: 2.933 ± 0.716
AsnVal: 3.812 ± 1.097
AsnTrp: 0.88 ± 0.47
AsnTyr: 3.519 ± 0.631
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.88 ± 0.495
ProCys: 0.293 ± 0.325
ProAsp: 2.053 ± 0.665
ProGlu: 1.76 ± 0.577
ProPhe: 0.88 ± 0.464
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 1.76 ± 0.608
ProLys: 1.76 ± 0.707
ProLeu: 3.226 ± 0.982
ProMet: 0.0 ± 0.0
ProAsn: 2.639 ± 0.934
ProPro: 0.88 ± 0.434
ProGln: 0.0 ± 0.0
ProArg: 1.466 ± 0.549
ProSer: 0.88 ± 0.467
ProThr: 1.76 ± 0.609
ProVal: 0.88 ± 0.537
ProTrp: 0.0 ± 0.0
ProTyr: 2.053 ± 0.783
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.399 ± 0.951
GlnCys: 0.587 ± 0.361
GlnAsp: 1.466 ± 0.767
GlnGlu: 2.933 ± 0.812
GlnPhe: 1.76 ± 0.668
GlnGly: 1.76 ± 0.656
GlnHis: 0.587 ± 0.349
GlnIle: 3.519 ± 1.047
GlnLys: 3.226 ± 0.917
GlnLeu: 3.226 ± 1.018
GlnMet: 0.587 ± 0.367
GlnAsn: 2.639 ± 0.921
GlnPro: 1.173 ± 0.69
GlnGln: 0.88 ± 0.69
GlnArg: 0.88 ± 0.382
GlnSer: 2.053 ± 0.781
GlnThr: 0.587 ± 0.363
GlnVal: 1.76 ± 0.722
GlnTrp: 0.293 ± 0.282
GlnTyr: 2.346 ± 0.618
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.933 ± 1.001
ArgCys: 0.293 ± 0.278
ArgAsp: 2.639 ± 0.698
ArgGlu: 3.226 ± 0.959
ArgPhe: 2.053 ± 0.967
ArgGly: 0.293 ± 0.23
ArgHis: 1.76 ± 0.646
ArgIle: 3.519 ± 0.91
ArgLys: 4.692 ± 1.279
ArgLeu: 4.399 ± 0.748
ArgMet: 0.88 ± 0.704
ArgAsn: 2.639 ± 0.641
ArgPro: 0.88 ± 0.488
ArgGln: 2.346 ± 0.579
ArgArg: 0.293 ± 0.23
ArgSer: 1.466 ± 0.624
ArgThr: 3.226 ± 0.795
ArgVal: 1.76 ± 0.704
ArgTrp: 0.587 ± 0.375
ArgTyr: 2.053 ± 0.562
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.639 ± 0.709
SerCys: 0.88 ± 0.42
SerAsp: 2.933 ± 0.793
SerGlu: 2.639 ± 0.695
SerPhe: 3.812 ± 0.72
SerGly: 2.346 ± 0.733
SerHis: 0.587 ± 0.314
SerIle: 6.452 ± 1.062
SerLys: 6.158 ± 1.127
SerLeu: 6.158 ± 1.727
SerMet: 1.76 ± 0.78
SerAsn: 4.399 ± 0.917
SerPro: 2.053 ± 0.814
SerGln: 2.933 ± 1.025
SerArg: 2.933 ± 0.947
SerSer: 2.933 ± 1.253
SerThr: 4.399 ± 1.362
SerVal: 3.812 ± 1.218
SerTrp: 0.587 ± 0.508
SerTyr: 3.519 ± 1.014
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.466 ± 0.552
ThrCys: 0.0 ± 0.0
ThrAsp: 2.053 ± 0.754
ThrGlu: 3.519 ± 0.838
ThrPhe: 2.053 ± 0.727
ThrGly: 2.346 ± 0.789
ThrHis: 1.466 ± 0.679
ThrIle: 2.639 ± 0.857
ThrLys: 3.812 ± 0.88
ThrLeu: 7.038 ± 1.038
ThrMet: 0.88 ± 0.467
ThrAsn: 4.692 ± 0.847
ThrPro: 2.053 ± 0.784
ThrGln: 1.76 ± 0.614
ThrArg: 2.933 ± 0.726
ThrSer: 2.933 ± 1.378
ThrThr: 1.76 ± 0.513
ThrVal: 2.346 ± 1.328
ThrTrp: 0.88 ± 0.6
ThrTyr: 2.346 ± 0.548
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.933 ± 0.815
ValCys: 0.293 ± 0.23
ValAsp: 2.639 ± 0.932
ValGlu: 3.812 ± 0.867
ValPhe: 1.76 ± 0.656
ValGly: 2.053 ± 0.699
ValHis: 0.587 ± 0.315
ValIle: 4.692 ± 1.048
ValLys: 7.331 ± 1.171
ValLeu: 7.625 ± 1.377
ValMet: 1.173 ± 0.729
ValAsn: 3.226 ± 0.599
ValPro: 1.466 ± 0.578
ValGln: 0.88 ± 0.421
ValArg: 1.76 ± 0.703
ValSer: 5.572 ± 1.012
ValThr: 2.639 ± 0.679
ValVal: 2.639 ± 0.943
ValTrp: 0.293 ± 0.23
ValTyr: 1.76 ± 0.719
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.293 ± 0.256
TrpCys: 0.293 ± 0.278
TrpAsp: 0.88 ± 0.644
TrpGlu: 1.173 ± 0.473
TrpPhe: 0.293 ± 0.3
TrpGly: 0.587 ± 0.315
TrpHis: 0.0 ± 0.0
TrpIle: 0.587 ± 0.415
TrpLys: 0.88 ± 0.447
TrpLeu: 0.293 ± 0.329
TrpMet: 0.0 ± 0.0
TrpAsn: 0.587 ± 0.369
TrpPro: 0.293 ± 0.3
TrpGln: 0.293 ± 0.28
TrpArg: 0.587 ± 0.339
TrpSer: 0.88 ± 0.42
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.587 ± 0.366
TrpTyr: 0.587 ± 0.315
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.587 ± 0.434
TyrCys: 0.587 ± 0.395
TyrAsp: 2.053 ± 0.723
TyrGlu: 1.466 ± 0.49
TyrPhe: 1.76 ± 0.823
TyrGly: 2.639 ± 0.808
TyrHis: 0.293 ± 0.29
TyrIle: 2.346 ± 0.607
TyrLys: 5.865 ± 1.398
TyrLeu: 3.226 ± 1.051
TyrMet: 1.466 ± 0.476
TyrAsn: 3.226 ± 0.975
TyrPro: 0.293 ± 0.23
TyrGln: 0.88 ± 0.537
TyrArg: 4.985 ± 1.314
TyrSer: 4.692 ± 1.346
TyrThr: 0.88 ± 0.494
TyrVal: 3.812 ± 0.812
TyrTrp: 0.293 ± 0.321
TyrTyr: 1.466 ± 0.505
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (3411 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
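To illustrate what a protein-level bootstrap means, here is a minimal Python sketch. It rests on several assumptions that the page does not confirm: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the reported error is the standard deviation of those estimates. The names `count_pair` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumptions: resampling unit is the whole protein, and the error is the
    sample standard deviation across resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy input, not the Javan627 proteome.
toy = ["MKKL", "AAAA", "KKKK"]
print(round(bootstrap_error(toy, "KK"), 3))
```

Resampling whole proteins rather than individual residues keeps the pairs within each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.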

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski