Amino acid dipeptide frequency for Streptococcus satellite phage Javan353

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
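As a rough illustration of the calculation described above, the following sketch counts overlapping adjacent residue pairs within each protein and normalizes by the total number of pairs. The function names, the one-letter alphabet constant, and the normalization are assumptions made for illustration; the page does not state exactly how the database computes its values.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumed input encoding).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency if all 400 dipeptides were equally likely.
EXPECTED_UNIFORM = 1 / len(AMINO_ACIDS) ** 2  # 0.0025, i.e. 0.25%


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs, never spanning two proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(proteins):
    """Return a dict mapping each of the 400 standard dipeptides to its fraction."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    freqs = {}
    for a, b in product(AMINO_ACIDS, repeat=2):
        pair = a + b
        freqs[pair] = counts[pair] / total if total else 0.0
    return freqs


if __name__ == "__main__":
    # Toy sequences, not taken from the Javan353 proteome.
    toy = ["MKKL", "AKK"]
    freqs = dipeptide_frequencies(toy)
    print(freqs["KK"], EXPECTED_UNIFORM)
```

With real data, the list of sequences would come from the proteome's FASTA file; residues outside the 20-letter alphabet would need to be mapped to an extra symbol to reproduce the Xaa row and column.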

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
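The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the simple threshold at the uniform expectation (2.5‰ for 400 dipeptides) is an assumption; the page does not say which cutoff drives the red and blue marking.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, n_dipeptides=400):
    """Compare a per mille value with the uniform expectation (assumed cutoff)."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # LysLys and AlaCys values as listed in the table.
    print(per_mille_to_fraction(11.294), classify(11.294))
    print(per_mille_to_fraction(0.323), classify(0.323))
```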

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 0.0 ± 0.0 | 0.323 ± 0.32 | 2.259 ± 1.093 | 2.259 ± 0.688 | 2.259 ± 0.818 | 1.291 ± 0.515 | 0.323 ± 0.32 | 1.291 ± 0.717 | 6.131 ± 1.76 | 4.195 ± 1.458 | 1.613 ± 0.954 | 3.55 ± 0.764 | 0.645 ± 0.418 | 0.645 ± 0.386 | 2.581 ± 0.75 | 1.936 ± 0.998 | 3.55 ± 0.603 | 2.581 ± 0.861 | 0.0 ± 0.0 | 3.227 ± 1.048 | 0.0 ± 0.0
Cys | 0.645 ± 0.459 | 0.0 ± 0.0 | 0.968 ± 0.421 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.645 ± 0.509 | 0.0 ± 0.0 | 0.323 ± 0.353 | 0.323 ± 0.357 | 0.968 ± 0.563 | 0.323 ± 0.397 | 0.323 ± 0.395 | 0.645 ± 0.459 | 0.323 ± 0.287 | 0.0 ± 0.0 | 0.323 ± 0.32 | 0.645 ± 0.358 | 0.323 ± 0.296 | 0.0 ± 0.0 | 0.645 ± 0.395 | 0.0 ± 0.0
Asp | 0.323 ± 0.336 | 0.645 ± 0.519 | 4.518 ± 1.634 | 2.259 ± 0.66 | 6.131 ± 1.348 | 3.872 ± 0.919 | 0.323 ± 0.251 | 7.099 ± 1.807 | 7.099 ± 1.48 | 8.39 ± 1.489 | 1.291 ± 0.585 | 2.904 ± 0.815 | 0.323 ± 0.357 | 0.645 ± 0.383 | 1.936 ± 0.772 | 4.518 ± 1.194 | 2.904 ± 1.165 | 4.195 ± 0.86 | 0.323 ± 0.296 | 3.227 ± 1.151 | 0.0 ± 0.0
Glu | 2.904 ± 1.057 | 1.291 ± 0.538 | 4.84 ± 1.608 | 3.55 ± 1.059 | 3.872 ± 0.835 | 1.936 ± 0.991 | 0.968 ± 0.477 | 8.067 ± 1.511 | 10.003 ± 2.415 | 7.422 ± 1.573 | 2.259 ± 1.015 | 7.099 ± 1.924 | 0.645 ± 0.386 | 4.195 ± 0.872 | 3.55 ± 1.129 | 4.518 ± 1.03 | 4.195 ± 1.497 | 2.581 ± 0.95 | 0.968 ± 0.48 | 3.227 ± 0.982 | 0.0 ± 0.0
Phe | 1.291 ± 0.686 | 0.323 ± 0.296 | 3.872 ± 0.847 | 5.808 ± 1.453 | 3.55 ± 1.242 | 3.227 ± 0.751 | 0.968 ± 0.612 | 1.613 ± 0.501 | 4.195 ± 1.244 | 5.486 ± 1.177 | 0.323 ± 0.34 | 2.904 ± 0.866 | 1.291 ± 0.851 | 1.613 ± 0.914 | 1.613 ± 0.575 | 4.518 ± 1.193 | 2.259 ± 0.923 | 3.55 ± 1.066 | 0.323 ± 0.251 | 1.613 ± 0.874 | 0.0 ± 0.0
Gly | 1.613 ± 0.783 | 0.323 ± 0.251 | 1.936 ± 0.853 | 1.613 ± 0.8 | 2.259 ± 0.68 | 1.613 ± 1.148 | 0.968 ± 0.454 | 2.581 ± 0.81 | 5.486 ± 1.543 | 3.872 ± 1.169 | 0.645 ± 0.434 | 2.581 ± 0.842 | 0.0 ± 0.0 | 1.291 ± 0.498 | 0.645 ± 0.371 | 3.227 ± 1.326 | 3.55 ± 1.131 | 2.904 ± 0.84 | 0.645 ± 0.387 | 3.55 ± 1.279 | 0.0 ± 0.0
His | 2.259 ± 0.92 | 0.0 ± 0.0 | 0.968 ± 0.437 | 0.968 ± 0.557 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.323 ± 0.284 | 1.613 ± 0.616 | 1.936 ± 0.705 | 1.613 ± 0.864 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.968 ± 0.482 | 0.323 ± 0.32 | 1.613 ± 0.82 | 2.259 ± 0.827 | 0.323 ± 0.294 | 0.323 ± 0.294 | 0.968 ± 0.481 | 0.0 ± 0.0
Ile | 1.613 ± 0.648 | 0.323 ± 0.32 | 4.195 ± 1.194 | 5.808 ± 1.425 | 3.55 ± 1.133 | 2.904 ± 0.709 | 0.968 ± 0.457 | 3.872 ± 1.025 | 8.712 ± 1.271 | 9.681 ± 1.907 | 0.645 ± 0.511 | 6.776 ± 1.187 | 2.259 ± 0.606 | 2.904 ± 0.879 | 1.936 ± 0.631 | 6.131 ± 1.037 | 4.195 ± 0.99 | 2.904 ± 0.913 | 0.323 ± 0.32 | 1.936 ± 0.624 | 0.0 ± 0.0
Lys | 5.808 ± 1.674 | 0.645 ± 0.509 | 5.486 ± 1.36 | 10.971 ± 1.426 | 2.904 ± 1.127 | 5.486 ± 1.524 | 2.581 ± 0.732 | 9.681 ± 1.833 | 11.294 ± 2.414 | 7.744 ± 1.695 | 3.872 ± 1.403 | 7.099 ± 1.738 | 2.259 ± 0.783 | 5.163 ± 1.148 | 7.099 ± 1.1 | 5.486 ± 1.407 | 8.39 ± 1.587 | 5.808 ± 1.359 | 0.968 ± 0.451 | 5.163 ± 1.307 | 0.0 ± 0.0
Leu | 4.84 ± 1.551 | 1.291 ± 0.66 | 7.744 ± 1.414 | 10.326 ± 2.026 | 5.808 ± 1.658 | 6.131 ± 1.056 | 1.936 ± 0.873 | 6.454 ± 1.365 | 10.649 ± 1.492 | 6.454 ± 1.042 | 2.581 ± 0.769 | 5.163 ± 1.431 | 2.904 ± 1.095 | 3.872 ± 0.764 | 2.581 ± 0.623 | 6.776 ± 1.184 | 4.518 ± 1.257 | 6.454 ± 1.354 | 0.645 ± 0.519 | 1.936 ± 0.817 | 0.0 ± 0.0
Met | 0.323 ± 0.284 | 0.0 ± 0.0 | 3.227 ± 1.029 | 3.227 ± 1.302 | 0.323 ± 0.3 | 0.645 ± 0.515 | 0.0 ± 0.0 | 1.613 ± 0.557 | 1.291 ± 0.515 | 1.936 ± 0.69 | 0.0 ± 0.0 | 1.291 ± 0.689 | 0.645 ± 0.425 | 0.645 ± 0.459 | 1.291 ± 0.575 | 0.645 ± 0.46 | 2.581 ± 1.316 | 1.613 ± 0.49 | 0.0 ± 0.0 | 0.968 ± 0.557 | 0.0 ± 0.0
Asn | 2.581 ± 0.932 | 0.323 ± 0.251 | 2.259 ± 0.856 | 2.581 ± 1.004 | 4.84 ± 1.216 | 3.227 ± 1.054 | 1.291 ± 0.525 | 4.84 ± 1.337 | 6.776 ± 1.269 | 6.131 ± 1.345 | 0.968 ± 0.567 | 3.872 ± 0.978 | 2.904 ± 0.824 | 3.55 ± 1.102 | 1.613 ± 0.742 | 3.872 ± 1.492 | 4.518 ± 1.448 | 2.581 ± 0.679 | 0.323 ± 0.353 | 4.518 ± 0.961 | 0.0 ± 0.0
Pro | 0.323 ± 0.251 | 0.0 ± 0.0 | 1.613 ± 0.866 | 1.936 ± 0.863 | 0.968 ± 0.557 | 0.323 ± 0.353 | 0.0 ± 0.0 | 1.613 ± 0.683 | 4.84 ± 1.371 | 0.645 ± 0.435 | 0.323 ± 0.296 | 0.968 ± 0.396 | 0.645 ± 0.416 | 0.968 ± 0.751 | 0.968 ± 0.435 | 1.613 ± 0.805 | 1.291 ± 0.851 | 0.323 ± 0.251 | 0.0 ± 0.0 | 0.968 ± 0.465 | 0.0 ± 0.0
Gln | 4.518 ± 1.141 | 0.645 ± 0.469 | 2.904 ± 0.761 | 3.55 ± 0.917 | 0.968 ± 0.66 | 0.323 ± 0.397 | 1.291 ± 0.618 | 1.613 ± 0.467 | 4.195 ± 1.025 | 3.227 ± 0.854 | 0.968 ± 0.561 | 1.936 ± 0.838 | 0.0 ± 0.0 | 0.645 ± 0.386 | 3.227 ± 0.771 | 1.613 ± 0.923 | 2.904 ± 0.957 | 2.904 ± 0.848 | 0.0 ± 0.0 | 1.936 ± 0.686 | 0.0 ± 0.0
Arg | 1.613 ± 0.627 | 0.0 ± 0.0 | 2.904 ± 0.641 | 3.872 ± 0.697 | 2.581 ± 0.781 | 1.291 ± 0.699 | 0.645 ± 0.383 | 2.259 ± 0.763 | 8.067 ± 1.29 | 3.55 ± 0.797 | 0.323 ± 0.336 | 3.55 ± 0.814 | 0.323 ± 0.353 | 2.259 ± 0.804 | 2.581 ± 0.814 | 1.613 ± 0.607 | 1.936 ± 0.939 | 0.968 ± 0.422 | 0.0 ± 0.0 | 2.259 ± 0.837 | 0.0 ± 0.0
Ser | 1.291 ± 0.618 | 0.323 ± 0.353 | 3.872 ± 0.745 | 4.84 ± 1.292 | 3.55 ± 1.038 | 3.227 ± 1.093 | 1.936 ± 0.826 | 3.55 ± 1.317 | 7.422 ± 2.017 | 8.067 ± 1.716 | 1.936 ± 0.938 | 2.259 ± 0.996 | 1.613 ± 0.721 | 3.227 ± 0.934 | 1.291 ± 0.56 | 4.518 ± 1.45 | 1.936 ± 0.79 | 4.518 ± 1.122 | 0.645 ± 0.481 | 2.904 ± 0.802 | 0.0 ± 0.0
Thr | 3.227 ± 0.891 | 0.0 ± 0.0 | 2.259 ± 0.771 | 3.55 ± 1.13 | 1.936 ± 0.717 | 4.195 ± 1.061 | 0.645 ± 0.452 | 5.163 ± 1.255 | 5.163 ± 1.315 | 7.422 ± 1.132 | 1.291 ± 0.591 | 3.872 ± 0.823 | 1.291 ± 0.574 | 2.259 ± 1.045 | 4.84 ± 1.89 | 1.936 ± 0.891 | 2.904 ± 1.068 | 4.84 ± 1.407 | 0.968 ± 0.539 | 1.936 ± 1.265 | 0.0 ± 0.0
Val | 2.904 ± 0.918 | 0.323 ± 0.296 | 4.518 ± 1.383 | 5.486 ± 1.441 | 1.291 ± 0.524 | 0.323 ± 0.284 | 0.645 ± 0.573 | 4.518 ± 1.027 | 4.518 ± 1.634 | 4.84 ± 0.981 | 0.645 ± 0.499 | 3.55 ± 0.87 | 1.613 ± 0.645 | 1.936 ± 0.943 | 1.291 ± 0.607 | 5.486 ± 1.34 | 3.872 ± 1.24 | 3.227 ± 0.901 | 0.323 ± 0.33 | 2.904 ± 0.921 | 0.0 ± 0.0
Trp | 0.968 ± 0.452 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.936 ± 0.746 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.968 ± 0.543 | 0.323 ± 0.353 | 0.968 ± 0.631 | 0.645 ± 0.409 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.323 ± 0.251 | 0.0 ± 0.0 | 0.323 ± 0.251 | 0.0 ± 0.0 | 0.645 ± 0.37 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 1.613 ± 0.844 | 0.645 ± 0.641 | 2.581 ± 0.898 | 3.55 ± 1.264 | 3.55 ± 0.785 | 0.645 ± 0.371 | 0.645 ± 0.44 | 3.227 ± 0.873 | 5.486 ± 1.339 | 6.454 ± 1.114 | 1.291 ± 0.574 | 3.872 ± 0.894 | 0.323 ± 0.32 | 1.936 ± 0.551 | 3.227 ± 1.267 | 2.259 ± 1.05 | 1.291 ± 0.79 | 0.968 ± 0.563 | 0.323 ± 0.296 | 2.259 ± 0.978 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 18 proteins (3100 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
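A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. Using the sample standard deviation as the error, the fixed seed, and the function names are all assumptions; the page only states that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the Javan353 proteome.
    toy = ["AKKA", "MLLM", "KKKK"]
    print(dipeptide_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in a small proteome such as this one (18 proteins).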

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski