Amino acid dipepetide frequency for Streptococcus satellite phage Javan421

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.47AlaAla: 3.47 ± 0.818
1.068AlaCys: 1.068 ± 0.532
4.004AlaAsp: 4.004 ± 1.042
4.805AlaGlu: 4.805 ± 1.631
1.602AlaPhe: 1.602 ± 0.639
3.47AlaGly: 3.47 ± 1.14
0.534AlaHis: 0.534 ± 0.323
6.407AlaIle: 6.407 ± 1.14
3.737AlaLys: 3.737 ± 0.683
4.004AlaLeu: 4.004 ± 0.7
0.801AlaMet: 0.801 ± 0.661
4.004AlaAsn: 4.004 ± 1.191
0.801AlaPro: 0.801 ± 0.6
2.67AlaGln: 2.67 ± 0.653
1.869AlaArg: 1.869 ± 0.78
2.67AlaSer: 2.67 ± 0.698
3.47AlaThr: 3.47 ± 1.116
2.936AlaVal: 2.936 ± 0.982
1.068AlaTrp: 1.068 ± 0.453
3.737AlaTyr: 3.737 ± 0.919
0.0AlaXaa: 0.0 ± 0.0
Cys
0.267CysAla: 0.267 ± 0.253
0.0CysCys: 0.0 ± 0.0
0.534CysAsp: 0.534 ± 0.391
0.267CysGlu: 0.267 ± 0.296
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.267CysLys: 0.267 ± 0.225
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.534CysAsn: 0.534 ± 0.316
0.267CysPro: 0.267 ± 0.253
0.801CysGln: 0.801 ± 0.551
0.267CysArg: 0.267 ± 0.265
0.534CysSer: 0.534 ± 0.317
0.267CysThr: 0.267 ± 0.247
0.801CysVal: 0.801 ± 0.41
0.0CysTrp: 0.0 ± 0.0
0.267CysTyr: 0.267 ± 0.291
0.0CysXaa: 0.0 ± 0.0
Asp
2.403AspAla: 2.403 ± 0.697
0.267AspCys: 0.267 ± 0.225
3.737AspAsp: 3.737 ± 1.208
3.47AspGlu: 3.47 ± 0.88
2.136AspPhe: 2.136 ± 0.7
2.403AspGly: 2.403 ± 0.841
0.0AspHis: 0.0 ± 0.0
7.475AspIle: 7.475 ± 1.57
6.407AspLys: 6.407 ± 1.267
5.072AspLeu: 5.072 ± 0.751
2.136AspMet: 2.136 ± 0.741
4.271AspAsn: 4.271 ± 0.891
1.068AspPro: 1.068 ± 0.477
1.068AspGln: 1.068 ± 0.619
1.869AspArg: 1.869 ± 0.693
4.004AspSer: 4.004 ± 1.024
1.869AspThr: 1.869 ± 0.612
3.203AspVal: 3.203 ± 0.933
0.534AspTrp: 0.534 ± 0.403
3.47AspTyr: 3.47 ± 1.169
0.0AspXaa: 0.0 ± 0.0
Glu
2.936GluAla: 2.936 ± 0.923
0.0GluCys: 0.0 ± 0.0
3.737GluAsp: 3.737 ± 1.183
6.407GluGlu: 6.407 ± 1.885
2.403GluPhe: 2.403 ± 0.941
2.936GluGly: 2.936 ± 0.879
1.335GluHis: 1.335 ± 0.561
6.941GluIle: 6.941 ± 1.329
6.941GluLys: 6.941 ± 1.063
12.547GluLeu: 12.547 ± 2.093
2.403GluMet: 2.403 ± 0.876
5.339GluAsn: 5.339 ± 0.802
2.936GluPro: 2.936 ± 0.826
3.47GluGln: 3.47 ± 0.979
4.004GluArg: 4.004 ± 0.98
2.136GluSer: 2.136 ± 0.908
4.538GluThr: 4.538 ± 0.964
5.606GluVal: 5.606 ± 1.158
1.869GluTrp: 1.869 ± 0.84
3.737GluTyr: 3.737 ± 1.309
0.0GluXaa: 0.0 ± 0.0
Phe
1.869PheAla: 1.869 ± 0.783
0.267PheCys: 0.267 ± 0.253
2.67PheAsp: 2.67 ± 0.708
1.869PheGlu: 1.869 ± 0.904
2.136PhePhe: 2.136 ± 0.757
3.203PheGly: 3.203 ± 1.155
0.801PheHis: 0.801 ± 0.341
2.67PheIle: 2.67 ± 0.908
4.004PheLys: 4.004 ± 1.23
3.203PheLeu: 3.203 ± 0.629
2.136PheMet: 2.136 ± 0.594
2.136PheAsn: 2.136 ± 0.901
1.068PhePro: 1.068 ± 0.586
1.068PheGln: 1.068 ± 0.473
1.068PheArg: 1.068 ± 0.535
3.47PheSer: 3.47 ± 0.735
1.068PheThr: 1.068 ± 0.482
2.136PheVal: 2.136 ± 0.876
0.267PheTrp: 0.267 ± 0.213
0.534PheTyr: 0.534 ± 0.405
0.0PheXaa: 0.0 ± 0.0
Gly
2.67GlyAla: 2.67 ± 0.919
0.534GlyCys: 0.534 ± 0.314
1.869GlyAsp: 1.869 ± 0.521
3.203GlyGlu: 3.203 ± 1.011
1.869GlyPhe: 1.869 ± 0.452
3.737GlyGly: 3.737 ± 1.582
1.068GlyHis: 1.068 ± 0.46
5.072GlyIle: 5.072 ± 1.443
6.407GlyLys: 6.407 ± 1.499
5.072GlyLeu: 5.072 ± 1.156
0.801GlyMet: 0.801 ± 0.419
4.004GlyAsn: 4.004 ± 0.902
0.534GlyPro: 0.534 ± 0.451
2.136GlyGln: 2.136 ± 0.763
0.801GlyArg: 0.801 ± 0.385
1.602GlySer: 1.602 ± 0.566
1.602GlyThr: 1.602 ± 0.559
4.004GlyVal: 4.004 ± 0.95
0.534GlyTrp: 0.534 ± 0.404
2.403GlyTyr: 2.403 ± 0.681
0.0GlyXaa: 0.0 ± 0.0
His
0.534HisAla: 0.534 ± 0.383
0.0HisCys: 0.0 ± 0.0
0.534HisAsp: 0.534 ± 0.329
2.136HisGlu: 2.136 ± 0.913
0.801HisPhe: 0.801 ± 0.411
1.869HisGly: 1.869 ± 0.563
0.534HisHis: 0.534 ± 0.342
0.534HisIle: 0.534 ± 0.331
0.801HisLys: 0.801 ± 0.488
2.136HisLeu: 2.136 ± 0.593
0.534HisMet: 0.534 ± 0.403
1.068HisAsn: 1.068 ± 0.461
0.0HisPro: 0.0 ± 0.0
0.267HisGln: 0.267 ± 0.292
0.801HisArg: 0.801 ± 0.432
0.534HisSer: 0.534 ± 0.316
1.335HisThr: 1.335 ± 0.864
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.068HisTyr: 1.068 ± 0.587
0.0HisXaa: 0.0 ± 0.0
Ile
4.004IleAla: 4.004 ± 1.252
0.267IleCys: 0.267 ± 0.256
6.14IleAsp: 6.14 ± 1.445
7.208IleGlu: 7.208 ± 1.385
2.936IlePhe: 2.936 ± 0.922
4.271IleGly: 4.271 ± 0.918
1.335IleHis: 1.335 ± 0.524
6.14IleIle: 6.14 ± 1.652
5.873IleLys: 5.873 ± 1.517
9.343IleLeu: 9.343 ± 1.758
1.602IleMet: 1.602 ± 0.581
5.072IleAsn: 5.072 ± 0.919
2.403IlePro: 2.403 ± 0.667
2.936IleGln: 2.936 ± 0.768
3.203IleArg: 3.203 ± 0.891
9.076IleSer: 9.076 ± 2.52
3.47IleThr: 3.47 ± 0.61
4.271IleVal: 4.271 ± 1.023
0.801IleTrp: 0.801 ± 0.392
2.136IleTyr: 2.136 ± 0.771
0.0IleXaa: 0.0 ± 0.0
Lys
6.941LysAla: 6.941 ± 1.847
0.534LysCys: 0.534 ± 0.339
4.004LysAsp: 4.004 ± 1.131
9.343LysGlu: 9.343 ± 1.44
2.136LysPhe: 2.136 ± 0.704
3.47LysGly: 3.47 ± 0.894
2.403LysHis: 2.403 ± 0.613
6.941LysIle: 6.941 ± 1.796
7.742LysLys: 7.742 ± 1.6
9.076LysLeu: 9.076 ± 1.938
1.602LysMet: 1.602 ± 0.638
6.407LysAsn: 6.407 ± 0.918
2.136LysPro: 2.136 ± 0.74
5.072LysGln: 5.072 ± 1.089
2.936LysArg: 2.936 ± 0.771
4.538LysSer: 4.538 ± 1.212
5.606LysThr: 5.606 ± 1.056
3.47LysVal: 3.47 ± 0.954
1.335LysTrp: 1.335 ± 0.774
3.203LysTyr: 3.203 ± 0.897
0.0LysXaa: 0.0 ± 0.0
Leu
8.809LeuAla: 8.809 ± 1.278
0.267LeuCys: 0.267 ± 0.247
7.475LeuAsp: 7.475 ± 1.336
10.411LeuGlu: 10.411 ± 2.004
4.004LeuPhe: 4.004 ± 0.666
5.339LeuGly: 5.339 ± 1.661
1.335LeuHis: 1.335 ± 0.672
5.339LeuIle: 5.339 ± 1.612
8.009LeuLys: 8.009 ± 1.309
11.746LeuLeu: 11.746 ± 1.425
1.335LeuMet: 1.335 ± 0.891
5.072LeuAsn: 5.072 ± 0.906
2.403LeuPro: 2.403 ± 0.547
4.004LeuGln: 4.004 ± 0.81
2.936LeuArg: 2.936 ± 0.834
7.208LeuSer: 7.208 ± 1.141
6.674LeuThr: 6.674 ± 0.851
5.072LeuVal: 5.072 ± 1.059
0.801LeuTrp: 0.801 ± 0.435
4.805LeuTyr: 4.805 ± 1.031
0.0LeuXaa: 0.0 ± 0.0
Met
2.136MetAla: 2.136 ± 0.771
0.0MetCys: 0.0 ± 0.0
2.67MetAsp: 2.67 ± 0.926
2.67MetGlu: 2.67 ± 0.943
0.534MetPhe: 0.534 ± 0.415
0.534MetGly: 0.534 ± 0.391
0.267MetHis: 0.267 ± 0.272
1.869MetIle: 1.869 ± 0.703
1.869MetLys: 1.869 ± 0.835
2.67MetLeu: 2.67 ± 0.885
2.403MetMet: 2.403 ± 0.845
2.136MetAsn: 2.136 ± 0.675
0.267MetPro: 0.267 ± 0.323
1.335MetGln: 1.335 ± 0.759
1.869MetArg: 1.869 ± 0.587
1.068MetSer: 1.068 ± 0.782
2.136MetThr: 2.136 ± 0.557
0.534MetVal: 0.534 ± 0.396
0.0MetTrp: 0.0 ± 0.0
0.267MetTyr: 0.267 ± 0.256
0.0MetXaa: 0.0 ± 0.0
Asn
4.004AsnAla: 4.004 ± 1.067
0.534AsnCys: 0.534 ± 0.385
4.538AsnAsp: 4.538 ± 0.946
3.737AsnGlu: 3.737 ± 0.849
2.136AsnPhe: 2.136 ± 0.709
4.004AsnGly: 4.004 ± 1.078
0.267AsnHis: 0.267 ± 0.294
4.004AsnIle: 4.004 ± 1.514
6.14AsnLys: 6.14 ± 1.302
5.873AsnLeu: 5.873 ± 1.323
2.403AsnMet: 2.403 ± 0.704
1.335AsnAsn: 1.335 ± 0.671
1.869AsnPro: 1.869 ± 0.657
3.203AsnGln: 3.203 ± 0.638
2.936AsnArg: 2.936 ± 0.818
2.936AsnSer: 2.936 ± 0.745
3.47AsnThr: 3.47 ± 0.844
2.403AsnVal: 2.403 ± 0.754
1.068AsnTrp: 1.068 ± 0.6
2.936AsnTyr: 2.936 ± 0.818
0.0AsnXaa: 0.0 ± 0.0
Pro
1.602ProAla: 1.602 ± 0.772
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
3.203ProGlu: 3.203 ± 0.982
2.136ProPhe: 2.136 ± 0.701
0.267ProGly: 0.267 ± 0.247
0.0ProHis: 0.0 ± 0.0
2.67ProIle: 2.67 ± 0.705
2.403ProLys: 2.403 ± 0.769
1.869ProLeu: 1.869 ± 0.681
0.534ProMet: 0.534 ± 0.408
2.403ProAsn: 2.403 ± 0.737
0.801ProPro: 0.801 ± 0.492
0.801ProGln: 0.801 ± 0.481
1.335ProArg: 1.335 ± 0.522
1.068ProSer: 1.068 ± 0.438
2.403ProThr: 2.403 ± 0.727
1.335ProVal: 1.335 ± 0.496
0.267ProTrp: 0.267 ± 0.225
0.534ProTyr: 0.534 ± 0.4
0.0ProXaa: 0.0 ± 0.0
Gln
2.67GlnAla: 2.67 ± 0.629
0.0GlnCys: 0.0 ± 0.0
1.602GlnAsp: 1.602 ± 0.666
4.004GlnGlu: 4.004 ± 1.207
1.335GlnPhe: 1.335 ± 0.519
1.869GlnGly: 1.869 ± 0.568
0.534GlnHis: 0.534 ± 0.448
3.47GlnIle: 3.47 ± 0.901
4.805GlnLys: 4.805 ± 0.915
3.737GlnLeu: 3.737 ± 1.187
0.801GlnMet: 0.801 ± 0.415
2.136GlnAsn: 2.136 ± 0.909
0.801GlnPro: 0.801 ± 0.4
4.538GlnGln: 4.538 ± 1.113
1.869GlnArg: 1.869 ± 0.774
4.538GlnSer: 4.538 ± 0.841
1.602GlnThr: 1.602 ± 0.529
1.869GlnVal: 1.869 ± 0.895
0.0GlnTrp: 0.0 ± 0.0
1.068GlnTyr: 1.068 ± 0.619
0.0GlnXaa: 0.0 ± 0.0
Arg
1.335ArgAla: 1.335 ± 0.564
0.267ArgCys: 0.267 ± 0.284
2.403ArgAsp: 2.403 ± 0.707
4.004ArgGlu: 4.004 ± 1.282
1.068ArgPhe: 1.068 ± 0.522
1.068ArgGly: 1.068 ± 0.55
0.801ArgHis: 0.801 ± 0.364
3.47ArgIle: 3.47 ± 1.108
4.805ArgLys: 4.805 ± 1.063
3.737ArgLeu: 3.737 ± 0.847
2.936ArgMet: 2.936 ± 0.795
1.869ArgAsn: 1.869 ± 0.706
0.801ArgPro: 0.801 ± 0.42
1.602ArgGln: 1.602 ± 0.662
1.335ArgArg: 1.335 ± 0.71
0.801ArgSer: 0.801 ± 0.428
1.602ArgThr: 1.602 ± 0.545
3.203ArgVal: 3.203 ± 0.97
0.534ArgTrp: 0.534 ± 0.344
1.335ArgTyr: 1.335 ± 0.64
0.0ArgXaa: 0.0 ± 0.0
Ser
2.936SerAla: 2.936 ± 0.78
0.267SerCys: 0.267 ± 0.291
4.271SerAsp: 4.271 ± 0.866
5.606SerGlu: 5.606 ± 1.521
2.403SerPhe: 2.403 ± 0.77
1.869SerGly: 1.869 ± 0.683
0.534SerHis: 0.534 ± 0.372
4.271SerIle: 4.271 ± 1.244
6.407SerLys: 6.407 ± 1.125
6.407SerLeu: 6.407 ± 1.282
1.602SerMet: 1.602 ± 0.553
4.004SerAsn: 4.004 ± 0.829
2.936SerPro: 2.936 ± 0.897
2.136SerGln: 2.136 ± 0.796
1.068SerArg: 1.068 ± 0.56
4.004SerSer: 4.004 ± 0.729
2.936SerThr: 2.936 ± 0.971
4.004SerVal: 4.004 ± 1.008
0.267SerTrp: 0.267 ± 0.265
3.47SerTyr: 3.47 ± 0.742
0.0SerXaa: 0.0 ± 0.0
Thr
2.67ThrAla: 2.67 ± 0.906
0.267ThrCys: 0.267 ± 0.253
2.67ThrAsp: 2.67 ± 0.873
3.737ThrGlu: 3.737 ± 0.854
1.068ThrPhe: 1.068 ± 0.433
4.271ThrGly: 4.271 ± 1.131
1.068ThrHis: 1.068 ± 0.492
5.072ThrIle: 5.072 ± 0.904
3.47ThrLys: 3.47 ± 0.807
5.873ThrLeu: 5.873 ± 0.986
0.534ThrMet: 0.534 ± 0.367
3.203ThrAsn: 3.203 ± 0.983
1.869ThrPro: 1.869 ± 0.51
2.403ThrGln: 2.403 ± 0.956
2.67ThrArg: 2.67 ± 0.779
2.136ThrSer: 2.136 ± 0.641
5.072ThrThr: 5.072 ± 1.356
4.538ThrVal: 4.538 ± 1.26
0.267ThrTrp: 0.267 ± 0.284
2.136ThrTyr: 2.136 ± 0.884
0.0ThrXaa: 0.0 ± 0.0
Val
2.936ValAla: 2.936 ± 1.038
0.0ValCys: 0.0 ± 0.0
2.403ValAsp: 2.403 ± 0.617
2.67ValGlu: 2.67 ± 0.821
2.136ValPhe: 2.136 ± 1.113
3.47ValGly: 3.47 ± 0.966
0.534ValHis: 0.534 ± 0.326
6.14ValIle: 6.14 ± 1.178
3.737ValLys: 3.737 ± 1.527
5.339ValLeu: 5.339 ± 1.056
0.801ValMet: 0.801 ± 0.477
2.936ValAsn: 2.936 ± 0.686
2.136ValPro: 2.136 ± 0.657
1.068ValGln: 1.068 ± 0.606
2.67ValArg: 2.67 ± 0.735
5.072ValSer: 5.072 ± 1.546
3.203ValThr: 3.203 ± 1.187
4.538ValVal: 4.538 ± 1.078
1.068ValTrp: 1.068 ± 0.482
3.47ValTyr: 3.47 ± 1.499
0.0ValXaa: 0.0 ± 0.0
Trp
0.267TrpAla: 0.267 ± 0.265
0.0TrpCys: 0.0 ± 0.0
0.267TrpAsp: 0.267 ± 0.265
1.602TrpGlu: 1.602 ± 0.503
0.534TrpPhe: 0.534 ± 0.432
0.267TrpGly: 0.267 ± 0.284
0.534TrpHis: 0.534 ± 0.397
1.068TrpIle: 1.068 ± 0.472
1.602TrpLys: 1.602 ± 0.63
1.602TrpLeu: 1.602 ± 0.646
0.534TrpMet: 0.534 ± 0.433
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.534TrpGln: 0.534 ± 0.402
1.068TrpArg: 1.068 ± 0.59
0.534TrpSer: 0.534 ± 0.314
0.534TrpThr: 0.534 ± 0.338
0.0TrpVal: 0.0 ± 0.0
0.267TrpTrp: 0.267 ± 0.265
0.267TrpTyr: 0.267 ± 0.284
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.67TyrAla: 2.67 ± 1.181
0.534TyrCys: 0.534 ± 0.451
1.335TyrAsp: 1.335 ± 0.766
1.869TyrGlu: 1.869 ± 0.926
4.271TyrPhe: 4.271 ± 0.931
1.602TyrGly: 1.602 ± 0.583
1.602TyrHis: 1.602 ± 0.791
2.936TyrIle: 2.936 ± 1.118
3.47TyrLys: 3.47 ± 1.079
4.004TyrLeu: 4.004 ± 0.98
1.068TyrMet: 1.068 ± 0.561
1.869TyrAsn: 1.869 ± 0.575
0.534TyrPro: 0.534 ± 0.404
2.136TyrGln: 2.136 ± 0.796
2.403TyrArg: 2.403 ± 0.989
3.47TyrSer: 3.47 ± 0.916
2.136TyrThr: 2.136 ± 0.857
2.403TyrVal: 2.403 ± 0.863
0.534TyrTrp: 0.534 ± 0.375
1.335TyrTyr: 1.335 ± 0.545
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (3747 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski