Amino acid dipepetide frequency for Sawgrass virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.19AlaAla: 4.19 ± 2.215
1.397AlaCys: 1.397 ± 0.536
3.352AlaAsp: 3.352 ± 0.982
4.469AlaGlu: 4.469 ± 1.474
1.397AlaPhe: 1.397 ± 0.428
3.073AlaGly: 3.073 ± 1.097
2.514AlaHis: 2.514 ± 0.421
3.911AlaIle: 3.911 ± 0.847
0.838AlaLys: 0.838 ± 0.486
8.38AlaLeu: 8.38 ± 2.345
1.676AlaMet: 1.676 ± 0.738
2.793AlaAsn: 2.793 ± 0.789
3.073AlaPro: 3.073 ± 1.712
3.073AlaGln: 3.073 ± 0.837
5.866AlaArg: 5.866 ± 0.349
4.19AlaSer: 4.19 ± 2.104
3.911AlaThr: 3.911 ± 0.485
4.469AlaVal: 4.469 ± 1.116
1.955AlaTrp: 1.955 ± 0.904
2.514AlaTyr: 2.514 ± 0.58
0.0AlaXaa: 0.0 ± 0.0
Cys
1.117CysAla: 1.117 ± 0.408
0.559CysCys: 0.559 ± 0.288
1.955CysAsp: 1.955 ± 0.455
0.559CysGlu: 0.559 ± 0.288
0.559CysPhe: 0.559 ± 0.288
2.235CysGly: 2.235 ± 0.636
0.279CysHis: 0.279 ± 0.144
1.397CysIle: 1.397 ± 0.296
0.559CysLys: 0.559 ± 0.603
1.117CysLeu: 1.117 ± 0.344
0.0CysMet: 0.0 ± 0.0
0.279CysAsn: 0.279 ± 0.618
1.397CysPro: 1.397 ± 0.81
1.397CysGln: 1.397 ± 0.629
0.559CysArg: 0.559 ± 0.266
1.117CysSer: 1.117 ± 0.408
1.117CysThr: 1.117 ± 0.301
1.117CysVal: 1.117 ± 0.394
0.559CysTrp: 0.559 ± 0.288
0.559CysTyr: 0.559 ± 0.288
0.0CysXaa: 0.0 ± 0.0
Asp
3.073AspAla: 3.073 ± 1.041
0.559AspCys: 0.559 ± 0.266
4.469AspAsp: 4.469 ± 1.552
5.587AspGlu: 5.587 ± 2.041
1.676AspPhe: 1.676 ± 0.484
2.514AspGly: 2.514 ± 0.931
1.676AspHis: 1.676 ± 0.624
2.235AspIle: 2.235 ± 0.907
3.911AspLys: 3.911 ± 0.897
8.38AspLeu: 8.38 ± 1.931
0.559AspMet: 0.559 ± 0.286
2.514AspAsn: 2.514 ± 0.753
5.587AspPro: 5.587 ± 1.212
2.235AspGln: 2.235 ± 0.748
0.838AspArg: 0.838 ± 0.433
2.793AspSer: 2.793 ± 0.879
3.073AspThr: 3.073 ± 0.446
2.793AspVal: 2.793 ± 0.835
1.117AspTrp: 1.117 ± 0.672
1.397AspTyr: 1.397 ± 0.583
0.0AspXaa: 0.0 ± 0.0
Glu
2.793GluAla: 2.793 ± 0.97
1.676GluCys: 1.676 ± 0.799
3.911GluAsp: 3.911 ± 1.063
2.235GluGlu: 2.235 ± 1.166
1.676GluPhe: 1.676 ± 0.865
6.145GluGly: 6.145 ± 0.605
2.235GluHis: 2.235 ± 0.8
1.955GluIle: 1.955 ± 0.531
2.514GluLys: 2.514 ± 0.701
6.983GluLeu: 6.983 ± 2.018
1.397GluMet: 1.397 ± 0.372
1.676GluAsn: 1.676 ± 0.565
2.514GluPro: 2.514 ± 1.11
1.397GluGln: 1.397 ± 0.721
3.073GluArg: 3.073 ± 0.919
3.911GluSer: 3.911 ± 0.576
3.352GluThr: 3.352 ± 1.187
2.235GluVal: 2.235 ± 0.532
0.559GluTrp: 0.559 ± 0.47
1.676GluTyr: 1.676 ± 0.566
0.0GluXaa: 0.0 ± 0.0
Phe
1.955PheAla: 1.955 ± 0.549
0.559PheCys: 0.559 ± 0.29
0.838PheAsp: 0.838 ± 0.741
1.676PheGlu: 1.676 ± 0.637
1.397PhePhe: 1.397 ± 0.555
1.955PheGly: 1.955 ± 0.81
0.559PheHis: 0.559 ± 0.266
1.397PheIle: 1.397 ± 0.404
3.631PheLys: 3.631 ± 1.073
3.073PheLeu: 3.073 ± 0.919
0.838PheMet: 0.838 ± 0.368
1.397PheAsn: 1.397 ± 0.553
2.793PhePro: 2.793 ± 0.647
1.397PheGln: 1.397 ± 0.51
1.676PheArg: 1.676 ± 0.865
2.793PheSer: 2.793 ± 0.944
1.676PheThr: 1.676 ± 0.356
2.514PheVal: 2.514 ± 0.567
0.559PheTrp: 0.559 ± 0.266
0.838PheTyr: 0.838 ± 0.368
0.0PheXaa: 0.0 ± 0.0
Gly
3.911GlyAla: 3.911 ± 0.177
1.955GlyCys: 1.955 ± 0.455
2.793GlyAsp: 2.793 ± 0.585
2.235GlyGlu: 2.235 ± 0.45
2.235GlyPhe: 2.235 ± 0.816
3.352GlyGly: 3.352 ± 1.207
2.514GlyHis: 2.514 ± 0.726
2.793GlyIle: 2.793 ± 0.502
3.073GlyLys: 3.073 ± 0.99
8.101GlyLeu: 8.101 ± 0.625
1.676GlyMet: 1.676 ± 0.356
1.955GlyAsn: 1.955 ± 0.531
3.073GlyPro: 3.073 ± 0.346
1.676GlyGln: 1.676 ± 0.528
4.19GlyArg: 4.19 ± 0.938
6.145GlySer: 6.145 ± 0.476
4.469GlyThr: 4.469 ± 1.233
4.19GlyVal: 4.19 ± 0.967
1.397GlyTrp: 1.397 ± 0.81
2.514GlyTyr: 2.514 ± 1.008
0.0GlyXaa: 0.0 ± 0.0
His
1.676HisAla: 1.676 ± 0.676
0.279HisCys: 0.279 ± 0.302
2.235HisAsp: 2.235 ± 0.532
1.676HisGlu: 1.676 ± 0.691
1.117HisPhe: 1.117 ± 0.607
1.676HisGly: 1.676 ± 0.565
1.117HisHis: 1.117 ± 0.408
1.117HisIle: 1.117 ± 0.537
2.514HisLys: 2.514 ± 0.56
1.117HisLeu: 1.117 ± 0.577
0.838HisMet: 0.838 ± 0.304
0.559HisAsn: 0.559 ± 0.546
2.514HisPro: 2.514 ± 1.076
1.397HisGln: 1.397 ± 0.658
2.235HisArg: 2.235 ± 0.692
1.676HisSer: 1.676 ± 0.356
1.955HisThr: 1.955 ± 0.904
1.397HisVal: 1.397 ± 0.51
0.838HisTrp: 0.838 ± 0.304
1.955HisTyr: 1.955 ± 0.678
0.0HisXaa: 0.0 ± 0.0
Ile
3.352IleAla: 3.352 ± 0.875
1.397IleCys: 1.397 ± 0.518
2.793IleAsp: 2.793 ± 0.689
2.514IleGlu: 2.514 ± 0.597
1.397IlePhe: 1.397 ± 0.553
2.793IleGly: 2.793 ± 1.056
1.397IleHis: 1.397 ± 0.296
2.793IleIle: 2.793 ± 0.605
3.911IleLys: 3.911 ± 0.75
2.514IleLeu: 2.514 ± 0.597
0.279IleMet: 0.279 ± 0.144
1.955IleAsn: 1.955 ± 0.535
4.19IlePro: 4.19 ± 1.22
1.397IleGln: 1.397 ± 0.472
4.19IleArg: 4.19 ± 1.001
3.352IleSer: 3.352 ± 1.16
4.19IleThr: 4.19 ± 1.025
1.676IleVal: 1.676 ± 0.568
0.838IleTrp: 0.838 ± 0.527
2.514IleTyr: 2.514 ± 0.629
0.0IleXaa: 0.0 ± 0.0
Lys
3.631LysAla: 3.631 ± 1.4
0.838LysCys: 0.838 ± 0.389
2.793LysAsp: 2.793 ± 0.504
3.352LysGlu: 3.352 ± 0.922
1.397LysPhe: 1.397 ± 0.555
3.073LysGly: 3.073 ± 0.645
1.117LysHis: 1.117 ± 0.435
3.631LysIle: 3.631 ± 0.821
3.073LysLys: 3.073 ± 0.497
4.749LysLeu: 4.749 ± 1.349
1.676LysMet: 1.676 ± 0.927
2.235LysAsn: 2.235 ± 0.797
1.397LysPro: 1.397 ± 0.372
0.0LysGln: 0.0 ± 0.0
3.631LysArg: 3.631 ± 0.675
3.073LysSer: 3.073 ± 0.802
3.911LysThr: 3.911 ± 1.18
4.469LysVal: 4.469 ± 0.414
1.397LysTrp: 1.397 ± 0.555
1.117LysTyr: 1.117 ± 0.577
0.0LysXaa: 0.0 ± 0.0
Leu
6.425LeuAla: 6.425 ± 1.397
1.117LeuCys: 1.117 ± 0.394
4.749LeuAsp: 4.749 ± 1.449
4.19LeuGlu: 4.19 ± 0.751
3.911LeuPhe: 3.911 ± 0.761
6.983LeuGly: 6.983 ± 1.181
1.955LeuHis: 1.955 ± 0.682
6.704LeuIle: 6.704 ± 0.727
4.469LeuLys: 4.469 ± 1.51
8.939LeuLeu: 8.939 ± 1.224
4.19LeuMet: 4.19 ± 1.896
3.073LeuAsn: 3.073 ± 1.541
5.866LeuPro: 5.866 ± 0.811
3.073LeuGln: 3.073 ± 0.777
9.497LeuArg: 9.497 ± 1.509
7.821LeuSer: 7.821 ± 0.734
6.425LeuThr: 6.425 ± 2.426
8.659LeuVal: 8.659 ± 1.007
1.117LeuTrp: 1.117 ± 0.408
4.19LeuTyr: 4.19 ± 1.681
0.0LeuXaa: 0.0 ± 0.0
Met
2.793MetAla: 2.793 ± 0.692
0.559MetCys: 0.559 ± 0.288
1.955MetAsp: 1.955 ± 0.294
1.955MetGlu: 1.955 ± 0.535
0.838MetPhe: 0.838 ± 0.663
1.955MetGly: 1.955 ± 0.542
0.559MetHis: 0.559 ± 0.266
0.279MetIle: 0.279 ± 0.144
1.117MetLys: 1.117 ± 0.503
2.514MetLeu: 2.514 ± 0.597
0.559MetMet: 0.559 ± 0.288
0.559MetAsn: 0.559 ± 0.29
0.279MetPro: 0.279 ± 0.144
0.0MetGln: 0.0 ± 0.0
1.676MetArg: 1.676 ± 0.489
0.559MetSer: 0.559 ± 0.394
1.117MetThr: 1.117 ± 0.537
0.279MetVal: 0.279 ± 0.448
1.397MetTrp: 1.397 ± 0.472
1.397MetTyr: 1.397 ± 0.51
0.0MetXaa: 0.0 ± 0.0
Asn
2.793AsnAla: 2.793 ± 0.916
0.279AsnCys: 0.279 ± 0.359
3.352AsnAsp: 3.352 ± 1.11
1.955AsnGlu: 1.955 ± 0.455
1.117AsnPhe: 1.117 ± 0.532
1.676AsnGly: 1.676 ± 1.048
0.838AsnHis: 0.838 ± 0.284
1.117AsnIle: 1.117 ± 0.577
1.955AsnLys: 1.955 ± 0.771
3.352AsnLeu: 3.352 ± 0.83
0.559AsnMet: 0.559 ± 0.29
3.352AsnAsn: 3.352 ± 1.275
2.235AsnPro: 2.235 ± 0.669
2.514AsnGln: 2.514 ± 0.982
1.117AsnArg: 1.117 ± 0.435
3.352AsnSer: 3.352 ± 1.41
2.514AsnThr: 2.514 ± 0.9
2.235AsnVal: 2.235 ± 1.704
0.838AsnTrp: 0.838 ± 0.527
0.838AsnTyr: 0.838 ± 0.433
0.0AsnXaa: 0.0 ± 0.0
Pro
5.307ProAla: 5.307 ± 2.358
1.117ProCys: 1.117 ± 0.756
4.749ProAsp: 4.749 ± 1.048
3.631ProGlu: 3.631 ± 0.804
0.838ProPhe: 0.838 ± 0.304
5.028ProGly: 5.028 ± 1.812
2.514ProHis: 2.514 ± 0.556
2.793ProIle: 2.793 ± 0.98
1.397ProLys: 1.397 ± 0.721
6.145ProLeu: 6.145 ± 0.928
0.279ProMet: 0.279 ± 0.144
1.117ProAsn: 1.117 ± 0.301
5.587ProPro: 5.587 ± 3.934
1.676ProGln: 1.676 ± 0.929
3.631ProArg: 3.631 ± 0.974
5.028ProSer: 5.028 ± 1.484
4.469ProThr: 4.469 ± 0.866
3.352ProVal: 3.352 ± 1.221
1.117ProTrp: 1.117 ± 0.532
3.352ProTyr: 3.352 ± 1.487
0.0ProXaa: 0.0 ± 0.0
Gln
3.352GlnAla: 3.352 ± 1.285
0.559GlnCys: 0.559 ± 0.288
1.676GlnAsp: 1.676 ± 0.337
1.955GlnGlu: 1.955 ± 1.059
0.838GlnPhe: 0.838 ± 0.433
3.631GlnGly: 3.631 ± 0.66
0.559GlnHis: 0.559 ± 0.288
0.838GlnIle: 0.838 ± 0.55
1.117GlnLys: 1.117 ± 0.577
4.19GlnLeu: 4.19 ± 1.025
1.117GlnMet: 1.117 ± 0.408
1.676GlnAsn: 1.676 ± 0.637
1.397GlnPro: 1.397 ± 0.404
0.559GlnGln: 0.559 ± 0.266
1.117GlnArg: 1.117 ± 0.435
1.676GlnSer: 1.676 ± 0.356
1.676GlnThr: 1.676 ± 0.799
3.073GlnVal: 3.073 ± 0.964
0.559GlnTrp: 0.559 ± 0.288
0.279GlnTyr: 0.279 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
4.19ArgAla: 4.19 ± 0.522
0.559ArgCys: 0.559 ± 0.29
2.793ArgAsp: 2.793 ± 1.105
4.19ArgGlu: 4.19 ± 1.188
2.793ArgPhe: 2.793 ± 1.105
3.073ArgGly: 3.073 ± 0.707
1.955ArgHis: 1.955 ± 0.678
2.514ArgIle: 2.514 ± 0.786
3.352ArgLys: 3.352 ± 0.585
7.542ArgLeu: 7.542 ± 0.904
1.397ArgMet: 1.397 ± 0.553
1.397ArgAsn: 1.397 ± 0.536
3.911ArgPro: 3.911 ± 1.083
3.073ArgGln: 3.073 ± 0.45
4.469ArgArg: 4.469 ± 0.181
3.631ArgSer: 3.631 ± 1.158
3.073ArgThr: 3.073 ± 1.08
5.307ArgVal: 5.307 ± 1.405
0.838ArgTrp: 0.838 ± 0.433
1.676ArgTyr: 1.676 ± 0.691
0.0ArgXaa: 0.0 ± 0.0
Ser
5.028SerAla: 5.028 ± 1.215
1.117SerCys: 1.117 ± 0.532
4.19SerAsp: 4.19 ± 0.609
3.073SerGlu: 3.073 ± 1.116
1.955SerPhe: 1.955 ± 0.84
3.631SerGly: 3.631 ± 1.504
1.676SerHis: 1.676 ± 0.45
4.469SerIle: 4.469 ± 1.197
3.911SerLys: 3.911 ± 0.659
8.101SerLeu: 8.101 ± 0.899
1.117SerMet: 1.117 ± 0.394
2.514SerAsn: 2.514 ± 0.966
3.073SerPro: 3.073 ± 2.044
1.117SerGln: 1.117 ± 0.394
3.352SerArg: 3.352 ± 1.126
4.469SerSer: 4.469 ± 1.405
5.866SerThr: 5.866 ± 1.409
6.145SerVal: 6.145 ± 1.419
1.955SerTrp: 1.955 ± 0.754
1.397SerTyr: 1.397 ± 0.472
0.0SerXaa: 0.0 ± 0.0
Thr
5.866ThrAla: 5.866 ± 1.266
0.838ThrCys: 0.838 ± 0.304
3.352ThrAsp: 3.352 ± 1.126
2.514ThrGlu: 2.514 ± 0.726
3.631ThrPhe: 3.631 ± 1.271
3.073ThrGly: 3.073 ± 0.691
2.793ThrHis: 2.793 ± 0.98
3.631ThrIle: 3.631 ± 0.824
2.793ThrLys: 2.793 ± 0.504
6.983ThrLeu: 6.983 ± 0.769
0.559ThrMet: 0.559 ± 0.288
2.514ThrAsn: 2.514 ± 0.567
6.425ThrPro: 6.425 ± 2.248
1.676ThrGln: 1.676 ± 0.659
4.19ThrArg: 4.19 ± 0.698
4.749ThrSer: 4.749 ± 0.962
3.911ThrThr: 3.911 ± 1.124
4.19ThrVal: 4.19 ± 1.5
1.117ThrTrp: 1.117 ± 0.408
1.676ThrTyr: 1.676 ± 0.489
0.0ThrXaa: 0.0 ± 0.0
Val
3.352ValAla: 3.352 ± 1.602
1.955ValCys: 1.955 ± 0.771
2.514ValAsp: 2.514 ± 0.661
2.793ValGlu: 2.793 ± 0.744
3.073ValPhe: 3.073 ± 1.272
3.352ValGly: 3.352 ± 0.789
1.397ValHis: 1.397 ± 0.472
4.749ValIle: 4.749 ± 1.606
3.631ValLys: 3.631 ± 0.635
5.028ValLeu: 5.028 ± 1.178
1.955ValMet: 1.955 ± 0.921
2.514ValAsn: 2.514 ± 0.589
4.469ValPro: 4.469 ± 0.838
1.676ValGln: 1.676 ± 0.568
2.793ValArg: 2.793 ± 0.757
5.028ValSer: 5.028 ± 0.827
5.307ValThr: 5.307 ± 1.564
5.028ValVal: 5.028 ± 0.844
1.117ValTrp: 1.117 ± 0.394
3.352ValTyr: 3.352 ± 0.688
0.0ValXaa: 0.0 ± 0.0
Trp
1.117TrpAla: 1.117 ± 0.394
0.0TrpCys: 0.0 ± 0.0
1.397TrpAsp: 1.397 ± 0.372
1.676TrpGlu: 1.676 ± 0.637
0.279TrpPhe: 0.279 ± 0.359
1.955TrpGly: 1.955 ± 1.073
1.117TrpHis: 1.117 ± 0.503
0.279TrpIle: 0.279 ± 0.618
0.559TrpLys: 0.559 ± 0.29
2.235TrpLeu: 2.235 ± 0.784
0.838TrpMet: 0.838 ± 0.816
2.235TrpAsn: 2.235 ± 0.864
0.559TrpPro: 0.559 ± 0.556
0.838TrpGln: 0.838 ± 0.304
0.838TrpArg: 0.838 ± 0.55
0.838TrpSer: 0.838 ± 0.433
1.117TrpThr: 1.117 ± 0.585
0.559TrpVal: 0.559 ± 0.288
0.279TrpTrp: 0.279 ± 0.144
1.397TrpTyr: 1.397 ± 0.404
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.117TyrAla: 1.117 ± 0.394
1.117TyrCys: 1.117 ± 0.394
1.397TyrAsp: 1.397 ± 0.565
1.676TyrGlu: 1.676 ± 0.356
1.397TyrPhe: 1.397 ± 0.536
3.073TyrGly: 3.073 ± 0.707
1.117TyrHis: 1.117 ± 0.759
0.838TyrIle: 0.838 ± 0.304
2.235TyrLys: 2.235 ± 0.748
3.911TyrLeu: 3.911 ± 1.677
0.559TyrMet: 0.559 ± 0.288
1.676TyrAsn: 1.676 ± 0.489
2.793TyrPro: 2.793 ± 1.316
1.676TyrGln: 1.676 ± 0.637
2.793TyrArg: 2.793 ± 0.943
1.955TyrSer: 1.955 ± 0.294
3.352TyrThr: 3.352 ± 1.364
1.397TyrVal: 1.397 ± 0.629
0.559TyrTrp: 0.559 ± 0.266
1.117TyrTyr: 1.117 ± 0.847
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3581 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski