Amino acid dipeptide frequency for Atractylodes mild mottle virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
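The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the toy sequences are hypothetical, pairs are counted as overlapping windows within each protein, and the page does not say exactly how the database normalizes its counts.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

# Uniform expectation from the text: 1 in 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs never span two proteins, and any non-standard
    letter is mapped to X (Xaa) before counting.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLSE", "KKSS"]
freqs = dipeptide_permille(toy)
```

With the values in the table, `classify(16.546)` (LysLys) gives "over" and `classify(0.424)` (AlaMet) gives "under", which corresponds to the red and blue marking described above.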

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.97 ± 1.056
AlaCys: 0.0 ± 0.0
AlaAsp: 3.394 ± 1.169
AlaGlu: 2.97 ± 0.885
AlaPhe: 2.97 ± 0.78
AlaGly: 1.697 ± 0.69
AlaHis: 1.273 ± 0.348
AlaIle: 1.697 ± 0.299
AlaLys: 5.515 ± 1.332
AlaLeu: 2.97 ± 1.718
AlaMet: 0.424 ± 0.31
AlaAsn: 3.394 ± 0.989
AlaPro: 0.849 ± 0.365
AlaGln: 0.849 ± 0.382
AlaArg: 1.273 ± 0.931
AlaSer: 3.394 ± 1.47
AlaThr: 2.546 ± 0.906
AlaVal: 3.394 ± 1.48
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.546 ± 1.337
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.424 ± 0.377
CysCys: 1.273 ± 0.692
CysAsp: 1.273 ± 0.348
CysGlu: 1.697 ± 0.686
CysPhe: 0.849 ± 0.382
CysGly: 0.849 ± 0.483
CysHis: 0.0 ± 0.0
CysIle: 0.424 ± 0.466
CysLys: 1.697 ± 0.856
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.849 ± 0.83
CysPro: 2.121 ± 0.64
CysGln: 1.273 ± 0.801
CysArg: 0.849 ± 0.382
CysSer: 1.697 ± 0.626
CysThr: 0.0 ± 0.0
CysVal: 1.273 ± 0.58
CysTrp: 0.424 ± 0.377
CysTyr: 0.424 ± 0.466
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.121 ± 0.614
AspCys: 1.273 ± 0.419
AspAsp: 3.818 ± 0.896
AspGlu: 5.94 ± 1.969
AspPhe: 2.121 ± 0.745
AspGly: 0.849 ± 0.45
AspHis: 0.849 ± 0.527
AspIle: 6.364 ± 1.736
AspLys: 3.818 ± 0.742
AspLeu: 4.243 ± 0.931
AspMet: 0.424 ± 0.377
AspAsn: 3.818 ± 1.47
AspPro: 2.97 ± 1.11
AspGln: 4.667 ± 0.505
AspArg: 3.394 ± 0.93
AspSer: 6.364 ± 0.918
AspThr: 2.97 ± 0.936
AspVal: 1.697 ± 0.67
AspTrp: 0.0 ± 0.0
AspTyr: 1.273 ± 0.692
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.667 ± 1.432
GluCys: 0.849 ± 0.382
GluAsp: 6.364 ± 1.141
GluGlu: 6.364 ± 0.927
GluPhe: 5.091 ± 1.306
GluGly: 1.273 ± 1.17
GluHis: 2.546 ± 1.169
GluIle: 5.515 ± 1.682
GluLys: 8.485 ± 2.836
GluLeu: 6.364 ± 1.241
GluMet: 0.424 ± 0.367
GluAsn: 5.515 ± 0.827
GluPro: 2.121 ± 1.341
GluGln: 3.394 ± 0.797
GluArg: 3.394 ± 1.072
GluSer: 4.243 ± 1.843
GluThr: 2.121 ± 0.742
GluVal: 5.091 ± 2.251
GluTrp: 0.849 ± 0.45
GluTyr: 1.273 ± 0.419
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.697 ± 0.481
PheCys: 2.121 ± 0.85
PheAsp: 1.697 ± 0.619
PheGlu: 2.121 ± 0.498
PhePhe: 3.394 ± 1.235
PheGly: 2.97 ± 0.893
PheHis: 0.849 ± 0.382
PheIle: 2.546 ± 1.35
PheLys: 4.667 ± 1.342
PheLeu: 4.243 ± 2.289
PheMet: 0.424 ± 0.466
PheAsn: 2.546 ± 1.232
PhePro: 2.97 ± 1.065
PheGln: 2.97 ± 0.583
PheArg: 1.697 ± 0.626
PheSer: 6.364 ± 0.778
PheThr: 2.546 ± 1.108
PheVal: 2.97 ± 0.836
PheTrp: 0.849 ± 0.62
PheTyr: 0.849 ± 0.754
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.273 ± 0.918
GlyCys: 0.424 ± 0.377
GlyAsp: 2.546 ± 0.46
GlyGlu: 2.121 ± 0.891
GlyPhe: 2.121 ± 0.956
GlyGly: 0.849 ± 0.365
GlyHis: 1.697 ± 0.619
GlyIle: 4.243 ± 0.931
GlyLys: 6.364 ± 1.024
GlyLeu: 3.818 ± 0.749
GlyMet: 1.273 ± 0.71
GlyAsn: 3.394 ± 1.735
GlyPro: 1.273 ± 0.645
GlyGln: 0.424 ± 0.466
GlyArg: 1.273 ± 0.726
GlySer: 5.515 ± 1.128
GlyThr: 1.273 ± 0.664
GlyVal: 0.849 ± 0.45
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.697 ± 0.947
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.849 ± 0.563
HisAsp: 0.0 ± 0.0
HisGlu: 0.424 ± 0.35
HisPhe: 0.849 ± 0.527
HisGly: 0.424 ± 0.31
HisHis: 1.697 ± 0.69
HisIle: 2.546 ± 0.725
HisLys: 2.97 ± 0.997
HisLeu: 2.121 ± 0.914
HisMet: 0.849 ± 0.365
HisAsn: 0.849 ± 0.502
HisPro: 1.273 ± 0.692
HisGln: 1.273 ± 0.419
HisArg: 0.849 ± 0.62
HisSer: 2.121 ± 0.701
HisThr: 0.424 ± 0.377
HisVal: 2.97 ± 1.12
HisTrp: 0.424 ± 0.31
HisTyr: 1.273 ± 0.348
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.546 ± 0.4
IleCys: 1.697 ± 0.626
IleAsp: 6.364 ± 2.083
IleGlu: 5.091 ± 0.462
IlePhe: 2.121 ± 0.606
IleGly: 2.121 ± 0.989
IleHis: 1.273 ± 0.477
IleIle: 4.667 ± 1.84
IleLys: 7.213 ± 1.478
IleLeu: 8.485 ± 1.73
IleMet: 1.697 ± 0.67
IleAsn: 5.515 ± 1.574
IlePro: 3.818 ± 0.875
IleGln: 1.697 ± 0.626
IleArg: 2.121 ± 1.149
IleSer: 4.243 ± 1.561
IleThr: 3.394 ± 0.838
IleVal: 2.121 ± 0.747
IleTrp: 0.0 ± 0.0
IleTyr: 3.394 ± 0.894
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.091 ± 1.508
LysCys: 0.849 ± 0.637
LysAsp: 5.94 ± 1.576
LysGlu: 10.182 ± 2.298
LysPhe: 8.061 ± 2.935
LysGly: 5.515 ± 1.901
LysHis: 2.121 ± 0.71
LysIle: 6.788 ± 1.353
LysLys: 16.546 ± 5.279
LysLeu: 6.788 ± 2.093
LysMet: 2.546 ± 0.995
LysAsn: 5.94 ± 1.463
LysPro: 5.94 ± 1.318
LysGln: 5.091 ± 1.165
LysArg: 4.243 ± 0.969
LysSer: 8.91 ± 1.636
LysThr: 4.243 ± 0.776
LysVal: 5.94 ± 1.112
LysTrp: 0.424 ± 0.31
LysTyr: 1.697 ± 0.677
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.243 ± 0.563
LeuCys: 1.697 ± 0.835
LeuAsp: 5.515 ± 0.819
LeuGlu: 8.061 ± 1.975
LeuPhe: 2.546 ± 0.987
LeuGly: 6.364 ± 1.715
LeuHis: 1.697 ± 0.607
LeuIle: 8.061 ± 2.467
LeuLys: 11.88 ± 1.779
LeuLeu: 7.637 ± 3.204
LeuMet: 2.546 ± 0.999
LeuAsn: 4.243 ± 1.342
LeuPro: 1.697 ± 0.607
LeuGln: 3.394 ± 1.192
LeuArg: 2.121 ± 1.058
LeuSer: 4.667 ± 1.016
LeuThr: 3.394 ± 1.758
LeuVal: 4.243 ± 0.686
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.697 ± 0.763
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.849 ± 0.483
MetCys: 0.424 ± 0.31
MetAsp: 1.697 ± 0.299
MetGlu: 0.849 ± 0.45
MetPhe: 0.424 ± 0.466
MetGly: 0.849 ± 0.527
MetHis: 0.0 ± 0.0
MetIle: 1.697 ± 0.854
MetLys: 0.424 ± 0.31
MetLeu: 0.424 ± 0.377
MetMet: 0.0 ± 0.0
MetAsn: 1.273 ± 0.692
MetPro: 0.849 ± 0.365
MetGln: 0.424 ± 0.31
MetArg: 0.849 ± 0.45
MetSer: 2.546 ± 0.971
MetThr: 1.273 ± 0.692
MetVal: 1.697 ± 0.518
MetTrp: 0.424 ± 0.31
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.97 ± 1.384
AsnCys: 0.424 ± 0.377
AsnAsp: 2.97 ± 1.406
AsnGlu: 4.243 ± 1.897
AsnPhe: 2.97 ± 0.583
AsnGly: 1.697 ± 0.972
AsnHis: 2.546 ± 0.725
AsnIle: 2.121 ± 0.548
AsnLys: 3.818 ± 1.317
AsnLeu: 8.485 ± 2.962
AsnMet: 0.849 ± 0.401
AsnAsn: 3.818 ± 1.332
AsnPro: 4.243 ± 1.165
AsnGln: 2.121 ± 0.89
AsnArg: 2.546 ± 0.715
AsnSer: 2.546 ± 0.4
AsnThr: 3.394 ± 0.824
AsnVal: 2.97 ± 0.969
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.97 ± 0.835
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.849 ± 0.365
ProCys: 0.849 ± 0.701
ProAsp: 2.121 ± 0.973
ProGlu: 3.394 ± 0.575
ProPhe: 1.273 ± 0.93
ProGly: 1.697 ± 0.557
ProHis: 1.697 ± 0.624
ProIle: 2.546 ± 0.437
ProLys: 4.667 ± 1.243
ProLeu: 2.546 ± 1.463
ProMet: 0.849 ± 0.502
ProAsn: 4.243 ± 1.1
ProPro: 4.243 ± 0.917
ProGln: 0.424 ± 0.31
ProArg: 1.697 ± 1.102
ProSer: 4.243 ± 1.532
ProThr: 2.97 ± 0.606
ProVal: 2.546 ± 1.096
ProTrp: 0.424 ± 0.578
ProTyr: 0.424 ± 0.377
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.97 ± 0.812
GlnCys: 0.849 ± 0.639
GlnAsp: 1.273 ± 0.692
GlnGlu: 2.97 ± 0.728
GlnPhe: 1.697 ± 0.835
GlnGly: 1.697 ± 0.854
GlnHis: 0.424 ± 0.35
GlnIle: 3.818 ± 1.106
GlnLys: 2.97 ± 0.599
GlnLeu: 3.818 ± 1.974
GlnMet: 1.273 ± 0.692
GlnAsn: 2.546 ± 0.704
GlnPro: 1.697 ± 0.584
GlnGln: 2.546 ± 0.788
GlnArg: 1.273 ± 0.584
GlnSer: 2.121 ± 0.808
GlnThr: 2.121 ± 0.934
GlnVal: 3.818 ± 1.183
GlnTrp: 0.849 ± 0.382
GlnTyr: 2.121 ± 0.989
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.424 ± 0.35
ArgCys: 0.424 ± 0.377
ArgAsp: 1.273 ± 0.654
ArgGlu: 3.818 ± 0.599
ArgPhe: 0.849 ± 0.62
ArgGly: 2.546 ± 0.989
ArgHis: 0.849 ± 0.502
ArgIle: 2.546 ± 1.223
ArgLys: 5.091 ± 1.175
ArgLeu: 3.818 ± 1.486
ArgMet: 0.424 ± 0.31
ArgAsn: 0.849 ± 0.382
ArgPro: 0.424 ± 0.377
ArgGln: 1.697 ± 0.626
ArgArg: 1.697 ± 0.854
ArgSer: 2.546 ± 0.576
ArgThr: 3.394 ± 1.602
ArgVal: 1.273 ± 0.589
ArgTrp: 0.424 ± 0.31
ArgTyr: 1.273 ± 0.584
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.121 ± 1.041
SerCys: 0.849 ± 0.45
SerAsp: 4.243 ± 0.684
SerGlu: 8.061 ± 2.666
SerPhe: 5.515 ± 1.101
SerGly: 5.091 ± 0.974
SerHis: 1.273 ± 0.58
SerIle: 4.667 ± 0.68
SerLys: 14.425 ± 2.141
SerLeu: 8.061 ± 2.451
SerMet: 1.273 ± 0.584
SerAsn: 2.121 ± 0.649
SerPro: 1.697 ± 0.607
SerGln: 2.121 ± 0.498
SerArg: 2.97 ± 0.938
SerSer: 11.455 ± 3.681
SerThr: 3.394 ± 1.094
SerVal: 2.97 ± 1.709
SerTrp: 0.849 ± 0.502
SerTyr: 1.697 ± 0.899
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.394 ± 0.755
ThrCys: 0.424 ± 0.31
ThrAsp: 2.97 ± 0.792
ThrGlu: 1.697 ± 0.494
ThrPhe: 1.697 ± 0.558
ThrGly: 1.697 ± 0.703
ThrHis: 0.424 ± 0.377
ThrIle: 4.243 ± 1.916
ThrLys: 3.394 ± 1.657
ThrLeu: 4.243 ± 1.306
ThrMet: 0.0 ± 0.0
ThrAsn: 3.394 ± 0.947
ThrPro: 2.121 ± 0.632
ThrGln: 2.121 ± 0.548
ThrArg: 1.273 ± 0.419
ThrSer: 5.515 ± 1.277
ThrThr: 2.97 ± 1.324
ThrVal: 2.546 ± 0.807
ThrTrp: 0.849 ± 0.382
ThrTyr: 1.697 ± 1.094
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.697 ± 0.67
ValCys: 1.273 ± 0.584
ValAsp: 3.818 ± 1.534
ValGlu: 3.394 ± 1.097
ValPhe: 4.243 ± 1.467
ValGly: 2.97 ± 1.354
ValHis: 1.273 ± 1.398
ValIle: 2.97 ± 0.782
ValLys: 4.667 ± 0.907
ValLeu: 3.818 ± 0.576
ValMet: 0.424 ± 0.35
ValAsn: 2.546 ± 1.234
ValPro: 1.697 ± 0.624
ValGln: 3.818 ± 1.149
ValArg: 1.697 ± 0.686
ValSer: 3.394 ± 1.162
ValThr: 2.121 ± 0.576
ValVal: 1.697 ± 0.703
ValTrp: 0.0 ± 0.0
ValTyr: 3.394 ± 0.801
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.424 ± 0.466
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.424 ± 0.31
TrpPhe: 0.424 ± 0.31
TrpGly: 0.424 ± 0.31
TrpHis: 0.0 ± 0.0
TrpIle: 0.424 ± 0.377
TrpLys: 0.849 ± 0.365
TrpLeu: 0.424 ± 0.377
TrpMet: 0.424 ± 0.31
TrpAsn: 0.849 ± 0.754
TrpPro: 0.0 ± 0.0
TrpGln: 0.849 ± 0.62
TrpArg: 0.0 ± 0.0
TrpSer: 0.849 ± 0.639
TrpThr: 0.424 ± 0.31
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.424 ± 0.35
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.394 ± 1.162
TyrCys: 0.849 ± 0.365
TyrAsp: 1.697 ± 1.057
TyrGlu: 2.121 ± 1.181
TyrPhe: 1.273 ± 0.654
TyrGly: 0.849 ± 0.365
TyrHis: 1.697 ± 0.557
TyrIle: 1.697 ± 0.947
TyrLys: 3.818 ± 0.37
TyrLeu: 3.394 ± 1.211
TyrMet: 0.424 ± 0.377
TyrAsn: 0.0 ± 0.0
TyrPro: 1.697 ± 0.677
TyrGln: 1.697 ± 0.557
TyrArg: 0.424 ± 0.578
TyrSer: 2.121 ± 0.773
TyrThr: 1.697 ± 0.481
TyrVal: 0.849 ± 0.365
TyrTrp: 0.424 ± 0.35
TyrTyr: 1.697 ± 0.686
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2358 amino acids)
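
For readers who want to work with the table entries directly, the following is a small sketch of a parser that reads one entry of the form "LysLys: 16.546 ± 5.279" and converts the per mille values to fractions by multiplying by 10^-3. The regular expression and the function name `parse_entry` are assumptions made for illustration and are not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error" in per mille.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)


def parse_entry(line):
    """Parse one table entry into (first, second, fraction, error_fraction).

    Returns None for lines that are not entries (for example a row label).
    Using search() lets the pattern skip any text before the residue codes.
    """
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) / 1000.0, float(error) / 1000.0


first, second, freq, err = parse_entry("LysLys: 16.546 ± 5.279")
```

Here `freq` is about 0.016546, so LysLys accounts for roughly 1.65% of all dipeptides in this proteome.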

Note: The error has been estimated by bootstrapping (x100) at the protein level.
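
A protein-level bootstrap of this kind might look like the sketch below. It assumes that each of the 100 replicates resamples whole proteins with replacement and that the reported error is the standard deviation across replicates. The note above does not spell out either detail, so treat both as assumptions; the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not taken from this virus).
toy = ["MKKLSE", "KKSS", "AKKA"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # a pair that never occurs
```

A dipeptide that occurs in none of the proteins has frequency 0 in every replicate, so its error is 0 as well, which is consistent with the "0.0 ± 0.0" entries in the table.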

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski