Amino acid dipepetide frequency for Burdock mottle virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.278AlaAla: 7.278 ± 2.078
0.27AlaCys: 0.27 ± 0.15
3.235AlaAsp: 3.235 ± 0.836
5.66AlaGlu: 5.66 ± 1.841
3.235AlaPhe: 3.235 ± 0.833
6.199AlaGly: 6.199 ± 2.749
1.348AlaHis: 1.348 ± 0.751
3.504AlaIle: 3.504 ± 1.355
4.313AlaLys: 4.313 ± 1.153
5.66AlaLeu: 5.66 ± 1.525
2.695AlaMet: 2.695 ± 1.213
3.774AlaAsn: 3.774 ± 0.826
1.348AlaPro: 1.348 ± 0.438
3.235AlaGln: 3.235 ± 1.228
5.121AlaArg: 5.121 ± 1.162
5.93AlaSer: 5.93 ± 2.064
2.695AlaThr: 2.695 ± 0.659
3.235AlaVal: 3.235 ± 0.798
0.539AlaTrp: 0.539 ± 0.3
1.887AlaTyr: 1.887 ± 1.44
0.0AlaXaa: 0.0 ± 0.0
Cys
0.809CysAla: 0.809 ± 0.628
1.078CysCys: 1.078 ± 0.615
1.617CysAsp: 1.617 ± 0.724
1.887CysGlu: 1.887 ± 1.154
1.078CysPhe: 1.078 ± 0.793
2.156CysGly: 2.156 ± 1.811
0.27CysHis: 0.27 ± 0.15
0.539CysIle: 0.539 ± 0.3
0.539CysLys: 0.539 ± 1.402
2.965CysLeu: 2.965 ± 1.163
0.0CysMet: 0.0 ± 0.0
1.617CysAsn: 1.617 ± 1.723
0.539CysPro: 0.539 ± 0.394
0.27CysGln: 0.27 ± 0.15
0.539CysArg: 0.539 ± 0.57
0.809CysSer: 0.809 ± 0.451
0.0CysThr: 0.0 ± 0.0
2.426CysVal: 2.426 ± 1.75
0.0CysTrp: 0.0 ± 0.0
0.539CysTyr: 0.539 ± 0.57
0.0CysXaa: 0.0 ± 0.0
Asp
4.582AspAla: 4.582 ± 1.192
2.695AspCys: 2.695 ± 0.873
3.774AspAsp: 3.774 ± 0.954
3.504AspGlu: 3.504 ± 0.716
1.887AspPhe: 1.887 ± 0.738
4.582AspGly: 4.582 ± 1.603
0.809AspHis: 0.809 ± 0.609
3.774AspIle: 3.774 ± 0.809
2.426AspLys: 2.426 ± 0.525
7.817AspLeu: 7.817 ± 1.611
1.617AspMet: 1.617 ± 0.689
1.617AspAsn: 1.617 ± 0.949
1.887AspPro: 1.887 ± 0.585
0.539AspGln: 0.539 ± 0.361
1.078AspArg: 1.078 ± 0.405
5.66AspSer: 5.66 ± 1.585
2.426AspThr: 2.426 ± 0.633
6.739AspVal: 6.739 ± 0.98
1.887AspTrp: 1.887 ± 0.398
1.887AspTyr: 1.887 ± 0.784
0.0AspXaa: 0.0 ± 0.0
Glu
2.156GluAla: 2.156 ± 1.211
0.539GluCys: 0.539 ± 0.57
2.156GluAsp: 2.156 ± 0.759
3.235GluGlu: 3.235 ± 0.631
5.121GluPhe: 5.121 ± 1.198
2.965GluGly: 2.965 ± 1.033
1.078GluHis: 1.078 ± 0.753
2.965GluIle: 2.965 ± 0.601
2.965GluLys: 2.965 ± 0.611
8.086GluLeu: 8.086 ± 1.629
1.078GluMet: 1.078 ± 0.601
2.426GluAsn: 2.426 ± 0.572
1.617GluPro: 1.617 ± 0.614
2.156GluGln: 2.156 ± 1.211
3.235GluArg: 3.235 ± 0.816
4.852GluSer: 4.852 ± 1.474
2.965GluThr: 2.965 ± 0.852
6.469GluVal: 6.469 ± 1.836
0.809GluTrp: 0.809 ± 0.441
2.156GluTyr: 2.156 ± 0.83
0.0GluXaa: 0.0 ± 0.0
Phe
3.235PheAla: 3.235 ± 0.605
0.809PheCys: 0.809 ± 0.553
3.774PheAsp: 3.774 ± 0.83
2.965PheGlu: 2.965 ± 1.264
2.426PhePhe: 2.426 ± 0.507
2.695PheGly: 2.695 ± 2.262
1.078PheHis: 1.078 ± 0.577
2.695PheIle: 2.695 ± 0.692
2.426PheLys: 2.426 ± 1.068
4.043PheLeu: 4.043 ± 1.236
0.809PheMet: 0.809 ± 0.451
0.539PheAsn: 0.539 ± 0.639
2.695PhePro: 2.695 ± 1.015
1.887PheGln: 1.887 ± 0.615
2.426PheArg: 2.426 ± 0.717
6.469PheSer: 6.469 ± 1.523
3.774PheThr: 3.774 ± 0.935
4.852PheVal: 4.852 ± 1.313
0.0PheTrp: 0.0 ± 0.0
0.539PheTyr: 0.539 ± 0.3
0.0PheXaa: 0.0 ± 0.0
Gly
3.235GlyAla: 3.235 ± 1.104
2.426GlyCys: 2.426 ± 3.109
4.852GlyAsp: 4.852 ± 2.633
2.965GlyGlu: 2.965 ± 0.494
4.582GlyPhe: 4.582 ± 1.803
6.199GlyGly: 6.199 ± 3.539
1.617GlyHis: 1.617 ± 0.648
2.695GlyIle: 2.695 ± 0.745
5.121GlyLys: 5.121 ± 1.316
4.852GlyLeu: 4.852 ± 2.652
0.809GlyMet: 0.809 ± 0.752
2.965GlyAsn: 2.965 ± 1.547
2.426GlyPro: 2.426 ± 0.525
2.426GlyGln: 2.426 ± 0.74
3.504GlyArg: 3.504 ± 0.683
7.008GlySer: 7.008 ± 2.077
3.235GlyThr: 3.235 ± 0.838
4.043GlyVal: 4.043 ± 2.95
0.809GlyTrp: 0.809 ± 0.451
2.965GlyTyr: 2.965 ± 1.137
0.0GlyXaa: 0.0 ± 0.0
His
1.078HisAla: 1.078 ± 0.763
0.0HisCys: 0.0 ± 0.0
1.078HisAsp: 1.078 ± 0.39
0.539HisGlu: 0.539 ± 0.298
1.617HisPhe: 1.617 ± 0.648
0.809HisGly: 0.809 ± 0.553
1.348HisHis: 1.348 ± 0.436
0.539HisIle: 0.539 ± 0.361
1.887HisLys: 1.887 ± 0.784
1.348HisLeu: 1.348 ± 0.896
1.617HisMet: 1.617 ± 0.423
0.27HisAsn: 0.27 ± 0.15
1.617HisPro: 1.617 ± 0.441
0.539HisGln: 0.539 ± 0.3
1.348HisArg: 1.348 ± 0.436
1.348HisSer: 1.348 ± 0.745
2.156HisThr: 2.156 ± 1.342
1.078HisVal: 1.078 ± 0.615
0.539HisTrp: 0.539 ± 0.3
0.539HisTyr: 0.539 ± 0.3
0.0HisXaa: 0.0 ± 0.0
Ile
3.235IleAla: 3.235 ± 0.91
1.348IleCys: 1.348 ± 1.415
3.235IleAsp: 3.235 ± 0.536
2.426IleGlu: 2.426 ± 1.068
1.348IlePhe: 1.348 ± 0.637
0.539IleGly: 0.539 ± 0.3
1.078IleHis: 1.078 ± 0.39
1.617IleIle: 1.617 ± 0.646
3.774IleLys: 3.774 ± 0.828
3.235IleLeu: 3.235 ± 0.824
0.809IleMet: 0.809 ± 0.451
1.887IleAsn: 1.887 ± 0.828
2.426IlePro: 2.426 ± 0.774
1.348IleGln: 1.348 ± 0.519
2.156IleArg: 2.156 ± 0.971
7.008IleSer: 7.008 ± 2.156
3.774IleThr: 3.774 ± 1.686
2.965IleVal: 2.965 ± 1.635
0.0IleTrp: 0.0 ± 0.0
1.348IleTyr: 1.348 ± 0.751
0.0IleXaa: 0.0 ± 0.0
Lys
2.965LysAla: 2.965 ± 0.494
1.078LysCys: 1.078 ± 0.39
3.504LysAsp: 3.504 ± 0.891
4.582LysGlu: 4.582 ± 0.904
2.695LysPhe: 2.695 ± 1.015
2.695LysGly: 2.695 ± 0.778
0.809LysHis: 0.809 ± 0.609
3.235LysIle: 3.235 ± 1.396
2.965LysLys: 2.965 ± 0.718
5.121LysLeu: 5.121 ± 0.769
1.617LysMet: 1.617 ± 0.982
1.887LysAsn: 1.887 ± 0.431
1.078LysPro: 1.078 ± 0.405
1.617LysGln: 1.617 ± 0.901
3.235LysArg: 3.235 ± 0.457
5.391LysSer: 5.391 ± 1.101
4.043LysThr: 4.043 ± 0.83
4.582LysVal: 4.582 ± 1.165
1.617LysTrp: 1.617 ± 1.649
2.156LysTyr: 2.156 ± 0.598
0.27LysXaa: 0.27 ± 0.345
Leu
6.739LeuAla: 6.739 ± 1.223
0.539LeuCys: 0.539 ± 0.678
4.043LeuAsp: 4.043 ± 1.556
5.66LeuGlu: 5.66 ± 1.319
5.121LeuPhe: 5.121 ± 0.713
5.391LeuGly: 5.391 ± 2.698
2.156LeuHis: 2.156 ± 1.233
3.504LeuIle: 3.504 ± 0.92
6.199LeuLys: 6.199 ± 1.646
7.008LeuLeu: 7.008 ± 1.586
4.043LeuMet: 4.043 ± 1.087
4.582LeuAsn: 4.582 ± 1.207
4.582LeuPro: 4.582 ± 0.543
2.965LeuGln: 2.965 ± 1.605
4.582LeuArg: 4.582 ± 1.269
6.469LeuSer: 6.469 ± 2.709
3.235LeuThr: 3.235 ± 1.063
10.243LeuVal: 10.243 ± 2.242
1.348LeuTrp: 1.348 ± 0.512
2.156LeuTyr: 2.156 ± 1.392
0.0LeuXaa: 0.0 ± 0.0
Met
2.156MetAla: 2.156 ± 0.734
1.078MetCys: 1.078 ± 1.134
1.617MetAsp: 1.617 ± 0.315
1.617MetGlu: 1.617 ± 0.901
1.078MetPhe: 1.078 ± 0.427
1.348MetGly: 1.348 ± 0.644
0.27MetHis: 0.27 ± 0.15
0.809MetIle: 0.809 ± 0.451
0.27MetLys: 0.27 ± 0.345
2.695MetLeu: 2.695 ± 0.544
0.539MetMet: 0.539 ± 0.3
0.809MetAsn: 0.809 ± 0.609
0.27MetPro: 0.27 ± 0.15
1.078MetGln: 1.078 ± 0.601
1.348MetArg: 1.348 ± 0.639
2.965MetSer: 2.965 ± 1.049
2.156MetThr: 2.156 ± 0.75
1.887MetVal: 1.887 ± 0.784
0.0MetTrp: 0.0 ± 0.0
1.078MetTyr: 1.078 ± 0.427
0.0MetXaa: 0.0 ± 0.0
Asn
3.235AsnAla: 3.235 ± 1.169
1.078AsnCys: 1.078 ± 0.39
2.156AsnAsp: 2.156 ± 0.697
1.078AsnGlu: 1.078 ± 0.78
2.695AsnPhe: 2.695 ± 0.739
1.887AsnGly: 1.887 ± 1.109
0.809AsnHis: 0.809 ± 0.628
2.156AsnIle: 2.156 ± 0.837
2.426AsnLys: 2.426 ± 0.717
3.774AsnLeu: 3.774 ± 0.625
1.348AsnMet: 1.348 ± 0.751
1.078AsnAsn: 1.078 ± 0.615
2.426AsnPro: 2.426 ± 0.616
1.348AsnGln: 1.348 ± 0.751
0.809AsnArg: 0.809 ± 0.451
3.504AsnSer: 3.504 ± 1.886
1.887AsnThr: 1.887 ± 0.624
2.965AsnVal: 2.965 ± 0.719
1.078AsnTrp: 1.078 ± 0.427
1.887AsnTyr: 1.887 ± 0.904
0.0AsnXaa: 0.0 ± 0.0
Pro
2.965ProAla: 2.965 ± 1.551
0.0ProCys: 0.0 ± 0.0
1.348ProAsp: 1.348 ± 0.519
2.156ProGlu: 2.156 ± 0.92
2.426ProPhe: 2.426 ± 0.717
1.887ProGly: 1.887 ± 0.716
0.27ProHis: 0.27 ± 0.473
2.695ProIle: 2.695 ± 1.213
1.617ProLys: 1.617 ± 0.614
3.235ProLeu: 3.235 ± 1.228
1.078ProMet: 1.078 ± 0.601
2.965ProAsn: 2.965 ± 1.542
3.504ProPro: 3.504 ± 2.001
0.809ProGln: 0.809 ± 0.451
2.965ProArg: 2.965 ± 0.736
3.235ProSer: 3.235 ± 1.29
1.348ProThr: 1.348 ± 0.604
2.156ProVal: 2.156 ± 0.947
0.0ProTrp: 0.0 ± 0.0
0.539ProTyr: 0.539 ± 0.3
0.0ProXaa: 0.0 ± 0.0
Gln
3.235GlnAla: 3.235 ± 0.958
0.0GlnCys: 0.0 ± 0.0
1.617GlnAsp: 1.617 ± 0.494
2.156GlnGlu: 2.156 ± 0.81
1.617GlnPhe: 1.617 ± 0.901
2.695GlnGly: 2.695 ± 0.528
1.078GlnHis: 1.078 ± 0.722
0.809GlnIle: 0.809 ± 0.553
2.695GlnLys: 2.695 ± 0.594
2.695GlnLeu: 2.695 ± 1.213
0.539GlnMet: 0.539 ± 0.298
1.348GlnAsn: 1.348 ± 0.751
0.539GlnPro: 0.539 ± 0.298
1.078GlnGln: 1.078 ± 0.551
1.617GlnArg: 1.617 ± 1.076
2.426GlnSer: 2.426 ± 0.849
2.965GlnThr: 2.965 ± 0.59
3.235GlnVal: 3.235 ± 1.229
0.27GlnTrp: 0.27 ± 0.15
0.27GlnTyr: 0.27 ± 0.15
0.0GlnXaa: 0.0 ± 0.0
Arg
5.121ArgAla: 5.121 ± 2.454
0.809ArgCys: 0.809 ± 0.362
3.235ArgAsp: 3.235 ± 1.007
4.582ArgGlu: 4.582 ± 0.945
3.235ArgPhe: 3.235 ± 1.878
2.965ArgGly: 2.965 ± 0.919
1.887ArgHis: 1.887 ± 0.716
2.965ArgIle: 2.965 ± 0.83
2.156ArgLys: 2.156 ± 0.512
3.774ArgLeu: 3.774 ± 1.176
1.078ArgMet: 1.078 ± 0.38
2.695ArgAsn: 2.695 ± 0.61
1.617ArgPro: 1.617 ± 0.508
1.887ArgGln: 1.887 ± 1.052
0.809ArgArg: 0.809 ± 0.469
3.235ArgSer: 3.235 ± 1.21
1.617ArgThr: 1.617 ± 0.964
3.504ArgVal: 3.504 ± 1.147
1.078ArgTrp: 1.078 ± 0.427
2.156ArgTyr: 2.156 ± 0.83
0.0ArgXaa: 0.0 ± 0.0
Ser
3.504SerAla: 3.504 ± 1.736
1.348SerCys: 1.348 ± 1.113
7.817SerAsp: 7.817 ± 1.538
6.199SerGlu: 6.199 ± 1.661
3.774SerPhe: 3.774 ± 1.15
11.051SerGly: 11.051 ± 2.848
0.27SerHis: 0.27 ± 0.15
4.043SerIle: 4.043 ± 1.226
5.391SerLys: 5.391 ± 1.43
7.008SerLeu: 7.008 ± 1.22
1.348SerMet: 1.348 ± 1.085
2.156SerAsn: 2.156 ± 1.305
3.235SerPro: 3.235 ± 1.18
1.887SerGln: 1.887 ± 1.014
7.278SerArg: 7.278 ± 1.971
6.739SerSer: 6.739 ± 1.562
5.121SerThr: 5.121 ± 1.418
7.817SerVal: 7.817 ± 0.945
1.348SerTrp: 1.348 ± 0.821
2.156SerTyr: 2.156 ± 0.586
0.0SerXaa: 0.0 ± 0.0
Thr
3.774ThrAla: 3.774 ± 1.56
0.809ThrCys: 0.809 ± 0.451
4.313ThrAsp: 4.313 ± 1.467
2.695ThrGlu: 2.695 ± 0.627
1.617ThrPhe: 1.617 ± 0.575
5.121ThrGly: 5.121 ± 1.877
2.426ThrHis: 2.426 ± 0.95
2.965ThrIle: 2.965 ± 0.501
3.504ThrLys: 3.504 ± 0.838
4.313ThrLeu: 4.313 ± 1.451
0.809ThrMet: 0.809 ± 0.362
0.809ThrAsn: 0.809 ± 0.451
0.809ThrPro: 0.809 ± 0.322
1.348ThrGln: 1.348 ± 0.751
2.156ThrArg: 2.156 ± 1.212
4.043ThrSer: 4.043 ± 0.797
3.504ThrThr: 3.504 ± 1.066
4.043ThrVal: 4.043 ± 1.135
0.539ThrTrp: 0.539 ± 0.298
1.887ThrTyr: 1.887 ± 0.698
0.0ThrXaa: 0.0 ± 0.0
Val
8.895ValAla: 8.895 ± 1.484
2.965ValCys: 2.965 ± 2.416
4.313ValAsp: 4.313 ± 1.437
3.774ValGlu: 3.774 ± 0.991
2.695ValPhe: 2.695 ± 1.203
5.121ValGly: 5.121 ± 1.472
2.426ValHis: 2.426 ± 0.418
1.348ValIle: 1.348 ± 0.438
6.199ValLys: 6.199 ± 2.089
8.625ValLeu: 8.625 ± 3.04
2.426ValMet: 2.426 ± 0.673
3.504ValAsn: 3.504 ± 1.015
4.043ValPro: 4.043 ± 1.234
2.695ValGln: 2.695 ± 0.431
4.043ValArg: 4.043 ± 1.078
8.086ValSer: 8.086 ± 1.383
3.235ValThr: 3.235 ± 0.609
9.164ValVal: 9.164 ± 3.089
0.0ValTrp: 0.0 ± 0.0
2.156ValTyr: 2.156 ± 1.202
0.0ValXaa: 0.0 ± 0.0
Trp
0.809TrpAla: 0.809 ± 0.362
0.27TrpCys: 0.27 ± 0.15
1.078TrpAsp: 1.078 ± 0.716
0.539TrpGlu: 0.539 ± 0.3
0.539TrpPhe: 0.539 ± 0.361
0.539TrpGly: 0.539 ± 0.55
0.0TrpHis: 0.0 ± 0.0
0.27TrpIle: 0.27 ± 0.15
0.0TrpLys: 0.0 ± 0.0
1.617TrpLeu: 1.617 ± 0.52
0.0TrpMet: 0.0 ± 0.0
1.078TrpAsn: 1.078 ± 0.427
0.539TrpPro: 0.539 ± 0.681
1.078TrpGln: 1.078 ± 0.39
0.539TrpArg: 0.539 ± 0.3
1.078TrpSer: 1.078 ± 0.39
0.809TrpThr: 0.809 ± 0.752
1.617TrpVal: 1.617 ± 0.938
0.0TrpTrp: 0.0 ± 0.0
0.539TrpTyr: 0.539 ± 0.3
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.156TyrAla: 2.156 ± 0.81
0.539TyrCys: 0.539 ± 0.298
2.426TyrAsp: 2.426 ± 0.751
0.809TyrGlu: 0.809 ± 0.451
1.348TyrPhe: 1.348 ± 0.751
1.887TyrGly: 1.887 ± 0.971
0.27TyrHis: 0.27 ± 0.15
2.426TyrIle: 2.426 ± 0.947
0.539TyrLys: 0.539 ± 0.394
2.695TyrLeu: 2.695 ± 0.806
0.539TyrMet: 0.539 ± 0.3
1.617TyrAsn: 1.617 ± 0.494
0.0TyrPro: 0.0 ± 0.0
2.426TyrGln: 2.426 ± 0.641
1.617TyrArg: 1.617 ± 0.727
3.235TyrSer: 3.235 ± 0.956
0.539TyrThr: 0.539 ± 0.298
2.695TyrVal: 2.695 ± 1.111
1.078TyrTrp: 1.078 ± 0.577
1.617TyrTyr: 1.617 ± 0.441
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.27XaaGln: 0.27 ± 0.345
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3711 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski