Amino acid dipeptide frequency for Artemisia virus A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
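As an illustration of the idea, here is a minimal Python sketch that counts overlapping dipeptides and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are made up for this example, and the choice to count pairs within each protein only (never across protein boundaries) is an assumption; the database may count differently. One-letter codes are used for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence; pairs never span
        # two different proteins (an assumption of this sketch).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Artemisia virus A proteome.
toy = ["MKVLA", "AAKV"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The order of the two residues matters here: "KV" and "VK" are counted as different dipeptides, which is why the table has separate LysVal and ValLys cells.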

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
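The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names `expected_per_mille` and `label` are hypothetical, and the simple greater-than/less-than comparison is an assumption; the exact rule the page uses for colouring cells is not stated. The three example values are copied from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation: every ordered pair is equally likely."""
    return 1000.0 / alphabet_size ** 2


def label(observed, expected):
    """Classify a frequency relative to the uniform expectation."""
    if observed > expected:
        return "over"    # would be marked red
    if observed < expected:
        return "under"   # would be marked blue
    return "as expected"


expected = expected_per_mille()         # 20 x 20 = 400 dipeptides
expected_xaa = expected_per_mille(21)   # 21 x 21 = 441 dipeptides

examples = {"SerVal": 12.099, "AlaHis": 0.526, "CysCys": 0.0}
labels = {name: label(value, expected) for name, value in examples.items()}
print(expected, round(expected_xaa, 3), labels)
```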

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.734AlaAla: 4.734 ± 1.45
1.578AlaCys: 1.578 ± 0.614
4.734AlaAsp: 4.734 ± 1.906
8.417AlaGlu: 8.417 ± 2.347
4.734AlaPhe: 4.734 ± 1.761
2.104AlaGly: 2.104 ± 0.798
0.526AlaHis: 0.526 ± 0.338
2.104AlaIle: 2.104 ± 0.929
6.312AlaLys: 6.312 ± 1.316
3.156AlaLeu: 3.156 ± 0.438
2.63AlaMet: 2.63 ± 1.501
1.578AlaAsn: 1.578 ± 2.932
2.63AlaPro: 2.63 ± 3.26
2.104AlaGln: 2.104 ± 0.929
4.208AlaArg: 4.208 ± 1.783
7.365AlaSer: 7.365 ± 3.035
5.786AlaThr: 5.786 ± 2.723
3.156AlaVal: 3.156 ± 1.632
1.052AlaTrp: 1.052 ± 0.434
3.156AlaTyr: 3.156 ± 1.228
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.526CysAsp: 0.526 ± 0.338
1.578CysGlu: 1.578 ± 2.074
0.526CysPhe: 0.526 ± 0.338
1.052CysGly: 1.052 ± 2.2
0.0CysHis: 0.0 ± 0.0
2.104CysIle: 2.104 ± 0.506
1.052CysLys: 1.052 ± 0.999
2.63CysLeu: 2.63 ± 1.2
0.526CysMet: 0.526 ± 0.279
0.0CysAsn: 0.0 ± 0.0
2.104CysPro: 2.104 ± 0.782
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.578CysSer: 1.578 ± 0.865
1.052CysThr: 1.052 ± 0.434
1.052CysVal: 1.052 ± 0.434
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.63AspAla: 2.63 ± 1.008
0.526AspCys: 0.526 ± 0.338
4.734AspAsp: 4.734 ± 1.85
4.734AspGlu: 4.734 ± 1.586
4.734AspPhe: 4.734 ± 1.85
0.0AspGly: 0.0 ± 0.0
0.0AspHis: 0.0 ± 0.0
0.526AspIle: 0.526 ± 0.338
4.208AspLys: 4.208 ± 0.71
2.63AspLeu: 2.63 ± 1.008
1.052AspMet: 1.052 ± 0.434
2.104AspAsn: 2.104 ± 1.427
2.63AspPro: 2.63 ± 1.008
1.052AspGln: 1.052 ± 0.434
0.0AspArg: 0.0 ± 0.0
6.839AspSer: 6.839 ± 2.565
1.578AspThr: 1.578 ± 0.865
5.786AspVal: 5.786 ± 1.621
3.156AspTrp: 3.156 ± 1.011
1.578AspTyr: 1.578 ± 0.614
0.0AspXaa: 0.0 ± 0.0
Glu
5.786GluAla: 5.786 ± 2.381
0.0GluCys: 0.0 ± 0.0
3.156GluAsp: 3.156 ± 1.228
3.682GluGlu: 3.682 ± 1.493
4.734GluPhe: 4.734 ± 1.272
3.156GluGly: 3.156 ± 1.228
2.104GluHis: 2.104 ± 0.975
4.208GluIle: 4.208 ± 3.119
4.208GluLys: 4.208 ± 0.982
8.417GluLeu: 8.417 ± 1.358
2.104GluMet: 2.104 ± 0.506
4.208GluAsn: 4.208 ± 1.734
5.786GluPro: 5.786 ± 2.278
3.156GluGln: 3.156 ± 0.438
4.208GluArg: 4.208 ± 1.612
2.63GluSer: 2.63 ± 0.548
6.839GluThr: 6.839 ± 3.479
2.104GluVal: 2.104 ± 2.387
0.526GluTrp: 0.526 ± 0.338
0.526GluTyr: 0.526 ± 0.338
0.526GluXaa: 0.526 ± 0.338
Phe
4.734PheAla: 4.734 ± 1.099
0.526PheCys: 0.526 ± 0.338
2.63PheAsp: 2.63 ± 0.838
2.63PheGlu: 2.63 ± 0.814
3.682PhePhe: 3.682 ± 0.976
4.208PheGly: 4.208 ± 0.982
1.578PheHis: 1.578 ± 0.614
0.526PheIle: 0.526 ± 0.338
3.682PheLys: 3.682 ± 1.253
2.104PheLeu: 2.104 ± 0.891
0.0PheMet: 0.0 ± 0.0
0.526PheAsn: 0.526 ± 0.338
1.578PhePro: 1.578 ± 0.614
2.104PheGln: 2.104 ± 0.891
2.104PheArg: 2.104 ± 1.427
4.734PheSer: 4.734 ± 1.072
3.682PheThr: 3.682 ± 1.425
3.156PheVal: 3.156 ± 0.438
0.0PheTrp: 0.0 ± 0.0
1.578PheTyr: 1.578 ± 0.614
0.0PheXaa: 0.0 ± 0.0
Gly
5.786GlyAla: 5.786 ± 2.133
1.578GlyCys: 1.578 ± 0.846
3.156GlyAsp: 3.156 ± 1.741
3.682GlyGlu: 3.682 ± 0.733
3.682GlyPhe: 3.682 ± 0.558
4.208GlyGly: 4.208 ± 1.783
1.578GlyHis: 1.578 ± 0.614
3.156GlyIle: 3.156 ± 1.011
3.682GlyLys: 3.682 ± 1.679
5.786GlyLeu: 5.786 ± 1.422
2.104GlyMet: 2.104 ± 0.891
0.526GlyAsn: 0.526 ± 0.977
1.052GlyPro: 1.052 ± 0.434
0.526GlyGln: 0.526 ± 0.977
5.26GlyArg: 5.26 ± 1.091
6.839GlySer: 6.839 ± 0.786
5.786GlyThr: 5.786 ± 0.933
5.26GlyVal: 5.26 ± 0.345
1.052GlyTrp: 1.052 ± 0.434
3.156GlyTyr: 3.156 ± 0.757
0.0GlyXaa: 0.0 ± 0.0
His
0.526HisAla: 0.526 ± 0.977
0.0HisCys: 0.0 ± 0.0
0.526HisAsp: 0.526 ± 0.338
0.526HisGlu: 0.526 ± 0.338
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.052HisIle: 1.052 ± 0.434
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.526HisMet: 0.526 ± 0.338
0.526HisAsn: 0.526 ± 0.977
1.578HisPro: 1.578 ± 0.614
1.052HisGln: 1.052 ± 0.434
1.052HisArg: 1.052 ± 0.434
1.578HisSer: 1.578 ± 0.614
0.526HisThr: 0.526 ± 0.338
1.052HisVal: 1.052 ± 0.434
0.0HisTrp: 0.0 ± 0.0
1.578HisTyr: 1.578 ± 1.547
0.0HisXaa: 0.0 ± 0.0
Ile
5.26IleAla: 5.26 ± 1.803
0.0IleCys: 0.0 ± 0.0
5.786IleAsp: 5.786 ± 0.933
3.156IleGlu: 3.156 ± 1.011
2.63IlePhe: 2.63 ± 0.838
6.312IleGly: 6.312 ± 1.519
1.052IleHis: 1.052 ± 0.434
2.104IleIle: 2.104 ± 1.613
1.578IleLys: 1.578 ± 0.865
2.104IleLeu: 2.104 ± 1.346
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
3.156IlePro: 3.156 ± 1.326
3.156IleGln: 3.156 ± 0.438
4.208IleArg: 4.208 ± 1.321
4.734IleSer: 4.734 ± 1.621
1.052IleThr: 1.052 ± 0.999
2.104IleVal: 2.104 ± 1.346
1.578IleTrp: 1.578 ± 1.007
0.526IleTyr: 0.526 ± 0.977
0.0IleXaa: 0.0 ± 0.0
Lys
3.156LysAla: 3.156 ± 1.011
1.052LysCys: 1.052 ± 0.999
0.526LysAsp: 0.526 ± 0.338
3.682LysGlu: 3.682 ± 1.679
3.682LysPhe: 3.682 ± 0.558
4.208LysGly: 4.208 ± 1.734
0.526LysHis: 0.526 ± 0.338
5.26LysIle: 5.26 ± 2.102
2.104LysLys: 2.104 ± 3.451
5.786LysLeu: 5.786 ± 1.422
0.526LysMet: 0.526 ± 0.977
0.526LysAsn: 0.526 ± 0.338
3.682LysPro: 3.682 ± 1.323
1.578LysGln: 1.578 ± 0.663
5.786LysArg: 5.786 ± 1.331
5.26LysSer: 5.26 ± 1.499
3.682LysThr: 3.682 ± 1.253
8.417LysVal: 8.417 ± 2.355
1.578LysTrp: 1.578 ± 0.865
1.052LysTyr: 1.052 ± 2.2
0.0LysXaa: 0.0 ± 0.0
Leu
5.26LeuAla: 5.26 ± 3.466
1.052LeuCys: 1.052 ± 0.675
5.26LeuAsp: 5.26 ± 2.016
7.365LeuGlu: 7.365 ± 1.951
2.63LeuPhe: 2.63 ± 1.316
7.365LeuGly: 7.365 ± 1.116
0.0LeuHis: 0.0 ± 0.0
3.682LeuIle: 3.682 ± 1.036
0.526LeuLys: 0.526 ± 0.977
3.156LeuLeu: 3.156 ± 0.438
3.156LeuMet: 3.156 ± 1.228
3.682LeuAsn: 3.682 ± 1.323
2.104LeuPro: 2.104 ± 0.798
3.156LeuGln: 3.156 ± 1.301
2.63LeuArg: 2.63 ± 1.008
9.995LeuSer: 9.995 ± 3.076
4.734LeuThr: 4.734 ± 1.272
6.312LeuVal: 6.312 ± 1.628
2.63LeuTrp: 2.63 ± 0.548
2.63LeuTyr: 2.63 ± 1.008
0.0LeuXaa: 0.0 ± 0.0
Met
3.156MetAla: 3.156 ± 0.438
0.526MetCys: 0.526 ± 0.977
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.052MetGly: 1.052 ± 0.675
0.0MetHis: 0.0 ± 0.0
1.052MetIle: 1.052 ± 0.434
1.578MetLys: 1.578 ± 0.614
4.208MetLeu: 4.208 ± 1.734
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.526MetPro: 0.526 ± 0.338
0.526MetGln: 0.526 ± 0.338
0.526MetArg: 0.526 ± 0.338
1.578MetSer: 1.578 ± 2.58
2.104MetThr: 2.104 ± 1.613
0.0MetVal: 0.0 ± 0.0
0.526MetTrp: 0.526 ± 0.338
2.104MetTyr: 2.104 ± 0.867
0.0MetXaa: 0.0 ± 0.0
Asn
0.526AsnAla: 0.526 ± 0.476
0.526AsnCys: 0.526 ± 0.338
1.052AsnAsp: 1.052 ± 0.434
3.682AsnGlu: 3.682 ± 0.733
0.526AsnPhe: 0.526 ± 0.338
6.312AsnGly: 6.312 ± 0.883
0.526AsnHis: 0.526 ± 0.338
2.63AsnIle: 2.63 ± 3.729
1.052AsnLys: 1.052 ± 0.434
3.682AsnLeu: 3.682 ± 0.733
0.526AsnMet: 0.526 ± 0.876
1.578AsnAsn: 1.578 ± 0.663
0.526AsnPro: 0.526 ± 0.338
1.052AsnGln: 1.052 ± 0.835
2.63AsnArg: 2.63 ± 2.188
4.208AsnSer: 4.208 ± 1.137
1.052AsnThr: 1.052 ± 1.955
0.526AsnVal: 0.526 ± 0.977
1.578AsnTrp: 1.578 ± 0.663
1.052AsnTyr: 1.052 ± 0.434
0.0AsnXaa: 0.0 ± 0.0
Pro
3.682ProAla: 3.682 ± 0.976
1.052ProCys: 1.052 ± 1.055
1.578ProAsp: 1.578 ± 0.614
4.734ProGlu: 4.734 ± 0.836
1.578ProPhe: 1.578 ± 0.614
4.734ProGly: 4.734 ± 1.336
0.526ProHis: 0.526 ± 0.338
4.208ProIle: 4.208 ± 3.745
4.734ProLys: 4.734 ± 1.85
3.156ProLeu: 3.156 ± 2.492
0.0ProMet: 0.0 ± 0.0
1.052ProAsn: 1.052 ± 0.434
5.786ProPro: 5.786 ± 2.535
4.734ProGln: 4.734 ± 1.85
1.578ProArg: 1.578 ± 0.614
5.26ProSer: 5.26 ± 2.652
2.63ProThr: 2.63 ± 1.298
5.26ProVal: 5.26 ± 1.091
1.052ProTrp: 1.052 ± 0.835
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.578GlnAla: 1.578 ± 1.013
0.0GlnCys: 0.0 ± 0.0
1.052GlnAsp: 1.052 ± 0.434
2.104GlnGlu: 2.104 ± 0.891
2.104GlnPhe: 2.104 ± 0.867
0.526GlnGly: 0.526 ± 0.476
0.0GlnHis: 0.0 ± 0.0
1.578GlnIle: 1.578 ± 0.663
1.578GlnLys: 1.578 ± 0.614
4.208GlnLeu: 4.208 ± 1.343
0.0GlnMet: 0.0 ± 0.0
2.104GlnAsn: 2.104 ± 0.506
2.104GlnPro: 2.104 ± 0.798
1.052GlnGln: 1.052 ± 0.434
2.104GlnArg: 2.104 ± 1.669
6.312GlnSer: 6.312 ± 2.41
1.052GlnThr: 1.052 ± 1.955
2.104GlnVal: 2.104 ± 0.867
2.104GlnTrp: 2.104 ± 0.891
2.63GlnTyr: 2.63 ± 1.008
0.0GlnXaa: 0.0 ± 0.0
Arg
3.682ArgAla: 3.682 ± 1.036
1.052ArgCys: 1.052 ± 0.675
5.26ArgAsp: 5.26 ± 1.306
4.208ArgGlu: 4.208 ± 1.612
1.578ArgPhe: 1.578 ± 1.013
4.208ArgGly: 4.208 ± 0.982
0.0ArgHis: 0.0 ± 0.0
3.156ArgIle: 3.156 ± 1.311
4.208ArgLys: 4.208 ± 0.812
3.682ArgLeu: 3.682 ± 1.716
0.0ArgMet: 0.0 ± 0.0
4.208ArgAsn: 4.208 ± 1.012
2.104ArgPro: 2.104 ± 0.506
0.526ArgGln: 0.526 ± 0.977
6.839ArgArg: 6.839 ± 2.442
1.578ArgSer: 1.578 ± 0.846
3.682ArgThr: 3.682 ± 0.558
4.208ArgVal: 4.208 ± 1.156
0.0ArgTrp: 0.0 ± 0.0
1.578ArgTyr: 1.578 ± 0.816
0.0ArgXaa: 0.0 ± 0.0
Ser
7.365SerAla: 7.365 ± 1.531
0.526SerCys: 0.526 ± 0.476
2.104SerAsp: 2.104 ± 0.891
5.26SerGlu: 5.26 ± 2.016
3.156SerPhe: 3.156 ± 1.521
6.839SerGly: 6.839 ± 1.739
1.578SerHis: 1.578 ± 0.816
1.052SerIle: 1.052 ± 0.923
8.417SerLys: 8.417 ± 3.034
9.995SerLeu: 9.995 ± 3.253
0.0SerMet: 0.0 ± 0.0
8.417SerAsn: 8.417 ± 1.565
8.943SerPro: 8.943 ± 1.267
1.052SerGln: 1.052 ± 0.434
3.682SerArg: 3.682 ± 1.425
11.047SerSer: 11.047 ± 2.71
2.63SerThr: 2.63 ± 3.767
12.099SerVal: 12.099 ± 1.885
1.578SerTrp: 1.578 ± 0.614
1.578SerTyr: 1.578 ± 1.007
0.0SerXaa: 0.0 ± 0.0
Thr
6.839ThrAla: 6.839 ± 1.409
2.104ThrCys: 2.104 ± 1.998
0.526ThrAsp: 0.526 ± 0.977
4.734ThrGlu: 4.734 ± 2.473
1.578ThrPhe: 1.578 ± 0.614
5.786ThrGly: 5.786 ± 1.209
0.526ThrHis: 0.526 ± 0.977
3.156ThrIle: 3.156 ± 1.011
2.63ThrLys: 2.63 ± 1.141
2.63ThrLeu: 2.63 ± 0.548
2.63ThrMet: 2.63 ± 1.158
1.578ThrAsn: 1.578 ± 1.786
4.208ThrPro: 4.208 ± 2.185
1.052ThrGln: 1.052 ± 0.923
4.734ThrArg: 4.734 ± 0.836
2.63ThrSer: 2.63 ± 0.838
4.208ThrThr: 4.208 ± 0.71
0.526ThrVal: 0.526 ± 1.1
0.0ThrTrp: 0.0 ± 0.0
0.526ThrTyr: 0.526 ± 1.1
0.526ThrXaa: 0.526 ± 0.977
Val
6.312ValAla: 6.312 ± 1.343
1.052ValCys: 1.052 ± 0.434
5.26ValAsp: 5.26 ± 1.495
5.786ValGlu: 5.786 ± 1.742
3.156ValPhe: 3.156 ± 0.775
2.104ValGly: 2.104 ± 0.506
1.578ValHis: 1.578 ± 0.663
6.839ValIle: 6.839 ± 0.837
7.891ValLys: 7.891 ± 1.197
2.63ValLeu: 2.63 ± 0.548
2.104ValMet: 2.104 ± 0.867
1.578ValAsn: 1.578 ± 0.663
3.156ValPro: 3.156 ± 1.228
4.208ValGln: 4.208 ± 1.156
2.63ValArg: 2.63 ± 2.611
5.26ValSer: 5.26 ± 1.65
0.526ValThr: 0.526 ± 0.977
4.208ValVal: 4.208 ± 1.577
1.052ValTrp: 1.052 ± 0.434
1.578ValTyr: 1.578 ± 1.786
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.052TrpCys: 1.052 ± 0.999
0.526TrpAsp: 0.526 ± 0.338
1.578TrpGlu: 1.578 ± 1.007
0.0TrpPhe: 0.0 ± 0.0
1.052TrpGly: 1.052 ± 0.434
0.0TrpHis: 0.0 ± 0.0
1.052TrpIle: 1.052 ± 0.434
2.104TrpLys: 2.104 ± 0.891
2.104TrpLeu: 2.104 ± 0.867
0.0TrpMet: 0.0 ± 0.0
1.052TrpAsn: 1.052 ± 1.726
2.63TrpPro: 2.63 ± 1.2
1.052TrpGln: 1.052 ± 0.434
1.052TrpArg: 1.052 ± 0.434
4.208TrpSer: 4.208 ± 1.612
0.526TrpThr: 0.526 ± 0.977
0.0TrpVal: 0.0 ± 0.0
1.052TrpTrp: 1.052 ± 0.434
0.526TrpTyr: 0.526 ± 0.977
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.052TyrAla: 1.052 ± 0.999
1.578TyrCys: 1.578 ± 2.074
2.104TyrAsp: 2.104 ± 1.613
0.526TyrGlu: 0.526 ± 0.977
1.052TyrPhe: 1.052 ± 0.434
2.63TyrGly: 2.63 ± 0.548
0.0TyrHis: 0.0 ± 0.0
1.052TyrIle: 1.052 ± 0.835
0.526TyrLys: 0.526 ± 0.338
5.26TyrLeu: 5.26 ± 2.102
1.052TyrMet: 1.052 ± 0.434
0.0TyrAsn: 0.0 ± 0.0
1.052TyrPro: 1.052 ± 0.999
2.63TyrGln: 2.63 ± 1.2
0.526TyrArg: 0.526 ± 0.338
3.156TyrSer: 3.156 ± 1.311
0.526TyrThr: 0.526 ± 1.1
1.578TyrVal: 1.578 ± 0.614
1.052TyrTrp: 1.052 ± 0.434
0.526TyrTyr: 0.526 ± 0.977
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.526XaaGln: 0.526 ± 0.338
0.0XaaArg: 0.0 ± 0.0
0.526XaaSer: 0.526 ± 0.977
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1902 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
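A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. This is only a sketch under stated assumptions. The function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error measure are all choices made for this example; the page does not say which spread measure the database uses.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a set of proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over `rounds` protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Toy input, not taken from the Artemisia virus A proteome.
toy = ["MKVLA", "AAKV", "KVKV", "MLLA"]
err = bootstrap_error(toy, "KV")
print(f"KV: {pair_per_mille(toy, 'KV'):.3f} ± {err:.3f}")
```

With only a handful of proteins, as here, each resample can differ a lot from the original set, which is consistent with some table entries having an error larger than the value itself.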

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski