Amino acid dipeptide frequency for Shuangao Insect Virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
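The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`dipeptide_per_mille`, `classify`) are hypothetical, one-letter residue codes are used instead of the three-letter codes of the table, overlapping adjacent pairs are counted within each protein and pooled over the proteome, and the "expected" value is taken to be the uniform 1/400 baseline. The database's exact procedure is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else becomes 'X' (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform baseline: 1 out of 400 dipeptides = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: non-standard or unknown residues are mapped to 'X'.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


# Toy input, not real proteome data.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
print(freqs["KK"], classify(freqs["KK"]))
```

With the table values, `classify(3.846)` (AlaAla) returns `"over"` and `classify(1.399)` (AlaCys) returns `"under"` under this uniform baseline.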

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
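As a small illustration of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 3.846  # AlaAla frequency in per mille, taken from the table
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value expressed in percent
```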

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.846AlaAla: 3.846 ± 4.361
1.399AlaCys: 1.399 ± 0.726
3.497AlaAsp: 3.497 ± 1.889
2.098AlaGlu: 2.098 ± 1.068
3.147AlaPhe: 3.147 ± 1.145
1.748AlaGly: 1.748 ± 1.416
2.098AlaHis: 2.098 ± 1.781
3.147AlaIle: 3.147 ± 0.088
3.497AlaLys: 3.497 ± 0.646
6.643AlaLeu: 6.643 ± 1.393
2.448AlaMet: 2.448 ± 0.989
4.895AlaAsn: 4.895 ± 0.958
1.049AlaPro: 1.049 ± 0.618
1.748AlaGln: 1.748 ± 1.021
1.399AlaArg: 1.399 ± 0.851
5.594AlaSer: 5.594 ± 3.184
3.497AlaThr: 3.497 ± 1.36
2.797AlaVal: 2.797 ± 1.864
1.049AlaTrp: 1.049 ± 0.759
1.399AlaTyr: 1.399 ± 0.824
0.0AlaXaa: 0.0 ± 0.0
Cys
0.35CysAla: 0.35 ± 0.206
0.699CysCys: 0.699 ± 1.054
1.399CysAsp: 1.399 ± 1.119
1.748CysGlu: 1.748 ± 0.611
0.699CysPhe: 0.699 ± 0.637
0.0CysGly: 0.0 ± 0.0
0.35CysHis: 0.35 ± 0.206
0.699CysIle: 0.699 ± 0.296
1.748CysLys: 1.748 ± 1.381
1.049CysLeu: 1.049 ± 0.643
0.699CysMet: 0.699 ± 0.508
0.35CysAsn: 0.35 ± 0.374
1.399CysPro: 1.399 ± 1.226
0.699CysGln: 0.699 ± 0.637
0.699CysArg: 0.699 ± 0.749
3.147CysSer: 3.147 ± 1.954
1.748CysThr: 1.748 ± 0.38
1.748CysVal: 1.748 ± 0.929
0.699CysTrp: 0.699 ± 0.637
1.049CysTyr: 1.049 ± 0.643
0.0CysXaa: 0.0 ± 0.0
Asp
2.448AspAla: 2.448 ± 0.984
1.399AspCys: 1.399 ± 1.226
4.545AspAsp: 4.545 ± 0.983
3.846AspGlu: 3.846 ± 1.317
2.797AspPhe: 2.797 ± 1.543
2.797AspGly: 2.797 ± 1.734
1.399AspHis: 1.399 ± 0.327
5.944AspIle: 5.944 ± 1.17
6.294AspLys: 6.294 ± 1.808
5.594AspLeu: 5.594 ± 0.853
3.497AspMet: 3.497 ± 1.65
2.797AspAsn: 2.797 ± 1.222
1.049AspPro: 1.049 ± 0.347
2.448AspGln: 2.448 ± 0.543
0.699AspArg: 0.699 ± 0.296
3.497AspSer: 3.497 ± 1.599
2.448AspThr: 2.448 ± 1.048
3.497AspVal: 3.497 ± 0.831
0.699AspTrp: 0.699 ± 0.296
5.594AspTyr: 5.594 ± 2.081
0.0AspXaa: 0.0 ± 0.0
Glu
2.797GluAla: 2.797 ± 0.726
2.098GluCys: 2.098 ± 1.413
3.846GluAsp: 3.846 ± 0.409
2.098GluGlu: 2.098 ± 0.935
1.049GluPhe: 1.049 ± 0.618
1.049GluGly: 1.049 ± 0.411
1.399GluHis: 1.399 ± 0.487
3.497GluIle: 3.497 ± 0.768
4.895GluLys: 4.895 ± 1.595
3.497GluLeu: 3.497 ± 0.646
3.147GluMet: 3.147 ± 0.698
1.399GluAsn: 1.399 ± 0.487
0.699GluPro: 0.699 ± 0.412
3.147GluGln: 3.147 ± 0.971
1.399GluArg: 1.399 ± 0.492
3.846GluSer: 3.846 ± 1.013
4.895GluThr: 4.895 ± 0.871
3.147GluVal: 3.147 ± 0.506
0.0GluTrp: 0.0 ± 0.0
3.147GluTyr: 3.147 ± 0.624
0.0GluXaa: 0.0 ± 0.0
Phe
2.448PheAla: 2.448 ± 2.507
1.399PheCys: 1.399 ± 0.593
2.448PheAsp: 2.448 ± 0.543
1.748PheGlu: 1.748 ± 0.542
0.35PhePhe: 0.35 ± 0.206
1.049PheGly: 1.049 ± 0.463
0.699PheHis: 0.699 ± 0.412
4.196PheIle: 4.196 ± 0.909
4.545PheLys: 4.545 ± 0.816
1.399PheLeu: 1.399 ± 0.327
1.748PheMet: 1.748 ± 0.292
3.497PheAsn: 3.497 ± 0.822
1.049PhePro: 1.049 ± 0.534
1.748PheGln: 1.748 ± 0.38
1.748PheArg: 1.748 ± 0.954
2.098PheSer: 2.098 ± 0.384
2.098PheThr: 2.098 ± 1.055
1.748PheVal: 1.748 ± 0.38
0.0PheTrp: 0.0 ± 0.0
1.049PheTyr: 1.049 ± 1.123
0.0PheXaa: 0.0 ± 0.0
Gly
2.098GlyAla: 2.098 ± 1.569
2.098GlyCys: 2.098 ± 0.908
2.448GlyAsp: 2.448 ± 1.056
1.399GlyGlu: 1.399 ± 0.487
1.049GlyPhe: 1.049 ± 1.089
3.147GlyGly: 3.147 ± 0.708
3.846GlyHis: 3.846 ± 2.061
3.147GlyIle: 3.147 ± 1.03
4.196GlyLys: 4.196 ± 1.399
5.594GlyLeu: 5.594 ± 2.37
0.35GlyMet: 0.35 ± 0.206
0.699GlyAsn: 0.699 ± 0.296
2.448GlyPro: 2.448 ± 2.181
1.049GlyGln: 1.049 ± 0.534
3.846GlyArg: 3.846 ± 0.896
2.797GlySer: 2.797 ± 0.871
1.399GlyThr: 1.399 ± 0.726
2.448GlyVal: 2.448 ± 1.056
0.699GlyTrp: 0.699 ± 0.412
2.098GlyTyr: 2.098 ± 0.852
0.0GlyXaa: 0.0 ± 0.0
His
2.098HisAla: 2.098 ± 0.852
0.0HisCys: 0.0 ± 0.0
2.098HisAsp: 2.098 ± 0.384
1.049HisGlu: 1.049 ± 0.618
0.0HisPhe: 0.0 ± 0.0
0.699HisGly: 0.699 ± 0.749
0.0HisHis: 0.0 ± 0.0
2.797HisIle: 2.797 ± 0.695
2.797HisLys: 2.797 ± 0.976
3.147HisLeu: 3.147 ± 0.836
0.699HisMet: 0.699 ± 0.296
0.699HisAsn: 0.699 ± 0.749
0.699HisPro: 0.699 ± 0.296
1.748HisGln: 1.748 ± 0.663
0.699HisArg: 0.699 ± 0.412
2.098HisSer: 2.098 ± 0.693
0.699HisThr: 0.699 ± 0.412
1.049HisVal: 1.049 ± 0.463
0.0HisTrp: 0.0 ± 0.0
1.748HisTyr: 1.748 ± 0.632
0.0HisXaa: 0.0 ± 0.0
Ile
4.895IleAla: 4.895 ± 1.124
1.049IleCys: 1.049 ± 0.643
7.343IleAsp: 7.343 ± 0.618
5.594IleGlu: 5.594 ± 0.69
2.098IlePhe: 2.098 ± 0.801
4.196IleGly: 4.196 ± 2.115
1.399IleHis: 1.399 ± 0.487
3.846IleIle: 3.846 ± 1.648
5.594IleLys: 5.594 ± 1.58
4.545IleLeu: 4.545 ± 0.43
1.748IleMet: 1.748 ± 1.03
4.545IleAsn: 4.545 ± 0.53
1.748IlePro: 1.748 ± 0.663
3.846IleGln: 3.846 ± 0.881
2.098IleArg: 2.098 ± 1.413
5.594IleSer: 5.594 ± 3.034
5.245IleThr: 5.245 ± 1.254
3.147IleVal: 3.147 ± 1.555
0.35IleTrp: 0.35 ± 0.206
2.797IleTyr: 2.797 ± 0.839
0.0IleXaa: 0.0 ± 0.0
Lys
4.196LysAla: 4.196 ± 2.749
0.699LysCys: 0.699 ± 0.296
5.245LysAsp: 5.245 ± 1.31
3.846LysGlu: 3.846 ± 0.932
3.147LysPhe: 3.147 ± 1.113
5.944LysGly: 5.944 ± 1.081
1.748LysHis: 1.748 ± 1.03
4.895LysIle: 4.895 ± 0.546
6.993LysLys: 6.993 ± 1.425
7.343LysLeu: 7.343 ± 1.953
3.147LysMet: 3.147 ± 1.326
4.545LysAsn: 4.545 ± 0.983
2.797LysPro: 2.797 ± 0.726
4.196LysGln: 4.196 ± 0.426
3.497LysArg: 3.497 ± 0.842
5.245LysSer: 5.245 ± 0.875
7.692LysThr: 7.692 ± 1.313
4.545LysVal: 4.545 ± 0.866
1.049LysTrp: 1.049 ± 0.347
3.497LysTyr: 3.497 ± 0.697
0.0LysXaa: 0.0 ± 0.0
Leu
4.545LeuAla: 4.545 ± 2.041
1.748LeuCys: 1.748 ± 0.954
4.196LeuAsp: 4.196 ± 0.909
5.594LeuGlu: 5.594 ± 0.909
2.098LeuPhe: 2.098 ± 0.852
4.895LeuGly: 4.895 ± 0.871
2.098LeuHis: 2.098 ± 0.457
5.245LeuIle: 5.245 ± 1.918
6.643LeuLys: 6.643 ± 2.947
5.594LeuLeu: 5.594 ± 0.443
2.098LeuMet: 2.098 ± 0.852
6.993LeuAsn: 6.993 ± 0.787
3.147LeuPro: 3.147 ± 1.03
1.748LeuGln: 1.748 ± 0.632
3.147LeuArg: 3.147 ± 0.606
8.042LeuSer: 8.042 ± 2.753
5.245LeuThr: 5.245 ± 3.271
3.846LeuVal: 3.846 ± 0.409
0.35LeuTrp: 0.35 ± 0.206
3.147LeuTyr: 3.147 ± 0.606
0.0LeuXaa: 0.0 ± 0.0
Met
3.147MetAla: 3.147 ± 0.088
0.35MetCys: 0.35 ± 0.206
1.748MetAsp: 1.748 ± 1.03
0.699MetGlu: 0.699 ± 0.425
2.448MetPhe: 2.448 ± 0.693
1.049MetGly: 1.049 ± 0.411
0.0MetHis: 0.0 ± 0.0
2.797MetIle: 2.797 ± 1.211
4.196MetLys: 4.196 ± 0.848
2.797MetLeu: 2.797 ± 0.983
1.399MetMet: 1.399 ± 0.68
3.147MetAsn: 3.147 ± 0.506
3.147MetPro: 3.147 ± 1.084
0.699MetGln: 0.699 ± 0.425
0.0MetArg: 0.0 ± 0.0
3.497MetSer: 3.497 ± 2.131
2.448MetThr: 2.448 ± 0.543
1.399MetVal: 1.399 ± 0.824
0.0MetTrp: 0.0 ± 0.0
1.748MetTyr: 1.748 ± 0.611
0.0MetXaa: 0.0 ± 0.0
Asn
5.594AsnAla: 5.594 ± 1.081
1.049AsnCys: 1.049 ± 0.643
3.497AsnAsp: 3.497 ± 1.395
2.797AsnGlu: 2.797 ± 1.734
2.448AsnPhe: 2.448 ± 1.42
2.797AsnGly: 2.797 ± 0.479
1.049AsnHis: 1.049 ± 0.618
4.895AsnIle: 4.895 ± 1.051
5.245AsnLys: 5.245 ± 0.34
2.797AsnLeu: 2.797 ± 1.175
4.196AsnMet: 4.196 ± 1.712
2.448AsnAsn: 2.448 ± 1.219
2.448AsnPro: 2.448 ± 0.423
3.147AsnGln: 3.147 ± 0.56
1.748AsnArg: 1.748 ± 0.663
4.196AsnSer: 4.196 ± 0.444
3.147AsnThr: 3.147 ± 0.56
3.147AsnVal: 3.147 ± 0.666
0.699AsnTrp: 0.699 ± 0.749
3.147AsnTyr: 3.147 ± 1.084
0.0AsnXaa: 0.0 ± 0.0
Pro
2.448ProAla: 2.448 ± 0.892
0.35ProCys: 0.35 ± 0.374
2.448ProAsp: 2.448 ± 0.543
1.049ProGlu: 1.049 ± 0.618
1.748ProPhe: 1.748 ± 0.632
1.049ProGly: 1.049 ± 0.442
0.35ProHis: 0.35 ± 0.374
3.147ProIle: 3.147 ± 1.448
2.797ProLys: 2.797 ± 1.175
3.497ProLeu: 3.497 ± 0.729
1.399ProMet: 1.399 ± 0.487
3.147ProAsn: 3.147 ± 0.585
1.049ProPro: 1.049 ± 0.643
2.098ProGln: 2.098 ± 0.852
0.699ProArg: 0.699 ± 0.296
3.147ProSer: 3.147 ± 0.506
1.399ProThr: 1.399 ± 0.966
1.748ProVal: 1.748 ± 0.875
0.0ProTrp: 0.0 ± 0.0
2.098ProTyr: 2.098 ± 1.929
0.0ProXaa: 0.0 ± 0.0
Gln
1.748GlnAla: 1.748 ± 1.63
1.049GlnCys: 1.049 ± 0.618
1.748GlnAsp: 1.748 ± 0.663
1.049GlnGlu: 1.049 ± 0.347
2.448GlnPhe: 2.448 ± 0.776
1.748GlnGly: 1.748 ± 1.021
0.35GlnHis: 0.35 ± 0.206
4.545GlnIle: 4.545 ± 0.695
1.748GlnLys: 1.748 ± 0.292
3.497GlnLeu: 3.497 ± 1.117
0.699GlnMet: 0.699 ± 0.296
2.797GlnAsn: 2.797 ± 0.235
1.049GlnPro: 1.049 ± 0.618
1.748GlnGln: 1.748 ± 0.38
1.399GlnArg: 1.399 ± 0.824
3.846GlnSer: 3.846 ± 0.947
2.098GlnThr: 2.098 ± 1.236
2.098GlnVal: 2.098 ± 0.415
0.0GlnTrp: 0.0 ± 0.0
3.147GlnTyr: 3.147 ± 1.084
0.0GlnXaa: 0.0 ± 0.0
Arg
2.098ArgAla: 2.098 ± 1.008
0.35ArgCys: 0.35 ± 0.206
1.049ArgAsp: 1.049 ± 0.411
2.448ArgGlu: 2.448 ± 0.869
2.098ArgPhe: 2.098 ± 1.286
3.147ArgGly: 3.147 ± 0.624
1.399ArgHis: 1.399 ± 0.824
1.049ArgIle: 1.049 ± 0.618
2.448ArgLys: 2.448 ± 0.642
3.846ArgLeu: 3.846 ± 1.07
1.399ArgMet: 1.399 ± 0.824
1.748ArgAsn: 1.748 ± 0.292
1.748ArgPro: 1.748 ± 0.929
1.399ArgGln: 1.399 ± 0.824
1.748ArgArg: 1.748 ± 0.663
1.748ArgSer: 1.748 ± 0.611
4.196ArgThr: 4.196 ± 0.83
3.147ArgVal: 3.147 ± 1.448
0.0ArgTrp: 0.0 ± 0.0
0.699ArgTyr: 0.699 ± 0.296
0.0ArgXaa: 0.0 ± 0.0
Ser
3.497SerAla: 3.497 ± 1.788
2.448SerCys: 2.448 ± 2.12
4.196SerAsp: 4.196 ± 1.594
4.895SerGlu: 4.895 ± 1.227
3.147SerPhe: 3.147 ± 1.29
3.147SerGly: 3.147 ± 3.043
1.399SerHis: 1.399 ± 0.593
4.895SerIle: 4.895 ± 0.605
7.692SerLys: 7.692 ± 1.098
5.594SerLeu: 5.594 ± 0.925
2.448SerMet: 2.448 ± 0.642
5.245SerAsn: 5.245 ± 0.489
4.196SerPro: 4.196 ± 1.664
2.098SerGln: 2.098 ± 0.801
3.497SerArg: 3.497 ± 0.646
4.895SerSer: 4.895 ± 0.871
4.895SerThr: 4.895 ± 1.753
3.846SerVal: 3.846 ± 0.238
1.748SerTrp: 1.748 ± 0.611
1.748SerTyr: 1.748 ± 1.03
0.0SerXaa: 0.0 ± 0.0
Thr
3.497ThrAla: 3.497 ± 1.596
1.399ThrCys: 1.399 ± 1.179
4.196ThrAsp: 4.196 ± 1.388
3.497ThrGlu: 3.497 ± 0.969
3.497ThrPhe: 3.497 ± 1.313
2.797ThrGly: 2.797 ± 1.29
1.399ThrHis: 1.399 ± 0.824
5.594ThrIle: 5.594 ± 1.458
4.196ThrLys: 4.196 ± 1.518
6.993ThrLeu: 6.993 ± 1.024
2.448ThrMet: 2.448 ± 1.478
2.098ThrAsn: 2.098 ± 0.415
1.748ThrPro: 1.748 ± 1.192
2.448ThrGln: 2.448 ± 1.114
3.497ThrArg: 3.497 ± 1.568
5.245ThrSer: 5.245 ± 2.396
3.147ThrThr: 3.147 ± 1.186
3.497ThrVal: 3.497 ± 1.788
0.35ThrTrp: 0.35 ± 0.374
3.497ThrTyr: 3.497 ± 1.65
0.0ThrXaa: 0.0 ± 0.0
Val
2.098ValAla: 2.098 ± 0.636
0.699ValCys: 0.699 ± 0.508
2.797ValAsp: 2.797 ± 0.235
3.147ValGlu: 3.147 ± 0.666
1.399ValPhe: 1.399 ± 0.966
2.797ValGly: 2.797 ± 1.836
2.797ValHis: 2.797 ± 1.247
3.497ValIle: 3.497 ± 1.675
4.196ValLys: 4.196 ± 0.936
4.196ValLeu: 4.196 ± 1.219
1.748ValMet: 1.748 ± 0.806
3.846ValAsn: 3.846 ± 0.947
1.399ValPro: 1.399 ± 0.357
1.049ValGln: 1.049 ± 0.442
3.846ValArg: 3.846 ± 0.932
2.797ValSer: 2.797 ± 0.929
4.545ValThr: 4.545 ± 1.227
4.895ValVal: 4.895 ± 1.932
0.35ValTrp: 0.35 ± 0.206
3.147ValTyr: 3.147 ± 0.669
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.35TrpAsp: 0.35 ± 0.206
0.699TrpGlu: 0.699 ± 0.412
0.35TrpPhe: 0.35 ± 0.206
0.35TrpGly: 0.35 ± 0.374
0.35TrpHis: 0.35 ± 0.374
0.0TrpIle: 0.0 ± 0.0
0.699TrpLys: 0.699 ± 0.296
0.35TrpLeu: 0.35 ± 0.206
0.0TrpMet: 0.0 ± 0.0
1.049TrpAsn: 1.049 ± 1.123
0.35TrpPro: 0.35 ± 0.206
0.35TrpGln: 0.35 ± 0.206
0.0TrpArg: 0.0 ± 0.0
1.049TrpSer: 1.049 ± 0.643
1.399TrpThr: 1.399 ± 1.205
0.699TrpVal: 0.699 ± 0.296
0.35TrpTrp: 0.35 ± 0.206
0.35TrpTyr: 0.35 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.497TyrAla: 3.497 ± 1.289
0.699TyrCys: 0.699 ± 0.749
4.196TyrAsp: 4.196 ± 0.426
1.748TyrGlu: 1.748 ± 0.542
1.049TyrPhe: 1.049 ± 0.618
2.448TyrGly: 2.448 ± 0.42
0.699TyrHis: 0.699 ± 0.412
3.846TyrIle: 3.846 ± 1.513
3.846TyrLys: 3.846 ± 1.513
2.448TyrLeu: 2.448 ± 0.543
1.049TyrMet: 1.049 ± 0.643
4.545TyrAsn: 4.545 ± 1.018
2.448TyrPro: 2.448 ± 0.82
1.399TyrGln: 1.399 ± 0.487
2.098TyrArg: 2.098 ± 0.693
3.147TyrSer: 3.147 ± 0.971
2.797TyrThr: 2.797 ± 0.974
2.797TyrVal: 2.797 ± 1.247
0.35TyrTrp: 0.35 ± 0.374
3.497TyrTyr: 3.497 ± 1.326
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2861 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
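A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration, not the database's actual code: the function names are hypothetical, one-letter residue codes are used (so `"KK"` corresponds to LysLys), whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation across the replicates.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide, pooled over the given sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap samples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MKKLA", "AKK", "GGGKK", "LLLL"]
print(pair_per_mille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

With only a handful of proteins, as in this proteome, the resampled sets differ strongly from one another, which is consistent with some of the large errors seen in the table.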

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski