Amino acid dipeptide frequency for Pigeon circovirus (PiCV) (Columbid circovirus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
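As a rough illustration of how such frequencies can be obtained, the following Python sketch counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of counted dipeptides are assumptions made for illustration; the exact procedure used by Proteome-pI is not described here.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Dipeptides are counted as overlapping pairs within each protein, never
    across protein boundaries. Any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1 of 400 possible dipeptides, i.e. 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

# Toy input (assumption), not real PiCV sequences.
freqs = dipeptide_frequencies(["MARRA", "RRK"])
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only a handful of proteins, most of the 400 dipeptides occur zero or a few times, so large deviations from the uniform expectation are to be expected in either direction.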

All values are given in per mille (‰); multiply a value by 10^-3 to obtain the corresponding fraction.
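For example, the conversion from per mille to a fraction or a percentage can be written as below. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0

ala_ala = 11.186  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value as a percentage
print(per_mille_to_percent(2.5))       # uniform expectation of 1/400, in percent
```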

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.186AlaAla: 11.186 ± 5.143
0.0AlaCys: 0.0 ± 0.0
4.474AlaAsp: 4.474 ± 2.245
2.237AlaGlu: 2.237 ± 1.234
3.356AlaPhe: 3.356 ± 1.128
2.237AlaGly: 2.237 ± 1.331
1.119AlaHis: 1.119 ± 0.857
1.119AlaIle: 1.119 ± 0.857
2.237AlaLys: 2.237 ± 1.713
6.711AlaLeu: 6.711 ± 2.358
1.119AlaMet: 1.119 ± 0.857
1.119AlaAsn: 1.119 ± 0.908
8.949AlaPro: 8.949 ± 1.952
2.237AlaGln: 2.237 ± 2.157
8.949AlaArg: 8.949 ± 1.879
2.237AlaSer: 2.237 ± 2.769
7.83AlaThr: 7.83 ± 2.301
4.474AlaVal: 4.474 ± 1.25
4.474AlaTrp: 4.474 ± 1.489
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
2.237CysCys: 2.237 ± 1.331
0.0CysAsp: 0.0 ± 0.0
2.237CysGlu: 2.237 ± 0.721
2.237CysPhe: 2.237 ± 0.721
3.356CysGly: 3.356 ± 1.743
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.119CysLys: 1.119 ± 0.857
0.0CysLeu: 0.0 ± 0.0
1.119CysMet: 1.119 ± 0.857
2.237CysAsn: 2.237 ± 1.817
1.119CysPro: 1.119 ± 1.079
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
3.356CysSer: 3.356 ± 1.5
0.0CysThr: 0.0 ± 0.0
1.119CysVal: 1.119 ± 0.857
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.474AspAla: 4.474 ± 1.442
2.237AspCys: 2.237 ± 0.721
6.711AspAsp: 6.711 ± 3.937
1.119AspGlu: 1.119 ± 0.857
7.83AspPhe: 7.83 ± 2.308
2.237AspGly: 2.237 ± 0.721
1.119AspHis: 1.119 ± 0.857
1.119AspIle: 1.119 ± 0.908
2.237AspLys: 2.237 ± 0.721
7.83AspLeu: 7.83 ± 1.434
0.0AspMet: 0.0 ± 0.0
1.119AspAsn: 1.119 ± 0.857
3.356AspPro: 3.356 ± 2.725
0.0AspGln: 0.0 ± 0.0
1.119AspArg: 1.119 ± 0.857
2.237AspSer: 2.237 ± 1.817
0.0AspThr: 0.0 ± 0.0
3.356AspVal: 3.356 ± 1.753
0.0AspTrp: 0.0 ± 0.0
3.356AspTyr: 3.356 ± 0.88
0.0AspXaa: 0.0 ± 0.0
Glu
4.474GluAla: 4.474 ± 2.479
1.119GluCys: 1.119 ± 0.908
3.356GluAsp: 3.356 ± 1.297
3.356GluGlu: 3.356 ± 2.57
2.237GluPhe: 2.237 ± 1.234
4.474GluGly: 4.474 ± 1.25
0.0GluHis: 0.0 ± 0.0
3.356GluIle: 3.356 ± 1.83
2.237GluLys: 2.237 ± 1.713
2.237GluLeu: 2.237 ± 0.721
2.237GluMet: 2.237 ± 1.709
0.0GluAsn: 0.0 ± 0.0
0.0GluPro: 0.0 ± 0.0
1.119GluGln: 1.119 ± 0.857
1.119GluArg: 1.119 ± 0.857
0.0GluSer: 0.0 ± 0.0
1.119GluThr: 1.119 ± 0.857
10.067GluVal: 10.067 ± 3.218
1.119GluTrp: 1.119 ± 0.857
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.474PheAla: 4.474 ± 3.051
1.119PheCys: 1.119 ± 0.857
2.237PheAsp: 2.237 ± 1.206
3.356PheGlu: 3.356 ± 0.88
2.237PhePhe: 2.237 ± 1.951
3.356PheGly: 3.356 ± 2.125
2.237PheHis: 2.237 ± 1.234
1.119PheIle: 1.119 ± 1.384
5.593PheLys: 5.593 ± 2.031
1.119PheLeu: 1.119 ± 1.079
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
1.119PhePro: 1.119 ± 0.908
3.356PheGln: 3.356 ± 2.007
5.593PheArg: 5.593 ± 2.056
4.474PheSer: 4.474 ± 1.556
6.711PheThr: 6.711 ± 1.975
2.237PheVal: 2.237 ± 1.713
0.0PheTrp: 0.0 ± 0.0
3.356PheTyr: 3.356 ± 1.297
0.0PheXaa: 0.0 ± 0.0
Gly
5.593GlyAla: 5.593 ± 1.369
1.119GlyCys: 1.119 ± 0.857
2.237GlyAsp: 2.237 ± 1.817
2.237GlyGlu: 2.237 ± 1.713
4.474GlyPhe: 4.474 ± 1.295
6.711GlyGly: 6.711 ± 5.284
5.593GlyHis: 5.593 ± 2.531
3.356GlyIle: 3.356 ± 1.128
3.356GlyLys: 3.356 ± 1.83
5.593GlyLeu: 5.593 ± 1.491
2.237GlyMet: 2.237 ± 1.271
4.474GlyAsn: 4.474 ± 1.25
2.237GlyPro: 2.237 ± 1.331
3.356GlyGln: 3.356 ± 0.88
6.711GlyArg: 6.711 ± 1.661
6.711GlySer: 6.711 ± 3.019
3.356GlyThr: 3.356 ± 1.167
4.474GlyVal: 4.474 ± 2.722
1.119GlyTrp: 1.119 ± 0.857
3.356GlyTyr: 3.356 ± 2.624
0.0GlyXaa: 0.0 ± 0.0
His
1.119HisAla: 1.119 ± 1.337
1.119HisCys: 1.119 ± 1.079
1.119HisAsp: 1.119 ± 0.857
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
2.237HisIle: 2.237 ± 1.525
2.237HisLys: 2.237 ± 1.331
5.593HisLeu: 5.593 ± 2.835
0.0HisMet: 0.0 ± 0.0
1.119HisAsn: 1.119 ± 1.337
0.0HisPro: 0.0 ± 0.0
2.237HisGln: 2.237 ± 1.331
2.237HisArg: 2.237 ± 1.206
0.0HisSer: 0.0 ± 0.0
2.237HisThr: 2.237 ± 1.206
0.0HisVal: 0.0 ± 0.0
1.119HisTrp: 1.119 ± 0.857
2.237HisTyr: 2.237 ± 0.721
0.0HisXaa: 0.0 ± 0.0
Ile
4.474IleAla: 4.474 ± 2.073
1.119IleCys: 1.119 ± 0.908
1.119IleAsp: 1.119 ± 0.857
1.119IleGlu: 1.119 ± 0.908
2.237IlePhe: 2.237 ± 1.206
1.119IleGly: 1.119 ± 0.857
0.0IleHis: 0.0 ± 0.0
3.356IleIle: 3.356 ± 1.827
1.119IleLys: 1.119 ± 0.857
0.0IleLeu: 0.0 ± 0.0
0.0IleMet: 0.0 ± 0.0
3.356IleAsn: 3.356 ± 1.83
2.237IlePro: 2.237 ± 0.721
1.119IleGln: 1.119 ± 1.384
5.593IleArg: 5.593 ± 3.721
0.0IleSer: 0.0 ± 0.0
4.474IleThr: 4.474 ± 1.575
4.474IleVal: 4.474 ± 2.479
0.0IleTrp: 0.0 ± 0.0
2.237IleTyr: 2.237 ± 0.721
0.0IleXaa: 0.0 ± 0.0
Lys
3.356LysAla: 3.356 ± 1.827
0.0LysCys: 0.0 ± 0.0
1.119LysAsp: 1.119 ± 0.908
2.237LysGlu: 2.237 ± 1.713
4.474LysPhe: 4.474 ± 1.442
7.83LysGly: 7.83 ± 2.232
0.0LysHis: 0.0 ± 0.0
1.119LysIle: 1.119 ± 0.908
2.237LysLys: 2.237 ± 1.713
2.237LysLeu: 2.237 ± 0.721
2.237LysMet: 2.237 ± 1.124
0.0LysAsn: 0.0 ± 0.0
1.119LysPro: 1.119 ± 0.857
2.237LysGln: 2.237 ± 1.713
4.474LysArg: 4.474 ± 2.076
3.356LysSer: 3.356 ± 1.827
3.356LysThr: 3.356 ± 1.128
4.474LysVal: 4.474 ± 2.245
2.237LysTrp: 2.237 ± 0.721
3.356LysTyr: 3.356 ± 2.57
0.0LysXaa: 0.0 ± 0.0
Leu
5.593LeuAla: 5.593 ± 2.757
2.237LeuCys: 2.237 ± 1.951
1.119LeuAsp: 1.119 ± 0.908
1.119LeuGlu: 1.119 ± 0.857
4.474LeuPhe: 4.474 ± 1.298
5.593LeuGly: 5.593 ± 1.353
1.119LeuHis: 1.119 ± 1.079
3.356LeuIle: 3.356 ± 1.167
5.593LeuLys: 5.593 ± 1.916
10.067LeuLeu: 10.067 ± 4.451
2.237LeuMet: 2.237 ± 1.817
4.474LeuAsn: 4.474 ± 1.556
5.593LeuPro: 5.593 ± 2.623
5.593LeuGln: 5.593 ± 1.353
5.593LeuArg: 5.593 ± 1.194
4.474LeuSer: 4.474 ± 2.457
7.83LeuThr: 7.83 ± 3.095
6.711LeuVal: 6.711 ± 1.975
0.0LeuTrp: 0.0 ± 0.0
1.119LeuTyr: 1.119 ± 1.384
0.0LeuXaa: 0.0 ± 0.0
Met
1.119MetAla: 1.119 ± 0.857
0.0MetCys: 0.0 ± 0.0
1.119MetAsp: 1.119 ± 0.908
1.119MetGlu: 1.119 ± 0.908
0.0MetPhe: 0.0 ± 0.0
1.119MetGly: 1.119 ± 1.384
2.237MetHis: 2.237 ± 1.525
0.0MetIle: 0.0 ± 0.0
2.237MetLys: 2.237 ± 1.713
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
4.474MetArg: 4.474 ± 2.073
0.0MetSer: 0.0 ± 0.0
1.119MetThr: 1.119 ± 0.857
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
2.237MetTyr: 2.237 ± 1.817
0.0MetXaa: 0.0 ± 0.0
Asn
2.237AsnAla: 2.237 ± 1.206
0.0AsnCys: 0.0 ± 0.0
2.237AsnAsp: 2.237 ± 1.206
1.119AsnGlu: 1.119 ± 0.857
0.0AsnPhe: 0.0 ± 0.0
2.237AsnGly: 2.237 ± 1.525
2.237AsnHis: 2.237 ± 0.721
2.237AsnIle: 2.237 ± 1.817
1.119AsnLys: 1.119 ± 0.857
2.237AsnLeu: 2.237 ± 1.951
0.0AsnMet: 0.0 ± 0.0
1.119AsnAsn: 1.119 ± 0.857
2.237AsnPro: 2.237 ± 1.713
4.474AsnGln: 4.474 ± 2.643
1.119AsnArg: 1.119 ± 0.857
4.474AsnSer: 4.474 ± 1.28
0.0AsnThr: 0.0 ± 0.0
2.237AsnVal: 2.237 ± 0.721
0.0AsnTrp: 0.0 ± 0.0
1.119AsnTyr: 1.119 ± 0.857
0.0AsnXaa: 0.0 ± 0.0
Pro
3.356ProAla: 3.356 ± 2.154
1.119ProCys: 1.119 ± 0.857
5.593ProAsp: 5.593 ± 1.34
0.0ProGlu: 0.0 ± 0.0
4.474ProPhe: 4.474 ± 1.28
3.356ProGly: 3.356 ± 2.57
3.356ProHis: 3.356 ± 1.796
3.356ProIle: 3.356 ± 0.88
1.119ProLys: 1.119 ± 0.857
4.474ProLeu: 4.474 ± 2.89
3.356ProMet: 3.356 ± 2.031
1.119ProAsn: 1.119 ± 0.908
6.711ProPro: 6.711 ± 2.746
4.474ProGln: 4.474 ± 2.245
4.474ProArg: 4.474 ± 1.298
2.237ProSer: 2.237 ± 2.674
4.474ProThr: 4.474 ± 1.295
4.474ProVal: 4.474 ± 3.176
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
5.593GlnAla: 5.593 ± 1.478
1.119GlnCys: 1.119 ± 0.908
0.0GlnAsp: 0.0 ± 0.0
1.119GlnGlu: 1.119 ± 0.857
1.119GlnPhe: 1.119 ± 0.908
4.474GlnGly: 4.474 ± 1.25
1.119GlnHis: 1.119 ± 1.384
1.119GlnIle: 1.119 ± 0.908
3.356GlnLys: 3.356 ± 1.167
5.593GlnLeu: 5.593 ± 1.942
1.119GlnMet: 1.119 ± 1.337
0.0GlnAsn: 0.0 ± 0.0
6.711GlnPro: 6.711 ± 1.575
3.356GlnGln: 3.356 ± 2.154
0.0GlnArg: 0.0 ± 0.0
3.356GlnSer: 3.356 ± 2.007
1.119GlnThr: 1.119 ± 1.384
3.356GlnVal: 3.356 ± 1.842
1.119GlnTrp: 1.119 ± 0.908
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.474ArgAla: 4.474 ± 1.25
2.237ArgCys: 2.237 ± 1.331
6.711ArgAsp: 6.711 ± 2.951
2.237ArgGlu: 2.237 ± 1.713
4.474ArgPhe: 4.474 ± 2.722
6.711ArgGly: 6.711 ± 2.126
0.0ArgHis: 0.0 ± 0.0
3.356ArgIle: 3.356 ± 1.399
4.474ArgLys: 4.474 ± 2.896
5.593ArgLeu: 5.593 ± 3.127
0.0ArgMet: 0.0 ± 0.0
3.356ArgAsn: 3.356 ± 1.842
4.474ArgPro: 4.474 ± 2.245
1.119ArgGln: 1.119 ± 0.908
21.253ArgArg: 21.253 ± 10.604
5.593ArgSer: 5.593 ± 2.645
1.119ArgThr: 1.119 ± 0.908
4.474ArgVal: 4.474 ± 2.578
2.237ArgTrp: 2.237 ± 1.713
1.119ArgTyr: 1.119 ± 0.857
0.0ArgXaa: 0.0 ± 0.0
Ser
4.474SerAla: 4.474 ± 1.575
1.119SerCys: 1.119 ± 1.079
3.356SerAsp: 3.356 ± 2.57
5.593SerGlu: 5.593 ± 1.888
0.0SerPhe: 0.0 ± 0.0
7.83SerGly: 7.83 ± 3.619
2.237SerHis: 2.237 ± 1.951
0.0SerIle: 0.0 ± 0.0
2.237SerLys: 2.237 ± 1.713
7.83SerLeu: 7.83 ± 2.878
0.0SerMet: 0.0 ± 0.0
3.356SerAsn: 3.356 ± 1.827
1.119SerPro: 1.119 ± 0.857
2.237SerGln: 2.237 ± 2.769
6.711SerArg: 6.711 ± 0.911
6.711SerSer: 6.711 ± 5.562
2.237SerThr: 2.237 ± 1.951
2.237SerVal: 2.237 ± 1.951
1.119SerTrp: 1.119 ± 1.384
1.119SerTyr: 1.119 ± 0.908
0.0SerXaa: 0.0 ± 0.0
Thr
2.237ThrAla: 2.237 ± 1.206
0.0ThrCys: 0.0 ± 0.0
1.119ThrAsp: 1.119 ± 0.857
5.593ThrGlu: 5.593 ± 3.002
3.356ThrPhe: 3.356 ± 2.725
7.83ThrGly: 7.83 ± 2.878
1.119ThrHis: 1.119 ± 1.337
2.237ThrIle: 2.237 ± 1.557
1.119ThrLys: 1.119 ± 1.384
8.949ThrLeu: 8.949 ± 1.278
0.0ThrMet: 0.0 ± 0.0
2.237ThrAsn: 2.237 ± 1.206
3.356ThrPro: 3.356 ± 0.88
3.356ThrGln: 3.356 ± 1.753
4.474ThrArg: 4.474 ± 2.457
4.474ThrSer: 4.474 ± 1.943
2.237ThrThr: 2.237 ± 1.817
2.237ThrVal: 2.237 ± 1.817
3.356ThrTrp: 3.356 ± 1.399
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.593ValAla: 5.593 ± 2.623
0.0ValCys: 0.0 ± 0.0
3.356ValAsp: 3.356 ± 2.725
5.593ValGlu: 5.593 ± 1.867
0.0ValPhe: 0.0 ± 0.0
4.474ValGly: 4.474 ± 1.25
1.119ValHis: 1.119 ± 0.857
5.593ValIle: 5.593 ± 3.377
5.593ValLys: 5.593 ± 1.916
2.237ValLeu: 2.237 ± 0.721
0.0ValMet: 0.0 ± 0.0
2.237ValAsn: 2.237 ± 1.557
7.83ValPro: 7.83 ± 3.095
2.237ValGln: 2.237 ± 1.331
0.0ValArg: 0.0 ± 0.0
6.711ValSer: 6.711 ± 2.882
7.83ValThr: 7.83 ± 2.347
5.593ValVal: 5.593 ± 3.191
1.119ValTrp: 1.119 ± 0.857
1.119ValTyr: 1.119 ± 1.079
0.0ValXaa: 0.0 ± 0.0
Trp
1.119TrpAla: 1.119 ± 0.857
1.119TrpCys: 1.119 ± 0.857
2.237TrpAsp: 2.237 ± 0.721
1.119TrpGlu: 1.119 ± 0.857
3.356TrpPhe: 3.356 ± 2.007
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.119TrpIle: 1.119 ± 0.908
1.119TrpLys: 1.119 ± 0.908
3.356TrpLeu: 3.356 ± 2.57
0.0TrpMet: 0.0 ± 0.0
1.119TrpAsn: 1.119 ± 0.857
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
2.237TrpThr: 2.237 ± 1.557
0.0TrpVal: 0.0 ± 0.0
1.119TrpTrp: 1.119 ± 0.857
1.119TrpTyr: 1.119 ± 0.857
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.119TyrAla: 1.119 ± 0.857
1.119TyrCys: 1.119 ± 0.857
2.237TyrAsp: 2.237 ± 0.721
1.119TyrGlu: 1.119 ± 0.908
3.356TyrPhe: 3.356 ± 1.753
3.356TyrGly: 3.356 ± 1.297
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
1.119TyrLys: 1.119 ± 0.857
2.237TyrLeu: 2.237 ± 1.206
0.0TyrMet: 0.0 ± 0.797
0.0TyrAsn: 0.0 ± 0.0
3.356TyrPro: 3.356 ± 2.57
2.237TyrGln: 2.237 ± 1.557
1.119TyrArg: 1.119 ± 1.384
1.119TyrSer: 1.119 ± 0.857
0.0TyrThr: 0.0 ± 0.0
2.237TyrVal: 2.237 ± 0.721
0.0TyrTrp: 0.0 ± 0.0
2.237TyrTyr: 2.237 ± 1.817
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (895 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
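A minimal sketch of protein-level bootstrapping is given below. The note above states only that 100 replicates were drawn at the protein level, so the estimator (standard deviation across replicates), the function names, and the toy sequences are assumptions rather than the procedure actually used.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the standard
    deviation of the dipeptide frequency over the replicates."""
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequency(sample, pair))
    return statistics.pstdev(values)

# Toy sequences (assumption), not the real PiCV proteins.
toy = ["MARRAKR", "RRGGA", "MKLLV", "AARRR", "GGSLL"]
print(f"ArgArg: {pair_frequency(toy, 'RR'):.3f} ± {bootstrap_error(toy, 'RR'):.3f}")
```

Under this scheme, a dipeptide that is absent from every protein yields 0.0 in each replicate and therefore an error of 0.0, while a dipeptide concentrated in a single protein gets a comparatively large error because that protein is sometimes left out of a resample.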

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski