Amino acid dipepetide frequency for Seal anellovirus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.61AlaAla: 5.61 ± 1.374
1.403AlaCys: 1.403 ± 0.676
5.61AlaAsp: 5.61 ± 3.656
1.403AlaGlu: 1.403 ± 0.676
1.403AlaPhe: 1.403 ± 0.676
7.013AlaGly: 7.013 ± 3.386
1.403AlaHis: 1.403 ± 0.676
4.208AlaIle: 4.208 ± 2.485
0.0AlaLys: 0.0 ± 0.0
1.403AlaLeu: 1.403 ± 2.231
0.0AlaMet: 0.0 ± 0.0
4.208AlaAsn: 4.208 ± 1.618
5.61AlaPro: 5.61 ± 2.172
5.61AlaGln: 5.61 ± 1.416
1.403AlaArg: 1.403 ± 0.676
5.61AlaSer: 5.61 ± 1.416
1.403AlaThr: 1.403 ± 0.676
2.805AlaVal: 2.805 ± 1.351
0.0AlaTrp: 0.0 ± 0.0
1.403AlaTyr: 1.403 ± 0.676
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.403CysCys: 1.403 ± 0.676
0.0CysAsp: 0.0 ± 0.0
1.403CysGlu: 1.403 ± 0.676
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
2.805CysHis: 2.805 ± 4.462
1.403CysIle: 1.403 ± 0.676
0.0CysLys: 0.0 ± 0.0
1.403CysLeu: 1.403 ± 0.676
0.0CysMet: 0.0 ± 0.0
1.403CysAsn: 1.403 ± 0.676
0.0CysPro: 0.0 ± 0.0
1.403CysGln: 1.403 ± 0.676
0.0CysArg: 0.0 ± 0.0
1.403CysSer: 1.403 ± 2.231
1.403CysThr: 1.403 ± 0.676
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
2.805CysTyr: 2.805 ± 1.351
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
0.0AspAsp: 0.0 ± 0.0
5.61AspGlu: 5.61 ± 1.374
2.805AspPhe: 2.805 ± 1.086
4.208AspGly: 4.208 ± 1.618
1.403AspHis: 1.403 ± 2.231
2.805AspIle: 2.805 ± 1.828
1.403AspLys: 1.403 ± 0.676
9.818AspLeu: 9.818 ± 3.371
0.0AspMet: 0.0 ± 0.0
1.403AspAsn: 1.403 ± 2.231
7.013AspPro: 7.013 ± 1.978
1.403AspGln: 1.403 ± 0.676
0.0AspArg: 0.0 ± 0.0
5.61AspSer: 5.61 ± 1.416
4.208AspThr: 4.208 ± 2.027
4.208AspVal: 4.208 ± 2.485
0.0AspTrp: 0.0 ± 0.0
4.208AspTyr: 4.208 ± 1.066
0.0AspXaa: 0.0 ± 0.0
Glu
4.208GluAla: 4.208 ± 1.618
0.0GluCys: 0.0 ± 0.0
4.208GluAsp: 4.208 ± 2.485
8.415GluGlu: 8.415 ± 5.629
1.403GluPhe: 1.403 ± 2.231
7.013GluGly: 7.013 ± 0.815
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
0.0GluLys: 0.0 ± 0.0
4.208GluLeu: 4.208 ± 3.665
1.403GluMet: 1.403 ± 0.726
2.805GluAsn: 2.805 ± 1.086
1.403GluPro: 1.403 ± 0.676
1.403GluGln: 1.403 ± 0.676
0.0GluArg: 0.0 ± 0.0
8.415GluSer: 8.415 ± 2.131
1.403GluThr: 1.403 ± 1.461
1.403GluVal: 1.403 ± 0.676
0.0GluTrp: 0.0 ± 0.0
2.805GluTyr: 2.805 ± 1.086
0.0GluXaa: 0.0 ± 0.0
Phe
1.403PheAla: 1.403 ± 1.461
2.805PheCys: 2.805 ± 1.828
1.403PheAsp: 1.403 ± 0.676
0.0PheGlu: 0.0 ± 0.0
2.805PhePhe: 2.805 ± 1.351
0.0PheGly: 0.0 ± 0.0
2.805PheHis: 2.805 ± 1.351
0.0PheIle: 0.0 ± 0.0
5.61PheLys: 5.61 ± 1.676
5.61PheLeu: 5.61 ± 2.996
1.403PheMet: 1.403 ± 0.676
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
2.805PheArg: 2.805 ± 1.351
2.805PheSer: 2.805 ± 1.086
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
2.805PheTrp: 2.805 ± 1.351
2.805PheTyr: 2.805 ± 1.351
0.0PheXaa: 0.0 ± 0.0
Gly
7.013GlyAla: 7.013 ± 1.978
1.403GlyCys: 1.403 ± 0.676
7.013GlyAsp: 7.013 ± 3.386
2.805GlyGlu: 2.805 ± 2.659
1.403GlyPhe: 1.403 ± 1.461
11.22GlyGly: 11.22 ± 10.219
2.805GlyHis: 2.805 ± 1.828
2.805GlyIle: 2.805 ± 1.351
0.0GlyLys: 0.0 ± 0.0
4.208GlyLeu: 4.208 ± 2.027
0.0GlyMet: 0.0 ± 0.0
2.805GlyAsn: 2.805 ± 1.351
4.208GlyPro: 4.208 ± 2.485
2.805GlyGln: 2.805 ± 1.351
2.805GlyArg: 2.805 ± 1.828
5.61GlySer: 5.61 ± 4.101
9.818GlyThr: 9.818 ± 6.058
4.208GlyVal: 4.208 ± 2.027
2.805GlyTrp: 2.805 ± 1.351
4.208GlyTyr: 4.208 ± 1.066
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.403HisCys: 1.403 ± 2.231
1.403HisAsp: 1.403 ± 0.676
1.403HisGlu: 1.403 ± 0.676
2.805HisPhe: 2.805 ± 2.659
1.403HisGly: 1.403 ± 0.676
2.805HisHis: 2.805 ± 1.351
1.403HisIle: 1.403 ± 0.676
2.805HisLys: 2.805 ± 1.828
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.403HisPro: 1.403 ± 2.231
1.403HisGln: 1.403 ± 1.461
4.208HisArg: 4.208 ± 2.027
2.805HisSer: 2.805 ± 2.659
4.208HisThr: 4.208 ± 1.618
0.0HisVal: 0.0 ± 0.0
2.805HisTrp: 2.805 ± 1.351
2.805HisTyr: 2.805 ± 1.351
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.0IleCys: 0.0 ± 0.0
0.0IleAsp: 0.0 ± 0.0
1.403IleGlu: 1.403 ± 1.461
0.0IlePhe: 0.0 ± 0.0
1.403IleGly: 1.403 ± 2.231
2.805IleHis: 2.805 ± 1.351
0.0IleIle: 0.0 ± 0.0
1.403IleLys: 1.403 ± 0.676
4.208IleLeu: 4.208 ± 2.027
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
4.208IlePro: 4.208 ± 1.618
4.208IleGln: 4.208 ± 1.066
1.403IleArg: 1.403 ± 0.676
2.805IleSer: 2.805 ± 1.351
1.403IleThr: 1.403 ± 0.676
1.403IleVal: 1.403 ± 1.461
2.805IleTrp: 2.805 ± 1.351
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.208LysAla: 4.208 ± 1.066
0.0LysCys: 0.0 ± 0.0
0.0LysAsp: 0.0 ± 0.0
2.805LysGlu: 2.805 ± 1.828
1.403LysPhe: 1.403 ± 0.676
7.013LysGly: 7.013 ± 1.946
1.403LysHis: 1.403 ± 1.461
0.0LysIle: 0.0 ± 0.0
4.208LysLys: 4.208 ± 2.027
7.013LysLeu: 7.013 ± 3.549
0.0LysMet: 0.0 ± 0.0
2.805LysAsn: 2.805 ± 1.351
0.0LysPro: 0.0 ± 0.0
0.0LysGln: 0.0 ± 0.0
7.013LysArg: 7.013 ± 0.815
2.805LysSer: 2.805 ± 1.086
4.208LysThr: 4.208 ± 1.066
1.403LysVal: 1.403 ± 0.676
1.403LysTrp: 1.403 ± 2.231
4.208LysTyr: 4.208 ± 1.066
0.0LysXaa: 0.0 ± 0.0
Leu
4.208LeuAla: 4.208 ± 1.066
4.208LeuCys: 4.208 ± 1.618
7.013LeuAsp: 7.013 ± 2.329
5.61LeuGlu: 5.61 ± 2.172
2.805LeuPhe: 2.805 ± 1.086
5.61LeuGly: 5.61 ± 2.172
1.403LeuHis: 1.403 ± 2.231
1.403LeuIle: 1.403 ± 1.461
7.013LeuLys: 7.013 ± 3.546
7.013LeuLeu: 7.013 ± 0.815
0.0LeuMet: 0.0 ± 0.0
0.0LeuAsn: 0.0 ± 0.0
5.61LeuPro: 5.61 ± 2.702
7.013LeuGln: 7.013 ± 3.549
2.805LeuArg: 2.805 ± 1.086
8.415LeuSer: 8.415 ± 1.67
4.208LeuThr: 4.208 ± 1.066
4.208LeuVal: 4.208 ± 1.066
0.0LeuTrp: 0.0 ± 0.0
2.805LeuTyr: 2.805 ± 1.828
0.0LeuXaa: 0.0 ± 0.0
Met
1.403MetAla: 1.403 ± 0.676
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
2.805MetSer: 2.805 ± 1.828
1.403MetThr: 1.403 ± 1.461
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.403AsnCys: 1.403 ± 0.676
1.403AsnAsp: 1.403 ± 0.676
1.403AsnGlu: 1.403 ± 0.676
1.403AsnPhe: 1.403 ± 2.231
0.0AsnGly: 0.0 ± 0.0
1.403AsnHis: 1.403 ± 2.231
1.403AsnIle: 1.403 ± 0.676
1.403AsnLys: 1.403 ± 0.676
2.805AsnLeu: 2.805 ± 1.086
0.0AsnMet: 0.0 ± 0.0
2.805AsnAsn: 2.805 ± 1.351
4.208AsnPro: 4.208 ± 1.066
1.403AsnGln: 1.403 ± 0.676
2.805AsnArg: 2.805 ± 1.351
1.403AsnSer: 1.403 ± 0.676
1.403AsnThr: 1.403 ± 0.676
0.0AsnVal: 0.0 ± 0.0
4.208AsnTrp: 4.208 ± 2.005
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.805ProAla: 2.805 ± 1.351
0.0ProCys: 0.0 ± 0.0
2.805ProAsp: 2.805 ± 1.828
2.805ProGlu: 2.805 ± 1.086
4.208ProPhe: 4.208 ± 2.027
9.818ProGly: 9.818 ± 3.031
1.403ProHis: 1.403 ± 1.461
0.0ProIle: 0.0 ± 0.0
5.61ProLys: 5.61 ± 1.374
5.61ProLeu: 5.61 ± 1.416
0.0ProMet: 0.0 ± 0.0
1.403ProAsn: 1.403 ± 1.461
5.61ProPro: 5.61 ± 3.929
0.0ProGln: 0.0 ± 0.0
7.013ProArg: 7.013 ± 3.378
2.805ProSer: 2.805 ± 1.086
7.013ProThr: 7.013 ± 3.549
0.0ProVal: 0.0 ± 0.0
2.805ProTrp: 2.805 ± 1.351
5.61ProTyr: 5.61 ± 1.416
0.0ProXaa: 0.0 ± 0.0
Gln
4.208GlnAla: 4.208 ± 1.066
0.0GlnCys: 0.0 ± 0.0
8.415GlnAsp: 8.415 ± 0.595
2.805GlnGlu: 2.805 ± 2.659
2.805GlnPhe: 2.805 ± 1.351
1.403GlnGly: 1.403 ± 0.676
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
2.805GlnLys: 2.805 ± 1.351
1.403GlnLeu: 1.403 ± 0.676
1.403GlnMet: 1.403 ± 1.889
1.403GlnAsn: 1.403 ± 0.676
2.805GlnPro: 2.805 ± 1.086
0.0GlnGln: 0.0 ± 0.0
1.403GlnArg: 1.403 ± 0.676
0.0GlnSer: 0.0 ± 0.0
5.61GlnThr: 5.61 ± 2.172
4.208GlnVal: 4.208 ± 1.618
0.0GlnTrp: 0.0 ± 0.0
1.403GlnTyr: 1.403 ± 0.676
0.0GlnXaa: 0.0 ± 0.0
Arg
4.208ArgAla: 4.208 ± 1.618
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
0.0ArgGlu: 0.0 ± 0.0
1.403ArgPhe: 1.403 ± 0.676
2.805ArgGly: 2.805 ± 1.351
4.208ArgHis: 4.208 ± 2.027
4.208ArgIle: 4.208 ± 2.027
2.805ArgLys: 2.805 ± 1.086
8.415ArgLeu: 8.415 ± 2.546
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
5.61ArgPro: 5.61 ± 2.702
4.208ArgGln: 4.208 ± 1.618
26.648ArgArg: 26.648 ± 11.161
4.208ArgSer: 4.208 ± 2.485
4.208ArgThr: 4.208 ± 1.066
4.208ArgVal: 4.208 ± 1.066
2.805ArgTrp: 2.805 ± 1.351
5.61ArgTyr: 5.61 ± 2.702
0.0ArgXaa: 0.0 ± 0.0
Ser
5.61SerAla: 5.61 ± 4.101
0.0SerCys: 0.0 ± 0.0
8.415SerAsp: 8.415 ± 3.258
2.805SerGlu: 2.805 ± 2.923
1.403SerPhe: 1.403 ± 0.676
8.415SerGly: 8.415 ± 0.595
4.208SerHis: 4.208 ± 1.066
1.403SerIle: 1.403 ± 0.676
1.403SerLys: 1.403 ± 1.461
5.61SerLeu: 5.61 ± 2.172
0.0SerMet: 0.0 ± 0.0
4.208SerAsn: 4.208 ± 2.005
2.805SerPro: 2.805 ± 1.828
2.805SerGln: 2.805 ± 1.086
5.61SerArg: 5.61 ± 2.172
7.013SerSer: 7.013 ± 3.378
12.623SerThr: 12.623 ± 3.534
2.805SerVal: 2.805 ± 1.351
1.403SerTrp: 1.403 ± 0.676
2.805SerTyr: 2.805 ± 1.086
0.0SerXaa: 0.0 ± 0.0
Thr
7.013ThrAla: 7.013 ± 0.815
1.403ThrCys: 1.403 ± 0.676
2.805ThrAsp: 2.805 ± 1.828
4.208ThrGlu: 4.208 ± 1.066
1.403ThrPhe: 1.403 ± 0.676
9.818ThrGly: 9.818 ± 7.658
0.0ThrHis: 0.0 ± 0.0
4.208ThrIle: 4.208 ± 1.618
5.61ThrLys: 5.61 ± 2.172
5.61ThrLeu: 5.61 ± 3.929
0.0ThrMet: 0.0 ± 0.0
2.805ThrAsn: 2.805 ± 1.351
7.013ThrPro: 7.013 ± 3.546
2.805ThrGln: 2.805 ± 2.659
1.403ThrArg: 1.403 ± 0.676
1.403ThrSer: 1.403 ± 1.461
2.805ThrThr: 2.805 ± 2.659
5.61ThrVal: 5.61 ± 2.702
4.208ThrTrp: 4.208 ± 2.027
2.805ThrTyr: 2.805 ± 1.086
0.0ThrXaa: 0.0 ± 0.0
Val
1.403ValAla: 1.403 ± 0.676
0.0ValCys: 0.0 ± 0.0
2.805ValAsp: 2.805 ± 1.351
0.0ValGlu: 0.0 ± 0.0
0.0ValPhe: 0.0 ± 0.0
0.0ValGly: 0.0 ± 0.0
2.805ValHis: 2.805 ± 1.351
0.0ValIle: 0.0 ± 0.0
1.403ValLys: 1.403 ± 0.676
1.403ValLeu: 1.403 ± 1.461
0.0ValMet: 0.0 ± 0.0
1.403ValAsn: 1.403 ± 0.676
2.805ValPro: 2.805 ± 1.086
2.805ValGln: 2.805 ± 1.351
4.208ValArg: 4.208 ± 2.027
11.22ValSer: 11.22 ± 2.46
4.208ValThr: 4.208 ± 2.027
0.0ValVal: 0.0 ± 0.0
1.403ValTrp: 1.403 ± 0.676
2.805ValTyr: 2.805 ± 1.351
0.0ValXaa: 0.0 ± 0.0
Trp
2.805TrpAla: 2.805 ± 1.351
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
2.805TrpGlu: 2.805 ± 1.351
2.805TrpPhe: 2.805 ± 1.351
0.0TrpGly: 0.0 ± 0.0
1.403TrpHis: 1.403 ± 0.676
0.0TrpIle: 0.0 ± 0.0
4.208TrpLys: 4.208 ± 1.618
1.403TrpLeu: 1.403 ± 0.676
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.403TrpPro: 1.403 ± 2.231
2.805TrpGln: 2.805 ± 1.086
4.208TrpArg: 4.208 ± 2.027
2.805TrpSer: 2.805 ± 1.086
0.0TrpThr: 0.0 ± 0.0
4.208TrpVal: 4.208 ± 2.027
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.403TyrAla: 1.403 ± 0.676
1.403TyrCys: 1.403 ± 0.676
2.805TyrAsp: 2.805 ± 1.351
2.805TyrGlu: 2.805 ± 1.351
2.805TyrPhe: 2.805 ± 1.351
2.805TyrGly: 2.805 ± 1.351
0.0TyrHis: 0.0 ± 0.0
4.208TyrIle: 4.208 ± 2.027
4.208TyrLys: 4.208 ± 1.618
4.208TyrLeu: 4.208 ± 2.485
0.0TyrMet: 0.0 ± 0.0
1.403TyrAsn: 1.403 ± 0.676
5.61TyrPro: 5.61 ± 1.416
1.403TyrGln: 1.403 ± 0.676
9.818TyrArg: 9.818 ± 2.413
1.403TyrSer: 1.403 ± 1.461
1.403TyrThr: 1.403 ± 0.676
0.0TyrVal: 0.0 ± 0.0
1.403TyrTrp: 1.403 ± 1.461
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (714 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski