Amino acid dipepetide frequency for Pleurochrysis carterae circular virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.462AlaAla: 6.462 ± 1.214
0.0AlaCys: 0.0 ± 0.0
1.616AlaAsp: 1.616 ± 1.237
4.847AlaGlu: 4.847 ± 1.361
1.616AlaPhe: 1.616 ± 1.15
3.231AlaGly: 3.231 ± 2.474
3.231AlaHis: 3.231 ± 0.607
4.847AlaIle: 4.847 ± 3.45
1.616AlaLys: 1.616 ± 1.15
4.847AlaLeu: 4.847 ± 1.361
3.231AlaMet: 3.231 ± 0.607
3.231AlaAsn: 3.231 ± 2.474
6.462AlaPro: 6.462 ± 2.446
3.231AlaGln: 3.231 ± 0.607
8.078AlaArg: 8.078 ± 3.969
8.078AlaSer: 8.078 ± 3.612
3.231AlaThr: 3.231 ± 2.474
6.462AlaVal: 6.462 ± 5.557
0.0AlaTrp: 0.0 ± 0.0
3.231AlaTyr: 3.231 ± 0.607
0.0AlaXaa: 0.0 ± 0.0
Cys
1.616CysAla: 1.616 ± 1.15
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.616CysGlu: 1.616 ± 4.784
0.0CysPhe: 0.0 ± 0.0
1.616CysGly: 1.616 ± 1.15
0.0CysHis: 0.0 ± 0.0
1.616CysIle: 1.616 ± 1.15
6.462CysLys: 6.462 ± 1.214
1.616CysLeu: 1.616 ± 1.15
0.0CysMet: 0.0 ± 0.0
1.616CysAsn: 1.616 ± 1.15
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.616CysArg: 1.616 ± 1.237
0.0CysSer: 0.0 ± 0.0
1.616CysThr: 1.616 ± 1.237
0.0CysVal: 0.0 ± 0.0
1.616CysTrp: 1.616 ± 1.15
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.616AspAla: 1.616 ± 1.237
0.0AspCys: 0.0 ± 0.0
4.847AspAsp: 4.847 ± 1.361
1.616AspGlu: 1.616 ± 1.15
3.231AspPhe: 3.231 ± 2.3
3.231AspGly: 3.231 ± 2.3
3.231AspHis: 3.231 ± 0.607
3.231AspIle: 3.231 ± 2.3
4.847AspLys: 4.847 ± 1.361
4.847AspLeu: 4.847 ± 1.361
0.0AspMet: 0.0 ± 0.92
3.231AspAsn: 3.231 ± 0.607
3.231AspPro: 3.231 ± 0.607
3.231AspGln: 3.231 ± 2.3
4.847AspArg: 4.847 ± 1.573
1.616AspSer: 1.616 ± 1.15
4.847AspThr: 4.847 ± 1.361
0.0AspVal: 0.0 ± 0.0
0.0AspTrp: 0.0 ± 0.0
1.616AspTyr: 1.616 ± 4.784
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
1.616GluCys: 1.616 ± 1.15
6.462GluAsp: 6.462 ± 2.446
4.847GluGlu: 4.847 ± 3.45
4.847GluPhe: 4.847 ± 1.361
1.616GluGly: 1.616 ± 1.15
0.0GluHis: 0.0 ± 0.0
3.231GluIle: 3.231 ± 2.3
0.0GluLys: 0.0 ± 0.0
3.231GluLeu: 3.231 ± 2.3
3.231GluMet: 3.231 ± 2.3
0.0GluAsn: 0.0 ± 0.0
1.616GluPro: 1.616 ± 1.237
1.616GluGln: 1.616 ± 1.15
4.847GluArg: 4.847 ± 4.195
3.231GluSer: 3.231 ± 0.607
0.0GluThr: 0.0 ± 0.0
1.616GluVal: 1.616 ± 1.15
0.0GluTrp: 0.0 ± 0.0
3.231GluTyr: 3.231 ± 2.474
0.0GluXaa: 0.0 ± 0.0
Phe
3.231PheAla: 3.231 ± 0.607
0.0PheCys: 0.0 ± 0.0
3.231PheAsp: 3.231 ± 2.3
1.616PheGlu: 1.616 ± 1.15
0.0PhePhe: 0.0 ± 0.0
0.0PheGly: 0.0 ± 0.0
1.616PheHis: 1.616 ± 1.237
1.616PheIle: 1.616 ± 1.237
8.078PheLys: 8.078 ± 3.572
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
1.616PhePro: 1.616 ± 1.15
0.0PheGln: 0.0 ± 0.0
1.616PheArg: 1.616 ± 1.15
3.231PheSer: 3.231 ± 2.474
8.078PheThr: 8.078 ± 2.038
6.462PheVal: 6.462 ± 2.446
0.0PheTrp: 0.0 ± 0.0
3.231PheTyr: 3.231 ± 2.3
0.0PheXaa: 0.0 ± 0.0
Gly
8.078GlyAla: 8.078 ± 2.038
0.0GlyCys: 0.0 ± 0.0
4.847GlyAsp: 4.847 ± 3.45
1.616GlyGlu: 1.616 ± 1.15
4.847GlyPhe: 4.847 ± 1.361
3.231GlyGly: 3.231 ± 0.607
1.616GlyHis: 1.616 ± 1.15
1.616GlyIle: 1.616 ± 1.237
6.462GlyLys: 6.462 ± 2.446
0.0GlyLeu: 0.0 ± 0.0
0.0GlyMet: 0.0 ± 0.0
3.231GlyAsn: 3.231 ± 2.474
0.0GlyPro: 0.0 ± 0.0
1.616GlyGln: 1.616 ± 1.237
3.231GlyArg: 3.231 ± 0.607
4.847GlySer: 4.847 ± 1.361
8.078GlyThr: 8.078 ± 1.766
3.231GlyVal: 3.231 ± 2.474
0.0GlyTrp: 0.0 ± 0.0
4.847GlyTyr: 4.847 ± 1.573
0.0GlyXaa: 0.0 ± 0.0
His
1.616HisAla: 1.616 ± 1.15
0.0HisCys: 0.0 ± 0.0
1.616HisAsp: 1.616 ± 1.237
1.616HisGlu: 1.616 ± 1.15
1.616HisPhe: 1.616 ± 1.237
0.0HisGly: 0.0 ± 0.0
1.616HisHis: 1.616 ± 1.15
1.616HisIle: 1.616 ± 1.237
1.616HisLys: 1.616 ± 1.237
3.231HisLeu: 3.231 ± 2.3
0.0HisMet: 0.0 ± 0.0
1.616HisAsn: 1.616 ± 1.237
3.231HisPro: 3.231 ± 2.3
0.0HisGln: 0.0 ± 0.0
3.231HisArg: 3.231 ± 2.3
3.231HisSer: 3.231 ± 2.474
1.616HisThr: 1.616 ± 1.237
1.616HisVal: 1.616 ± 1.237
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.231IleAla: 3.231 ± 0.607
0.0IleCys: 0.0 ± 0.0
1.616IleAsp: 1.616 ± 1.237
1.616IleGlu: 1.616 ± 1.15
0.0IlePhe: 0.0 ± 0.0
1.616IleGly: 1.616 ± 1.237
1.616IleHis: 1.616 ± 1.15
0.0IleIle: 0.0 ± 0.0
3.231IleLys: 3.231 ± 2.3
3.231IleLeu: 3.231 ± 2.3
1.616IleMet: 1.616 ± 1.15
0.0IleAsn: 0.0 ± 0.0
3.231IlePro: 3.231 ± 0.607
1.616IleGln: 1.616 ± 1.237
1.616IleArg: 1.616 ± 1.15
1.616IleSer: 1.616 ± 1.237
6.462IleThr: 6.462 ± 4.947
3.231IleVal: 3.231 ± 0.607
3.231IleTrp: 3.231 ± 2.3
3.231IleTyr: 3.231 ± 0.607
0.0IleXaa: 0.0 ± 0.0
Lys
6.462LysAla: 6.462 ± 1.214
4.847LysCys: 4.847 ± 3.45
4.847LysAsp: 4.847 ± 1.361
3.231LysGlu: 3.231 ± 2.3
3.231LysPhe: 3.231 ± 0.607
4.847LysGly: 4.847 ± 1.361
3.231LysHis: 3.231 ± 0.607
1.616LysIle: 1.616 ± 1.15
9.693LysLys: 9.693 ± 6.107
4.847LysLeu: 4.847 ± 1.573
4.847LysMet: 4.847 ± 2.787
3.231LysAsn: 3.231 ± 2.3
1.616LysPro: 1.616 ± 1.15
1.616LysGln: 1.616 ± 1.237
4.847LysArg: 4.847 ± 3.71
6.462LysSer: 6.462 ± 4.217
3.231LysThr: 3.231 ± 0.607
1.616LysVal: 1.616 ± 1.237
1.616LysTrp: 1.616 ± 1.15
4.847LysTyr: 4.847 ± 1.573
0.0LysXaa: 0.0 ± 0.0
Leu
6.462LeuAla: 6.462 ± 2.764
0.0LeuCys: 0.0 ± 0.0
4.847LeuAsp: 4.847 ± 3.45
3.231LeuGlu: 3.231 ± 2.3
0.0LeuPhe: 0.0 ± 0.0
3.231LeuGly: 3.231 ± 0.607
1.616LeuHis: 1.616 ± 1.237
3.231LeuIle: 3.231 ± 0.607
11.309LeuLys: 11.309 ± 8.403
1.616LeuLeu: 1.616 ± 1.15
0.0LeuMet: 0.0 ± 0.0
1.616LeuAsn: 1.616 ± 1.15
1.616LeuPro: 1.616 ± 1.237
1.616LeuGln: 1.616 ± 1.15
1.616LeuArg: 1.616 ± 1.15
8.078LeuSer: 8.078 ± 8.88
1.616LeuThr: 1.616 ± 1.15
3.231LeuVal: 3.231 ± 0.607
3.231LeuTrp: 3.231 ± 4.525
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.231MetAla: 3.231 ± 4.743
1.616MetCys: 1.616 ± 1.15
1.616MetAsp: 1.616 ± 1.15
1.616MetGlu: 1.616 ± 1.15
3.231MetPhe: 3.231 ± 0.607
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
6.462MetLeu: 6.462 ± 3.918
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
1.616MetGln: 1.616 ± 1.15
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
1.616MetThr: 1.616 ± 1.15
4.847MetVal: 4.847 ± 1.573
0.0MetTrp: 0.0 ± 0.0
1.616MetTyr: 1.616 ± 1.237
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
0.0AsnAsp: 0.0 ± 0.0
1.616AsnGlu: 1.616 ± 1.237
3.231AsnPhe: 3.231 ± 0.607
0.0AsnGly: 0.0 ± 0.0
0.0AsnHis: 0.0 ± 0.0
1.616AsnIle: 1.616 ± 1.237
1.616AsnLys: 1.616 ± 1.15
3.231AsnLeu: 3.231 ± 2.474
3.231AsnMet: 3.231 ± 2.159
1.616AsnAsn: 1.616 ± 1.15
1.616AsnPro: 1.616 ± 1.237
1.616AsnGln: 1.616 ± 1.15
0.0AsnArg: 0.0 ± 0.0
1.616AsnSer: 1.616 ± 1.237
6.462AsnThr: 6.462 ± 2.446
1.616AsnVal: 1.616 ± 1.15
1.616AsnTrp: 1.616 ± 1.15
1.616AsnTyr: 1.616 ± 1.15
0.0AsnXaa: 0.0 ± 0.0
Pro
4.847ProAla: 4.847 ± 3.45
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
3.231ProGlu: 3.231 ± 2.474
3.231ProPhe: 3.231 ± 0.607
6.462ProGly: 6.462 ± 1.214
1.616ProHis: 1.616 ± 1.15
1.616ProIle: 1.616 ± 1.15
4.847ProLys: 4.847 ± 1.573
1.616ProLeu: 1.616 ± 1.237
1.616ProMet: 1.616 ± 1.15
1.616ProAsn: 1.616 ± 1.237
3.231ProPro: 3.231 ± 4.525
0.0ProGln: 0.0 ± 0.0
4.847ProArg: 4.847 ± 3.71
1.616ProSer: 1.616 ± 1.15
6.462ProThr: 6.462 ± 2.446
3.231ProVal: 3.231 ± 4.743
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.231GlnAla: 3.231 ± 2.3
3.231GlnCys: 3.231 ± 0.607
1.616GlnAsp: 1.616 ± 1.15
4.847GlnGlu: 4.847 ± 3.45
1.616GlnPhe: 1.616 ± 1.15
1.616GlnGly: 1.616 ± 1.15
0.0GlnHis: 0.0 ± 0.0
1.616GlnIle: 1.616 ± 1.237
1.616GlnLys: 1.616 ± 1.237
0.0GlnLeu: 0.0 ± 0.0
1.616GlnMet: 1.616 ± 1.15
3.231GlnAsn: 3.231 ± 2.3
0.0GlnPro: 0.0 ± 0.0
0.0GlnGln: 0.0 ± 0.0
3.231GlnArg: 3.231 ± 2.3
1.616GlnSer: 1.616 ± 1.237
4.847GlnThr: 4.847 ± 4.195
3.231GlnVal: 3.231 ± 4.743
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.231ArgAla: 3.231 ± 4.525
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
1.616ArgGlu: 1.616 ± 1.15
4.847ArgPhe: 4.847 ± 1.361
3.231ArgGly: 3.231 ± 2.3
1.616ArgHis: 1.616 ± 1.15
3.231ArgIle: 3.231 ± 2.474
3.231ArgLys: 3.231 ± 2.474
4.847ArgLeu: 4.847 ± 5.016
3.231ArgMet: 3.231 ± 4.743
1.616ArgAsn: 1.616 ± 1.15
11.309ArgPro: 11.309 ± 4.198
1.616ArgGln: 1.616 ± 1.15
9.693ArgArg: 9.693 ± 5.21
8.078ArgSer: 8.078 ± 3.983
0.0ArgThr: 0.0 ± 0.0
1.616ArgVal: 1.616 ± 1.237
0.0ArgTrp: 0.0 ± 0.0
4.847ArgTyr: 4.847 ± 1.573
0.0ArgXaa: 0.0 ± 0.0
Ser
4.847SerAla: 4.847 ± 4.195
1.616SerCys: 1.616 ± 1.237
9.693SerAsp: 9.693 ± 4.335
1.616SerGlu: 1.616 ± 1.237
1.616SerPhe: 1.616 ± 1.237
3.231SerGly: 3.231 ± 2.474
1.616SerHis: 1.616 ± 1.237
4.847SerIle: 4.847 ± 1.573
8.078SerLys: 8.078 ± 3.983
1.616SerLeu: 1.616 ± 4.784
1.616SerMet: 1.616 ± 1.15
1.616SerAsn: 1.616 ± 1.15
3.231SerPro: 3.231 ± 2.474
4.847SerGln: 4.847 ± 3.45
4.847SerArg: 4.847 ± 5.016
6.462SerSer: 6.462 ± 3.918
8.078SerThr: 8.078 ± 4.585
4.847SerVal: 4.847 ± 1.573
0.0SerTrp: 0.0 ± 0.0
3.231SerTyr: 3.231 ± 2.474
0.0SerXaa: 0.0 ± 0.0
Thr
9.693ThrAla: 9.693 ± 1.821
1.616ThrCys: 1.616 ± 1.237
4.847ThrAsp: 4.847 ± 1.573
1.616ThrGlu: 1.616 ± 1.237
1.616ThrPhe: 1.616 ± 1.237
11.309ThrGly: 11.309 ± 2.564
0.0ThrHis: 0.0 ± 0.0
1.616ThrIle: 1.616 ± 1.237
4.847ThrLys: 4.847 ± 1.361
6.462ThrLeu: 6.462 ± 2.446
1.616ThrMet: 1.616 ± 1.237
3.231ThrAsn: 3.231 ± 2.474
3.231ThrPro: 3.231 ± 2.3
11.309ThrGln: 11.309 ± 3.395
3.231ThrArg: 3.231 ± 2.474
3.231ThrSer: 3.231 ± 4.525
4.847ThrThr: 4.847 ± 1.361
1.616ThrVal: 1.616 ± 1.237
1.616ThrTrp: 1.616 ± 1.237
3.231ThrTyr: 3.231 ± 2.474
0.0ThrXaa: 0.0 ± 0.0
Val
4.847ValAla: 4.847 ± 1.573
3.231ValCys: 3.231 ± 4.525
0.0ValAsp: 0.0 ± 0.0
1.616ValGlu: 1.616 ± 1.15
3.231ValPhe: 3.231 ± 2.3
3.231ValGly: 3.231 ± 0.607
4.847ValHis: 4.847 ± 3.71
4.847ValIle: 4.847 ± 1.361
0.0ValLys: 0.0 ± 0.0
1.616ValLeu: 1.616 ± 4.784
0.0ValMet: 0.0 ± 0.0
0.0ValAsn: 0.0 ± 0.0
3.231ValPro: 3.231 ± 2.474
0.0ValGln: 0.0 ± 0.0
4.847ValArg: 4.847 ± 5.016
8.078ValSer: 8.078 ± 6.184
8.078ValThr: 8.078 ± 3.983
4.847ValVal: 4.847 ± 1.573
0.0ValTrp: 0.0 ± 0.0
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.616TrpAla: 1.616 ± 1.15
0.0TrpCys: 0.0 ± 0.0
1.616TrpAsp: 1.616 ± 1.15
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
3.231TrpGly: 3.231 ± 2.3
1.616TrpHis: 1.616 ± 1.15
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.616TrpGln: 1.616 ± 4.784
0.0TrpArg: 0.0 ± 0.0
1.616TrpSer: 1.616 ± 1.15
0.0TrpThr: 0.0 ± 0.0
1.616TrpVal: 1.616 ± 1.237
1.616TrpTrp: 1.616 ± 1.15
1.616TrpTyr: 1.616 ± 1.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.231TyrAla: 3.231 ± 2.474
3.231TyrCys: 3.231 ± 0.607
1.616TyrAsp: 1.616 ± 1.237
1.616TyrGlu: 1.616 ± 1.237
1.616TyrPhe: 1.616 ± 1.15
6.462TyrGly: 6.462 ± 2.764
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
3.231TyrLys: 3.231 ± 2.474
4.847TyrLeu: 4.847 ± 4.551
0.0TyrMet: 0.0 ± 0.0
1.616TyrAsn: 1.616 ± 1.237
1.616TyrPro: 1.616 ± 1.15
0.0TyrGln: 0.0 ± 0.0
1.616TyrArg: 1.616 ± 1.237
4.847TyrSer: 4.847 ± 1.361
1.616TyrThr: 1.616 ± 1.237
1.616TyrVal: 1.616 ± 1.237
1.616TyrTrp: 1.616 ± 1.15
1.616TyrTyr: 1.616 ± 1.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (620 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski