Amino acid dipepetide frequency for Heterosigma akashiwo RNA virus (strain SOG263) (HaRNAV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.364AlaAla: 7.364 ± 0.0
1.163AlaCys: 1.163 ± 0.0
3.101AlaAsp: 3.101 ± 0.0
5.814AlaGlu: 5.814 ± 0.0
3.101AlaPhe: 3.101 ± 0.0
4.651AlaGly: 4.651 ± 0.0
0.775AlaHis: 0.775 ± 0.0
1.163AlaIle: 1.163 ± 0.0
3.488AlaLys: 3.488 ± 0.0
3.876AlaLeu: 3.876 ± 0.0
1.163AlaMet: 1.163 ± 0.0
3.101AlaAsn: 3.101 ± 0.0
4.651AlaPro: 4.651 ± 0.0
1.938AlaGln: 1.938 ± 0.0
6.977AlaArg: 6.977 ± 0.0
7.752AlaSer: 7.752 ± 0.0
2.713AlaThr: 2.713 ± 0.0
5.814AlaVal: 5.814 ± 0.0
1.938AlaTrp: 1.938 ± 0.0
1.163AlaTyr: 1.163 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.775CysAla: 0.775 ± 0.0
0.388CysCys: 0.388 ± 0.0
0.388CysAsp: 0.388 ± 0.0
1.55CysGlu: 1.55 ± 0.0
0.775CysPhe: 0.775 ± 0.0
0.775CysGly: 0.775 ± 0.0
1.163CysHis: 1.163 ± 0.0
0.388CysIle: 0.388 ± 0.0
1.163CysLys: 1.163 ± 0.0
2.326CysLeu: 2.326 ± 0.0
0.388CysMet: 0.388 ± 0.0
1.163CysAsn: 1.163 ± 0.0
1.938CysPro: 1.938 ± 0.0
0.388CysGln: 0.388 ± 0.0
0.388CysArg: 0.388 ± 0.0
1.163CysSer: 1.163 ± 0.0
2.326CysThr: 2.326 ± 0.0
1.938CysVal: 1.938 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.264AspAla: 4.264 ± 0.0
0.388AspCys: 0.388 ± 0.0
4.264AspAsp: 4.264 ± 0.0
5.814AspGlu: 5.814 ± 0.0
3.488AspPhe: 3.488 ± 0.0
6.589AspGly: 6.589 ± 0.0
0.388AspHis: 0.388 ± 0.0
1.55AspIle: 1.55 ± 0.0
1.55AspLys: 1.55 ± 0.0
4.651AspLeu: 4.651 ± 0.0
1.938AspMet: 1.938 ± 0.0
1.938AspAsn: 1.938 ± 0.0
4.264AspPro: 4.264 ± 0.0
1.163AspGln: 1.163 ± 0.0
2.326AspArg: 2.326 ± 0.0
5.426AspSer: 5.426 ± 0.0
3.101AspThr: 3.101 ± 0.0
4.651AspVal: 4.651 ± 0.0
0.775AspTrp: 0.775 ± 0.0
1.938AspTyr: 1.938 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.938GluAla: 1.938 ± 0.0
0.0GluCys: 0.0 ± 0.0
4.264GluAsp: 4.264 ± 0.0
2.713GluGlu: 2.713 ± 0.0
1.163GluPhe: 1.163 ± 0.0
4.651GluGly: 4.651 ± 0.0
2.326GluHis: 2.326 ± 0.0
4.264GluIle: 4.264 ± 0.0
3.876GluLys: 3.876 ± 0.0
6.202GluLeu: 6.202 ± 0.0
1.938GluMet: 1.938 ± 0.0
2.713GluAsn: 2.713 ± 0.0
1.163GluPro: 1.163 ± 0.0
2.713GluGln: 2.713 ± 0.0
2.713GluArg: 2.713 ± 0.0
2.713GluSer: 2.713 ± 0.0
5.039GluThr: 5.039 ± 0.0
3.876GluVal: 3.876 ± 0.0
0.388GluTrp: 0.388 ± 0.0
1.55GluTyr: 1.55 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.264PheAla: 4.264 ± 0.0
1.163PheCys: 1.163 ± 0.0
1.938PheAsp: 1.938 ± 0.0
2.326PheGlu: 2.326 ± 0.0
2.713PhePhe: 2.713 ± 0.0
3.488PheGly: 3.488 ± 0.0
1.938PheHis: 1.938 ± 0.0
2.713PheIle: 2.713 ± 0.0
0.388PheLys: 0.388 ± 0.0
3.876PheLeu: 3.876 ± 0.0
1.55PheMet: 1.55 ± 0.0
0.775PheAsn: 0.775 ± 0.0
1.55PhePro: 1.55 ± 0.0
1.55PheGln: 1.55 ± 0.0
2.326PheArg: 2.326 ± 0.0
3.876PheSer: 3.876 ± 0.0
2.713PheThr: 2.713 ± 0.0
4.264PheVal: 4.264 ± 0.0
0.388PheTrp: 0.388 ± 0.0
1.55PheTyr: 1.55 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.589GlyAla: 6.589 ± 0.0
2.713GlyCys: 2.713 ± 0.0
5.426GlyAsp: 5.426 ± 0.0
5.814GlyGlu: 5.814 ± 0.0
1.938GlyPhe: 1.938 ± 0.0
3.488GlyGly: 3.488 ± 0.0
1.55GlyHis: 1.55 ± 0.0
3.876GlyIle: 3.876 ± 0.0
4.651GlyLys: 4.651 ± 0.0
4.651GlyLeu: 4.651 ± 0.0
1.55GlyMet: 1.55 ± 0.0
3.101GlyAsn: 3.101 ± 0.0
3.101GlyPro: 3.101 ± 0.0
2.713GlyGln: 2.713 ± 0.0
1.55GlyArg: 1.55 ± 0.0
4.264GlySer: 4.264 ± 0.0
6.589GlyThr: 6.589 ± 0.0
4.651GlyVal: 4.651 ± 0.0
0.388GlyTrp: 0.388 ± 0.0
1.938GlyTyr: 1.938 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.326HisAla: 2.326 ± 0.0
0.388HisCys: 0.388 ± 0.0
0.775HisAsp: 0.775 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.163HisPhe: 1.163 ± 0.0
0.775HisGly: 0.775 ± 0.0
0.388HisHis: 0.388 ± 0.0
1.163HisIle: 1.163 ± 0.0
1.163HisLys: 1.163 ± 0.0
2.326HisLeu: 2.326 ± 0.0
1.55HisMet: 1.55 ± 0.0
0.775HisAsn: 0.775 ± 0.0
3.101HisPro: 3.101 ± 0.0
1.163HisGln: 1.163 ± 0.0
1.55HisArg: 1.55 ± 0.0
0.775HisSer: 0.775 ± 0.0
1.163HisThr: 1.163 ± 0.0
1.938HisVal: 1.938 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.163HisTyr: 1.163 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.101IleAla: 3.101 ± 0.0
0.388IleCys: 0.388 ± 0.0
4.264IleAsp: 4.264 ± 0.0
1.163IleGlu: 1.163 ± 0.0
0.388IlePhe: 0.388 ± 0.0
5.426IleGly: 5.426 ± 0.0
1.163IleHis: 1.163 ± 0.0
0.775IleIle: 0.775 ± 0.0
1.163IleLys: 1.163 ± 0.0
2.326IleLeu: 2.326 ± 0.0
1.55IleMet: 1.55 ± 0.0
1.163IleAsn: 1.163 ± 0.0
0.0IlePro: 0.0 ± 0.0
0.775IleGln: 0.775 ± 0.0
3.488IleArg: 3.488 ± 0.0
4.651IleSer: 4.651 ± 0.0
0.775IleThr: 0.775 ± 0.0
3.876IleVal: 3.876 ± 0.0
0.388IleTrp: 0.388 ± 0.0
2.713IleTyr: 2.713 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.326LysAla: 2.326 ± 0.0
0.388LysCys: 0.388 ± 0.0
1.938LysAsp: 1.938 ± 0.0
1.163LysGlu: 1.163 ± 0.0
2.713LysPhe: 2.713 ± 0.0
3.101LysGly: 3.101 ± 0.0
1.55LysHis: 1.55 ± 0.0
2.326LysIle: 2.326 ± 0.0
2.326LysLys: 2.326 ± 0.0
4.651LysLeu: 4.651 ± 0.0
1.163LysMet: 1.163 ± 0.0
2.326LysAsn: 2.326 ± 0.0
1.938LysPro: 1.938 ± 0.0
0.775LysGln: 0.775 ± 0.0
2.326LysArg: 2.326 ± 0.0
3.876LysSer: 3.876 ± 0.0
2.326LysThr: 2.326 ± 0.0
6.202LysVal: 6.202 ± 0.0
0.388LysTrp: 0.388 ± 0.0
1.55LysTyr: 1.55 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
8.14LeuAla: 8.14 ± 0.0
1.55LeuCys: 1.55 ± 0.0
5.039LeuAsp: 5.039 ± 0.0
4.651LeuGlu: 4.651 ± 0.0
4.264LeuPhe: 4.264 ± 0.0
5.039LeuGly: 5.039 ± 0.0
0.775LeuHis: 0.775 ± 0.0
4.264LeuIle: 4.264 ± 0.0
4.651LeuLys: 4.651 ± 0.0
8.14LeuLeu: 8.14 ± 0.0
1.938LeuMet: 1.938 ± 0.0
5.426LeuAsn: 5.426 ± 0.0
2.713LeuPro: 2.713 ± 0.0
1.163LeuGln: 1.163 ± 0.0
5.039LeuArg: 5.039 ± 0.0
8.915LeuSer: 8.915 ± 0.0
5.426LeuThr: 5.426 ± 0.0
8.527LeuVal: 8.527 ± 0.0
0.775LeuTrp: 0.775 ± 0.0
2.713LeuTyr: 2.713 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.775MetAla: 0.775 ± 0.0
0.388MetCys: 0.388 ± 0.0
1.163MetAsp: 1.163 ± 0.0
3.876MetGlu: 3.876 ± 0.0
1.938MetPhe: 1.938 ± 0.0
1.55MetGly: 1.55 ± 0.0
0.775MetHis: 0.775 ± 0.0
1.163MetIle: 1.163 ± 0.0
0.388MetLys: 0.388 ± 0.0
1.938MetLeu: 1.938 ± 0.0
0.775MetMet: 0.775 ± 0.0
0.388MetAsn: 0.388 ± 0.0
0.388MetPro: 0.388 ± 0.0
1.163MetGln: 1.163 ± 0.0
1.938MetArg: 1.938 ± 0.0
2.326MetSer: 2.326 ± 0.0
1.163MetThr: 1.163 ± 0.0
2.326MetVal: 2.326 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.55MetTyr: 1.55 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.101AsnAla: 3.101 ± 0.0
1.163AsnCys: 1.163 ± 0.0
1.163AsnAsp: 1.163 ± 0.0
0.388AsnGlu: 0.388 ± 0.0
3.101AsnPhe: 3.101 ± 0.0
2.326AsnGly: 2.326 ± 0.0
0.775AsnHis: 0.775 ± 0.0
1.163AsnIle: 1.163 ± 0.0
1.55AsnLys: 1.55 ± 0.0
3.488AsnLeu: 3.488 ± 0.0
0.388AsnMet: 0.388 ± 0.0
1.163AsnAsn: 1.163 ± 0.0
3.488AsnPro: 3.488 ± 0.0
2.326AsnGln: 2.326 ± 0.0
1.55AsnArg: 1.55 ± 0.0
3.876AsnSer: 3.876 ± 0.0
4.264AsnThr: 4.264 ± 0.0
3.488AsnVal: 3.488 ± 0.0
0.775AsnTrp: 0.775 ± 0.0
0.775AsnTyr: 0.775 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.488ProAla: 3.488 ± 0.0
1.938ProCys: 1.938 ± 0.0
1.55ProAsp: 1.55 ± 0.0
2.713ProGlu: 2.713 ± 0.0
2.326ProPhe: 2.326 ± 0.0
2.326ProGly: 2.326 ± 0.0
2.326ProHis: 2.326 ± 0.0
2.326ProIle: 2.326 ± 0.0
1.55ProLys: 1.55 ± 0.0
5.039ProLeu: 5.039 ± 0.0
1.55ProMet: 1.55 ± 0.0
1.938ProAsn: 1.938 ± 0.0
1.163ProPro: 1.163 ± 0.0
0.388ProGln: 0.388 ± 0.0
3.101ProArg: 3.101 ± 0.0
4.651ProSer: 4.651 ± 0.0
3.488ProThr: 3.488 ± 0.0
3.101ProVal: 3.101 ± 0.0
0.775ProTrp: 0.775 ± 0.0
2.326ProTyr: 2.326 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.55GlnAla: 1.55 ± 0.0
1.163GlnCys: 1.163 ± 0.0
1.163GlnAsp: 1.163 ± 0.0
1.163GlnGlu: 1.163 ± 0.0
0.775GlnPhe: 0.775 ± 0.0
1.938GlnGly: 1.938 ± 0.0
0.0GlnHis: 0.0 ± 0.0
1.55GlnIle: 1.55 ± 0.0
1.55GlnLys: 1.55 ± 0.0
2.326GlnLeu: 2.326 ± 0.0
0.388GlnMet: 0.388 ± 0.0
1.938GlnAsn: 1.938 ± 0.0
1.55GlnPro: 1.55 ± 0.0
1.55GlnGln: 1.55 ± 0.0
2.326GlnArg: 2.326 ± 0.0
1.938GlnSer: 1.938 ± 0.0
0.775GlnThr: 0.775 ± 0.0
3.101GlnVal: 3.101 ± 0.0
0.775GlnTrp: 0.775 ± 0.0
0.388GlnTyr: 0.388 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.101ArgAla: 3.101 ± 0.0
1.163ArgCys: 1.163 ± 0.0
3.876ArgAsp: 3.876 ± 0.0
4.264ArgGlu: 4.264 ± 0.0
3.488ArgPhe: 3.488 ± 0.0
6.202ArgGly: 6.202 ± 0.0
0.775ArgHis: 0.775 ± 0.0
1.938ArgIle: 1.938 ± 0.0
3.101ArgLys: 3.101 ± 0.0
6.977ArgLeu: 6.977 ± 0.0
0.775ArgMet: 0.775 ± 0.0
2.326ArgAsn: 2.326 ± 0.0
2.713ArgPro: 2.713 ± 0.0
0.388ArgGln: 0.388 ± 0.0
4.264ArgArg: 4.264 ± 0.0
4.264ArgSer: 4.264 ± 0.0
3.876ArgThr: 3.876 ± 0.0
4.264ArgVal: 4.264 ± 0.0
1.163ArgTrp: 1.163 ± 0.0
2.326ArgTyr: 2.326 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.202SerAla: 6.202 ± 0.0
0.0SerCys: 0.0 ± 0.0
2.713SerAsp: 2.713 ± 0.0
3.876SerGlu: 3.876 ± 0.0
5.039SerPhe: 5.039 ± 0.0
8.915SerGly: 8.915 ± 0.0
3.101SerHis: 3.101 ± 0.0
2.713SerIle: 2.713 ± 0.0
5.814SerLys: 5.814 ± 0.0
7.364SerLeu: 7.364 ± 0.0
1.938SerMet: 1.938 ± 0.0
3.101SerAsn: 3.101 ± 0.0
4.264SerPro: 4.264 ± 0.0
2.713SerGln: 2.713 ± 0.0
5.426SerArg: 5.426 ± 0.0
6.977SerSer: 6.977 ± 0.0
6.977SerThr: 6.977 ± 0.0
6.589SerVal: 6.589 ± 0.0
0.388SerTrp: 0.388 ± 0.0
0.775SerTyr: 0.775 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.488ThrAla: 3.488 ± 0.0
0.775ThrCys: 0.775 ± 0.0
5.814ThrAsp: 5.814 ± 0.0
3.488ThrGlu: 3.488 ± 0.0
2.326ThrPhe: 2.326 ± 0.0
3.101ThrGly: 3.101 ± 0.0
0.775ThrHis: 0.775 ± 0.0
1.55ThrIle: 1.55 ± 0.0
3.488ThrLys: 3.488 ± 0.0
9.302ThrLeu: 9.302 ± 0.0
1.938ThrMet: 1.938 ± 0.0
3.101ThrAsn: 3.101 ± 0.0
2.326ThrPro: 2.326 ± 0.0
1.163ThrGln: 1.163 ± 0.0
5.426ThrArg: 5.426 ± 0.0
5.426ThrSer: 5.426 ± 0.0
3.488ThrThr: 3.488 ± 0.0
3.488ThrVal: 3.488 ± 0.0
1.55ThrTrp: 1.55 ± 0.0
1.938ThrTyr: 1.938 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.589ValAla: 6.589 ± 0.0
1.938ValCys: 1.938 ± 0.0
7.752ValAsp: 7.752 ± 0.0
3.488ValGlu: 3.488 ± 0.0
2.326ValPhe: 2.326 ± 0.0
3.876ValGly: 3.876 ± 0.0
1.938ValHis: 1.938 ± 0.0
3.488ValIle: 3.488 ± 0.0
1.938ValLys: 1.938 ± 0.0
6.202ValLeu: 6.202 ± 0.0
1.163ValMet: 1.163 ± 0.0
1.938ValAsn: 1.938 ± 0.0
6.977ValPro: 6.977 ± 0.0
1.938ValGln: 1.938 ± 0.0
5.039ValArg: 5.039 ± 0.0
7.752ValSer: 7.752 ± 0.0
4.651ValThr: 4.651 ± 0.0
8.14ValVal: 8.14 ± 0.0
1.55ValTrp: 1.55 ± 0.0
3.488ValTyr: 3.488 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.388TrpAla: 0.388 ± 0.0
0.388TrpCys: 0.388 ± 0.0
1.938TrpAsp: 1.938 ± 0.0
1.163TrpGlu: 1.163 ± 0.0
0.388TrpPhe: 0.388 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.775TrpIle: 0.775 ± 0.0
1.163TrpLys: 1.163 ± 0.0
1.163TrpLeu: 1.163 ± 0.0
0.775TrpMet: 0.775 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.388TrpPro: 0.388 ± 0.0
0.388TrpGln: 0.388 ± 0.0
1.55TrpArg: 1.55 ± 0.0
0.775TrpSer: 0.775 ± 0.0
1.55TrpThr: 1.55 ± 0.0
0.775TrpVal: 0.775 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.938TyrAla: 1.938 ± 0.0
1.938TyrCys: 1.938 ± 0.0
1.938TyrAsp: 1.938 ± 0.0
0.775TyrGlu: 0.775 ± 0.0
1.938TyrPhe: 1.938 ± 0.0
2.326TyrGly: 2.326 ± 0.0
1.163TyrHis: 1.163 ± 0.0
0.388TyrIle: 0.388 ± 0.0
0.388TyrLys: 0.388 ± 0.0
2.326TyrLeu: 2.326 ± 0.0
1.163TyrMet: 1.163 ± 0.0
1.938TyrAsn: 1.938 ± 0.0
0.388TyrPro: 0.388 ± 0.0
1.55TyrGln: 1.55 ± 0.0
1.938TyrArg: 1.938 ± 0.0
3.488TyrSer: 3.488 ± 0.0
1.938TyrThr: 1.938 ± 0.0
1.163TyrVal: 1.163 ± 0.0
1.163TyrTrp: 1.163 ± 0.0
0.775TyrTyr: 0.775 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2581 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski