Amino acid dipeptide frequency for Zostera marina amalgavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides were distributed uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
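As a rough illustration of how such per mille values can be obtained, here is a minimal sketch in Python. It assumes that dipeptides are counted as overlapping pairs within each protein and that any non-standard residue is mapped to Xaa (written X below); the page does not state the exact counting rules, and the function name and toy sequences are hypothetical, not taken from the database.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are taken within one protein only, never across protein boundaries.
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"  # assumption: unknown residues -> Xaa
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only (not the real proteome).
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_permille(toy)

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
uniform = 1000.0 / 400
for pair, value in sorted(freqs.items()):
    label = "over" if value > uniform else "under"
    print(pair, round(value, 1), label)
```

Multiplying any reported value by 10^-3 gives the plain fraction, so 2.5‰ corresponds to 0.0025.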

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.294 ± 0.785
AlaCys: 3.497 ± 2.185
AlaAsp: 4.196 ± 3.146
AlaGlu: 2.797 ± 1.223
AlaPhe: 3.497 ± 0.438
AlaGly: 5.594 ± 2.799
AlaHis: 0.699 ± 0.35
AlaIle: 2.797 ± 1.399
AlaLys: 6.294 ± 0.785
AlaLeu: 6.993 ± 0.876
AlaMet: 2.098 ± 0.262
AlaAsn: 2.797 ± 1.223
AlaPro: 0.0 ± 0.0
AlaGln: 2.797 ± 2.534
AlaArg: 6.993 ± 0.436
AlaSer: 5.594 ± 2.446
AlaThr: 3.497 ± 0.438
AlaVal: 6.294 ± 1.837
AlaTrp: 4.196 ± 0.524
AlaTyr: 3.497 ± 0.873
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 4.196 ± 0.788
CysCys: 0.0 ± 0.0
CysAsp: 0.699 ± 0.35
CysGlu: 0.0 ± 0.0
CysPhe: 0.699 ± 0.35
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.699 ± 0.35
CysLys: 3.497 ± 0.873
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 2.098 ± 0.262
CysPro: 0.0 ± 0.0
CysGln: 2.098 ± 0.262
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.399 ± 0.612
CysVal: 0.699 ± 0.961
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.196 ± 0.788
AspCys: 0.0 ± 0.0
AspAsp: 4.196 ± 1.835
AspGlu: 2.797 ± 1.223
AspPhe: 2.797 ± 0.088
AspGly: 4.895 ± 1.137
AspHis: 0.699 ± 0.35
AspIle: 4.895 ± 0.174
AspLys: 4.895 ± 2.796
AspLeu: 4.895 ± 0.174
AspMet: 0.0 ± 0.0
AspAsn: 3.497 ± 0.438
AspPro: 1.399 ± 0.7
AspGln: 1.399 ± 0.7
AspArg: 7.692 ± 2.708
AspSer: 2.797 ± 1.223
AspThr: 2.797 ± 2.534
AspVal: 6.993 ± 1.747
AspTrp: 1.399 ± 0.7
AspTyr: 2.797 ± 0.088
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.594 ± 0.176
GluCys: 0.0 ± 0.0
GluAsp: 9.79 ± 1.659
GluGlu: 3.497 ± 0.873
GluPhe: 2.098 ± 1.049
GluGly: 4.895 ± 1.485
GluHis: 4.895 ± 1.485
GluIle: 2.797 ± 0.088
GluLys: 4.196 ± 0.524
GluLeu: 4.196 ± 0.788
GluMet: 2.098 ± 1.049
GluAsn: 2.098 ± 0.262
GluPro: 0.0 ± 0.0
GluGln: 1.399 ± 0.7
GluArg: 6.294 ± 0.526
GluSer: 3.497 ± 0.873
GluThr: 2.797 ± 0.088
GluVal: 3.497 ± 0.873
GluTrp: 0.0 ± 0.0
GluTyr: 1.399 ± 0.612
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 4.196 ± 0.524
PheAsp: 1.399 ± 0.7
PheGlu: 4.196 ± 1.835
PhePhe: 0.0 ± 0.0
PheGly: 0.0 ± 0.0
PheHis: 2.098 ± 0.262
PheIle: 1.399 ± 0.7
PheLys: 4.196 ± 0.524
PheLeu: 4.196 ± 2.099
PheMet: 1.399 ± 0.7
PheAsn: 0.699 ± 0.35
PhePro: 1.399 ± 0.7
PheGln: 0.699 ± 0.35
PheArg: 2.098 ± 0.262
PheSer: 3.497 ± 1.749
PheThr: 1.399 ± 0.7
PheVal: 3.497 ± 1.749
PheTrp: 0.699 ± 0.35
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.699 ± 0.35
GlyCys: 0.699 ± 0.35
GlyAsp: 5.594 ± 0.176
GlyGlu: 2.797 ± 1.223
GlyPhe: 3.497 ± 0.438
GlyGly: 6.294 ± 0.526
GlyHis: 0.0 ± 0.0
GlyIle: 2.797 ± 0.088
GlyLys: 2.797 ± 1.399
GlyLeu: 6.993 ± 0.876
GlyMet: 2.098 ± 1.049
GlyAsn: 3.497 ± 0.873
GlyPro: 1.399 ± 0.7
GlyGln: 1.399 ± 0.7
GlyArg: 3.497 ± 1.749
GlySer: 4.196 ± 0.788
GlyThr: 2.098 ± 1.049
GlyVal: 2.098 ± 1.049
GlyTrp: 1.399 ± 0.7
GlyTyr: 0.699 ± 0.961
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.699 ± 0.35
HisCys: 0.0 ± 0.0
HisAsp: 1.399 ± 0.7
HisGlu: 0.0 ± 0.0
HisPhe: 0.699 ± 0.35
HisGly: 1.399 ± 0.7
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 2.098 ± 0.262
HisMet: 0.0 ± 0.0
HisAsn: 0.699 ± 0.35
HisPro: 1.399 ± 0.612
HisGln: 0.0 ± 0.0
HisArg: 2.098 ± 0.262
HisSer: 3.497 ± 2.185
HisThr: 0.0 ± 0.0
HisVal: 2.098 ± 1.049
HisTrp: 0.0 ± 0.0
HisTyr: 2.098 ± 0.262
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.993 ± 1.747
IleCys: 0.0 ± 0.0
IleAsp: 2.797 ± 0.088
IleGlu: 4.196 ± 0.788
IlePhe: 2.098 ± 1.049
IleGly: 2.797 ± 1.399
IleHis: 1.399 ± 0.7
IleIle: 2.797 ± 1.399
IleLys: 4.196 ± 0.524
IleLeu: 6.294 ± 0.785
IleMet: 0.699 ± 0.35
IleAsn: 2.098 ± 1.049
IlePro: 4.895 ± 1.137
IleGln: 1.399 ± 0.7
IleArg: 3.497 ± 1.749
IleSer: 3.497 ± 0.438
IleThr: 0.0 ± 0.0
IleVal: 0.699 ± 0.35
IleTrp: 0.0 ± 0.0
IleTyr: 0.699 ± 0.35
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.594 ± 2.446
LysCys: 0.0 ± 0.0
LysAsp: 0.699 ± 0.35
LysGlu: 4.196 ± 0.788
LysPhe: 4.895 ± 1.137
LysGly: 3.497 ± 0.438
LysHis: 0.699 ± 0.961
LysIle: 6.993 ± 0.436
LysLys: 7.692 ± 4.019
LysLeu: 6.294 ± 0.526
LysMet: 0.699 ± 0.278
LysAsn: 5.594 ± 1.135
LysPro: 5.594 ± 1.135
LysGln: 2.797 ± 0.088
LysArg: 4.196 ± 1.835
LysSer: 0.699 ± 0.35
LysThr: 2.098 ± 0.262
LysVal: 5.594 ± 1.135
LysTrp: 2.098 ± 0.262
LysTyr: 3.497 ± 0.438
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.196 ± 0.788
LeuCys: 0.699 ± 0.35
LeuAsp: 7.692 ± 0.086
LeuGlu: 11.888 ± 0.609
LeuPhe: 2.797 ± 0.088
LeuGly: 3.497 ± 1.749
LeuHis: 0.699 ± 0.35
LeuIle: 4.196 ± 0.524
LeuLys: 6.294 ± 0.526
LeuLeu: 13.986 ± 1.751
LeuMet: 1.399 ± 0.57
LeuAsn: 3.497 ± 1.749
LeuPro: 4.895 ± 0.174
LeuGln: 4.196 ± 1.835
LeuArg: 9.091 ± 1.925
LeuSer: 4.196 ± 0.788
LeuThr: 9.091 ± 0.697
LeuVal: 1.399 ± 0.7
LeuTrp: 0.699 ± 0.35
LeuTyr: 4.196 ± 2.099
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.497 ± 0.873
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 2.797 ± 1.399
MetPhe: 1.399 ± 0.7
MetGly: 0.0 ± 0.0
MetHis: 0.699 ± 0.35
MetIle: 2.098 ± 0.262
MetLys: 2.098 ± 0.262
MetLeu: 2.098 ± 1.049
MetMet: 1.399 ± 0.7
MetAsn: 0.699 ± 0.35
MetPro: 0.0 ± 0.0
MetGln: 0.699 ± 0.35
MetArg: 1.399 ± 0.7
MetSer: 0.0 ± 0.0
MetThr: 3.497 ± 0.438
MetVal: 0.699 ± 0.35
MetTrp: 0.699 ± 0.35
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.797 ± 0.088
AsnCys: 0.0 ± 0.0
AsnAsp: 3.497 ± 2.185
AsnGlu: 3.497 ± 0.438
AsnPhe: 3.497 ± 0.873
AsnGly: 0.699 ± 0.35
AsnHis: 0.699 ± 0.35
AsnIle: 4.895 ± 1.137
AsnLys: 0.0 ± 0.0
AsnLeu: 8.392 ± 0.264
AsnMet: 1.399 ± 0.7
AsnAsn: 1.399 ± 0.612
AsnPro: 1.399 ± 0.612
AsnGln: 1.399 ± 1.923
AsnArg: 2.098 ± 1.049
AsnSer: 2.098 ± 1.049
AsnThr: 1.399 ± 0.612
AsnVal: 1.399 ± 0.7
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.797 ± 0.088
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.497 ± 0.438
ProCys: 0.699 ± 0.35
ProAsp: 2.797 ± 1.223
ProGlu: 7.692 ± 2.708
ProPhe: 1.399 ± 0.7
ProGly: 2.797 ± 1.223
ProHis: 0.0 ± 0.0
ProIle: 1.399 ± 0.7
ProLys: 2.797 ± 1.399
ProLeu: 5.594 ± 1.487
ProMet: 2.098 ± 0.262
ProAsn: 2.098 ± 0.262
ProPro: 2.797 ± 1.399
ProGln: 2.098 ± 0.262
ProArg: 0.699 ± 0.961
ProSer: 1.399 ± 0.7
ProThr: 2.797 ± 0.088
ProVal: 2.797 ± 0.088
ProTrp: 0.699 ± 0.35
ProTyr: 0.699 ± 0.35
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.196 ± 0.524
GlnCys: 0.699 ± 0.35
GlnAsp: 0.699 ± 0.35
GlnGlu: 1.399 ± 0.612
GlnPhe: 2.797 ± 1.223
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 2.797 ± 1.399
GlnLys: 3.497 ± 0.873
GlnLeu: 5.594 ± 0.176
GlnMet: 0.0 ± 0.0
GlnAsn: 1.399 ± 0.612
GlnPro: 2.098 ± 0.262
GlnGln: 2.797 ± 0.088
GlnArg: 2.098 ± 1.049
GlnSer: 1.399 ± 0.7
GlnThr: 2.797 ± 1.223
GlnVal: 0.0 ± 0.0
GlnTrp: 0.699 ± 0.35
GlnTyr: 2.098 ± 0.262
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.49 ± 2.62
ArgCys: 0.699 ± 0.35
ArgAsp: 4.196 ± 0.524
ArgGlu: 4.196 ± 0.788
ArgPhe: 1.399 ± 0.7
ArgGly: 6.993 ± 0.876
ArgHis: 0.0 ± 0.0
ArgIle: 3.497 ± 1.749
ArgLys: 4.895 ± 1.485
ArgLeu: 6.294 ± 1.837
ArgMet: 1.399 ± 0.7
ArgAsn: 2.797 ± 1.399
ArgPro: 4.196 ± 0.524
ArgGln: 4.196 ± 0.788
ArgArg: 9.79 ± 1.659
ArgSer: 4.196 ± 0.788
ArgThr: 4.196 ± 3.146
ArgVal: 2.098 ± 1.049
ArgTrp: 1.399 ± 0.7
ArgTyr: 3.497 ± 0.873
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.294 ± 0.785
SerCys: 1.399 ± 0.7
SerAsp: 4.895 ± 0.174
SerGlu: 2.797 ± 0.088
SerPhe: 0.699 ± 0.35
SerGly: 2.797 ± 1.399
SerHis: 1.399 ± 0.7
SerIle: 3.497 ± 0.438
SerLys: 6.993 ± 0.876
SerLeu: 4.895 ± 1.137
SerMet: 0.699 ± 0.35
SerAsn: 3.497 ± 0.873
SerPro: 2.098 ± 0.262
SerGln: 2.098 ± 0.262
SerArg: 3.497 ± 0.873
SerSer: 4.196 ± 1.835
SerThr: 2.098 ± 0.262
SerVal: 3.497 ± 2.185
SerTrp: 0.699 ± 0.35
SerTyr: 1.399 ± 0.612
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.895 ± 1.137
ThrCys: 0.699 ± 0.961
ThrAsp: 1.399 ± 0.612
ThrGlu: 2.098 ± 0.262
ThrPhe: 0.0 ± 0.0
ThrGly: 3.497 ± 0.873
ThrHis: 1.399 ± 0.612
ThrIle: 0.0 ± 0.0
ThrLys: 3.497 ± 0.438
ThrLeu: 4.895 ± 1.485
ThrMet: 0.699 ± 0.35
ThrAsn: 2.797 ± 1.223
ThrPro: 4.196 ± 0.524
ThrGln: 0.0 ± 0.0
ThrArg: 4.895 ± 1.485
ThrSer: 4.895 ± 0.174
ThrThr: 1.399 ± 0.612
ThrVal: 4.196 ± 0.524
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.797 ± 0.088
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.497 ± 2.185
ValCys: 0.0 ± 0.0
ValAsp: 4.196 ± 0.524
ValGlu: 2.098 ± 1.049
ValPhe: 1.399 ± 0.7
ValGly: 2.797 ± 0.088
ValHis: 1.399 ± 0.7
ValIle: 2.098 ± 0.262
ValLys: 2.797 ± 1.223
ValLeu: 2.098 ± 1.049
ValMet: 3.497 ± 0.873
ValAsn: 0.0 ± 0.0
ValPro: 4.196 ± 0.788
ValGln: 3.497 ± 0.438
ValArg: 6.993 ± 2.187
ValSer: 6.294 ± 0.785
ValThr: 2.797 ± 0.088
ValVal: 4.196 ± 0.524
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.699 ± 0.35
TrpAsp: 1.399 ± 0.612
TrpGlu: 1.399 ± 0.7
TrpPhe: 0.0 ± 0.0
TrpGly: 0.699 ± 0.35
TrpHis: 1.399 ± 0.612
TrpIle: 0.699 ± 0.35
TrpLys: 0.0 ± 0.0
TrpLeu: 1.399 ± 0.612
TrpMet: 0.699 ± 0.35
TrpAsn: 1.399 ± 0.7
TrpPro: 0.0 ± 0.0
TrpGln: 1.399 ± 0.7
TrpArg: 1.399 ± 0.7
TrpSer: 1.399 ± 0.7
TrpThr: 0.699 ± 0.35
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.797 ± 1.223
TyrCys: 2.098 ± 0.262
TyrAsp: 3.497 ± 0.438
TyrGlu: 0.699 ± 0.35
TyrPhe: 1.399 ± 0.7
TyrGly: 1.399 ± 0.7
TyrHis: 0.0 ± 0.0
TyrIle: 0.699 ± 0.35
TyrLys: 3.497 ± 0.873
TyrLeu: 1.399 ± 0.612
TyrMet: 0.699 ± 0.35
TyrAsn: 0.699 ± 0.35
TyrPro: 4.895 ± 0.174
TyrGln: 0.699 ± 0.35
TyrArg: 2.098 ± 0.262
TyrSer: 2.098 ± 0.262
TyrThr: 1.399 ± 0.612
TyrVal: 2.098 ± 0.262
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.699 ± 0.961
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1431 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
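One plausible reading of this procedure, given only as a sketch: resample whole proteins with replacement 100 times, recompute the per mille frequency of a dipeptide in each resample, and report the mean and standard deviation across resamples. The page does not describe the exact method, so the function names, the fixed random seed, and the use of the population standard deviation are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each replicate draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


# Toy sequences for illustration only (not the real proteome).
toy = ["MKLLA", "ALLK"]
mean_ll, sd_ll = bootstrap_permille(toy, "LL")
print(f"LeuLeu: {mean_ll:.3f} ± {sd_ll:.3f}")
```

Under this reading, a dipeptide absent from every protein gets 0.0 ± 0.0 in every resample. With only 2 proteins there are very few distinct resamples, so such an error estimate is necessarily coarse.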

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski