Amino acid dipeptide frequency for Sanxia water strider virus 6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
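As a rough illustration of what such a calculation might look like, the sketch below counts overlapping pairs of adjacent residues in a sequence and reports each pair's share in per mille. The function name `dipeptide_permille` and the toy sequence are hypothetical, and the sequence uses one-letter codes rather than the three-letter codes shown in the table. This is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_permille(sequence: str) -> dict:
    """Return the frequency of each overlapping residue pair, in per mille."""
    # Every position except the last starts one dipeptide.
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    counts = Counter(pairs)
    total = len(pairs)
    # An empty or one-residue sequence has no pairs, so the result is empty.
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy sequence (hypothetical): pairs are MK, KK, KL, LL, LA.
freqs = dipeptide_permille("MKKLLA")
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

A sequence of n residues yields n - 1 overlapping pairs, so the frequencies always sum to 1000‰.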

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.
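The arithmetic behind the expected value can be sketched as follows. The page does not spell out the exact threshold used for the red and blue marking, so comparing against the uniform expectation of 1/400 is an assumption here, and `classify` is a hypothetical helper.

```python
STANDARD_AMINO_ACIDS = 20
WITH_XAA = 21

# Uniform expectation: every ordered pair is equally likely.
expected_permille = 1000 / STANDARD_AMINO_ACIDS ** 2   # 1000 / 400 = 2.5
expected_with_xaa = 1000 / WITH_XAA ** 2               # 1000 / 441, about 2.27


def classify(observed_permille: float, expected: float = expected_permille) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed baseline)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: IleLys (7.933) and GlyTrp (0.137).
print(classify(7.933))
print(classify(0.137))
```

A baseline built from the single amino acid frequencies of this proteome would give different expectations per pair. The uniform baseline is used here only because it is the one the text describes.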

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
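For instance, the conversion can be sketched as below, using the AlaAla value from the table. The step from a fraction back to an approximate count assumes that frequencies are normalized by the number of overlapping pairs in the single 7312-residue protein reported in the statistics line. That normalization is an inference, not something the page states.

```python
def permille_to_fraction(value: float) -> float:
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


N_RESIDUES = 7312          # from the statistics line of this page
N_PAIRS = N_RESIDUES - 1   # assumption: overlapping pairs within one protein

fraction = permille_to_fraction(3.83)      # AlaAla
approx_count = round(fraction * N_PAIRS)   # implied number of AlaAla pairs

print(fraction, approx_count)
```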

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second. Values are in ‰; the estimated error is ± 0.0 for every entry.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala   3.83  0.957   3.42  2.188  1.641  1.505   0.41  3.283  4.377  5.608  0.957  3.283  1.505  2.325  1.641  3.283  3.009  4.377  0.684  2.736    0.0
Cys  0.957  0.547  1.505  2.052  1.231  1.094  0.547  2.188  0.957  2.052  0.274  1.505  0.684  0.547  0.547  2.599  1.915  0.274   0.41  1.231    0.0
Asp  3.146  0.957  2.462  2.325  1.778  2.599  1.231  3.009  5.471  2.872  1.641   4.24  2.188  1.641  1.915  3.146  3.146   3.42  0.821  3.693    0.0
Glu  2.736  1.231  3.693  4.787   4.24  1.094  1.778   4.24  5.745  5.745  1.641   3.83  1.368  2.052  1.094  2.736  2.736  4.103  0.684  1.505    0.0
Phe  1.368  1.094  4.377  2.736  2.462  1.505  0.684  2.462  5.198  3.283   0.41  4.514  0.684  0.547  1.094  1.915  3.556  4.514  0.684  2.052    0.0
Gly  1.915  0.547  1.505  2.188  1.641  2.599  0.547  2.599  4.787  4.651  0.547  2.872  1.505  2.188  2.052  2.188  3.693   3.42  0.137  1.915    0.0
His  0.957  0.547  1.094  0.684  0.821  1.368  0.957  1.231  1.231  0.957  0.274  2.462  0.547  1.231  0.821  3.009  2.052  1.368  0.274  0.684    0.0
Ile  3.283  1.368  3.556  3.009  2.325  2.872  1.094  5.608  7.933  6.839  1.505  4.651  1.641  2.736  2.052  4.377  6.155  5.334   0.41  2.736    0.0
Lys   3.83  1.915  4.377  5.745  5.334  2.872  1.505  5.061  3.556  7.249  2.052  6.976  3.009  3.556  2.736  6.702  6.839   3.42  1.231   3.83    0.0
Leu  5.608  2.325  5.334  5.745  3.146  5.061  2.052  5.334  7.386  5.198  1.778  6.565   3.42  2.462  2.188  6.976  7.249  5.608  0.547  3.556    0.0
Met  1.368   0.41  1.094  1.368  1.641  1.641  0.821  1.094  0.821  2.188  0.547  1.094  0.957  1.231  0.547  1.094  2.325  1.915  0.137  0.821    0.0
Asn   3.83  2.325  3.146  3.967  4.103  2.462  1.505  6.429   7.66  6.839  2.462  5.608  1.641  2.188  2.599  4.377  5.471  5.198  0.821  3.693    0.0
Pro  1.641  1.094  0.957  1.641  0.957  1.641  0.821  2.462  2.872  3.693  0.821  1.915  0.547  1.231  0.821  2.736  2.599  2.462   0.41  1.094    0.0
Gln  1.915  1.094  2.599  1.505  1.915  1.505  0.957  2.462  1.915  2.736  1.231  2.052  2.052  1.505  1.505  0.957  2.872  2.188  0.547  1.915    0.0
Arg  0.821  0.684  0.547  1.368  1.094  1.641  1.094  2.599  1.641   3.83  0.957  2.872  1.231  0.821  1.778  3.146  2.052  2.188  0.137  0.821    0.0
Ser   3.42  1.778  3.693  4.514  2.872  4.651  1.641  4.651  5.198  6.976  1.778  4.377  1.368  2.872  1.915  4.651  4.787  4.924  0.137  3.146    0.0
Thr  4.787  2.052  3.146  4.103  3.556  2.736  1.505  6.018  5.471  5.882  1.641  6.292  3.283  2.599  2.052  5.471  5.334  4.377  0.957  4.377    0.0
Val  2.462  1.094  2.325  3.283  2.462  2.188  2.052  5.061  5.334  5.471  1.231  6.155  3.967  2.052  2.599  6.429  5.471  5.471  0.547  2.599    0.0
Trp  0.274  0.137   0.41  0.684   0.41  0.274   0.41  0.684  0.684  0.957  0.274  0.957  0.137  0.547    0.0  0.684  1.231   0.41   0.41  0.547    0.0
Tyr  2.872  0.957  2.462  3.146  1.505  2.188  0.957  3.009  3.556  4.514  0.957  4.103  0.684  1.094  1.368  2.872  3.283  3.283  0.137  2.188    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (7312 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
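One plausible reading of "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The sketch below follows that reading. The helper names, the toy sequence, and the use of the population standard deviation are all assumptions, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs are counted within each protein separately.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the estimate over resamples of whole proteins (assumed scheme)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome with a single (hypothetical) protein.
single_protein = ["MKKLLAKK"]
error = bootstrap_error(single_protein, "KK")
print(error)
```

Under this scheme a proteome with a single protein yields the same sample in every round, so the spread is zero. That would be consistent with the ± 0.0 reported for every dipeptide here.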

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski