Amino acid dipeptide frequency for Xingshan cricket virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
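As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues in one sequence and reports them in per mille. It assumes one-letter amino acid codes (the table on this page uses three-letter codes), and the function name dipeptide_per_mille is hypothetical; it is not taken from the Proteome-pI code base.

```python
from collections import Counter


def dipeptide_per_mille(sequence: str) -> dict[str, float]:
    """Frequency of each overlapping residue pair, in per mille (hypothetical helper)."""
    # A sequence of length n contains n - 1 overlapping pairs.
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    if not pairs:
        return {}
    counts = Counter(pairs)
    return {pair: 1000 * n / len(pairs) for pair, n in counts.items()}


# Toy sequence, not real data: pairs are MA, AL, LL, LA.
freqs = dipeptide_per_mille("MALLA")
```

For the toy sequence, each of the four pairs occurs once, so each gets a frequency of 250‰.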

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
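The comparison against the uniform expectation can be sketched as follows. This assumes the baseline is simply 1/400 of all dipeptides, as described above; the exact threshold used for the red and blue marking on the page is not stated, so the classify function is only an illustration. The three example values are taken from the table on this page.

```python
# Uniform expectation over the 20 standard amino acids, in per mille.
N_STANDARD = 20
expected_per_mille = 1000 / N_STANDARD ** 2  # 2.5


def classify(observed: float, expected: float = expected_per_mille) -> str:
    """Label a dipeptide frequency relative to the uniform baseline (illustrative only)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page (per mille).
examples = {"LeuAla": 8.629, "AlaPhe": 2.546, "HisTrp": 0.141}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With Xaa included as a 21st symbol, the baseline would instead be 1000 / 441, roughly 2.27‰.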

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second. Values are in ‰; every value has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  6.366  2.688  3.254  5.093  2.546  5.234  1.556  2.971  4.385  7.073  2.405  2.546  3.395  2.405  4.102  5.941  5.659  4.668  1.839  2.688    0.0
Cys  1.273  0.707  1.556   0.99  1.132  2.829  0.707  1.132  1.556  2.546  0.424  1.556   0.99  0.849  1.556  2.546  1.132  2.405  0.566   0.99    0.0
Asp  3.678   1.98  2.546  2.829  2.688  3.395  1.698  3.112  2.829  3.112  0.424  2.405  2.263  1.556  2.405   4.81  2.688  3.537  2.122  1.273    0.0
Glu  4.668  1.273  3.254  2.971  1.698  3.112  1.698   1.98  2.405  6.366  0.849   1.98  2.829   1.98  3.678  2.688  3.112  4.102  1.556  1.698    0.0
Phe  1.273  0.707  2.688  2.546  1.839  3.112  0.283  1.556  2.263  2.971  0.849  1.132  1.273  0.849  0.849  4.102  3.112  2.688  0.849  0.849    0.0
Gly  5.093  2.405  4.385  5.659  2.971  4.244  2.122  3.254  3.395  6.649  1.415  2.688  3.254  1.698  4.385  4.527  5.093  4.951  0.849  1.415    0.0
His  2.405  1.556   0.99  1.698  0.424  1.415  0.424  1.273  1.415  2.688  0.424  1.132  1.132  1.556  2.263  1.415  1.698  2.829  0.141  1.415    0.0
Ile  4.102   0.99  2.829  1.839  1.556  3.961  1.273  3.254  2.546  3.537  0.283  3.112   1.98   1.98  2.688  3.678  3.537  3.678  1.273  1.698    0.0
Lys  3.395  0.566  2.546  2.263  2.263  2.688   1.98  3.678  3.395  3.112  0.849  3.112  2.546  1.556  2.688  3.395  5.659   4.81  1.132  2.122    0.0
Leu  8.629  1.415  4.668  3.537  2.546    5.8   1.98  5.093   4.81  8.346  2.122  3.678  3.961  3.254    5.8  6.083  5.093  6.649   0.99  1.415    0.0
Met   0.99  0.141  0.849  0.283  0.566  0.849  0.849  0.283  1.273  1.698  0.424  0.707  0.424  0.424  1.839  0.566  1.839  1.698  0.141  0.424    0.0
Asn  2.971   0.99  1.273   1.98  0.566  2.546  1.415  3.112  2.263   4.81  0.566  2.263  2.829  1.839  3.112  3.395  2.546  3.112  0.707  1.273    0.0
Pro  3.961  0.849  3.112  2.263  1.839  2.971  1.556  1.415  2.546  2.405  0.424  1.698  2.122  1.556   1.98  3.819  3.112  5.659  0.566   0.99    0.0
Gln  2.263  1.698  1.273   1.98  2.263  2.405  1.415  2.122  0.849  2.829  0.283  0.849   1.98  1.839  2.263  2.263  2.546  2.122  0.424  0.424    0.0
Arg  4.385  0.849  3.254  2.971  1.415  3.961  1.556  3.112  2.263   6.79   0.99  3.395  2.688  1.839  3.254  4.244  2.405    5.8   0.99  1.839    0.0
Ser  5.093  2.546  2.688  4.385  2.546  7.498  2.263  3.961  3.819  5.093  0.849  2.263  2.405  3.112   4.81  4.244  5.517  5.941  1.132  1.839    0.0
Thr  5.659  1.839  3.112  3.112  2.263  5.093  2.405  3.819  4.244  5.234  0.707  2.122  3.112   1.98  3.395  5.093  4.244   6.79  1.273  2.405    0.0
Val  6.649  2.263  4.102  5.659  3.254  5.376   1.98  2.263  4.527  6.224  0.707  4.385  3.819  1.698  4.527  6.649  6.224   6.79  1.415  2.971    0.0
Trp  1.132  0.849  1.273  0.707  0.707  1.556  0.424  0.566  1.556  1.698  0.566  1.132  0.849  0.283   0.99  1.132  1.273  1.415  0.566  0.424    0.0
Tyr  2.829  1.132  1.698  1.132  0.707  2.405  0.707  1.839  1.415  2.688  0.566  1.415   0.99  1.839  1.415  1.556  1.415  1.839  0.566  1.132    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0
Statistics based on 1 protein (7070 amino acids)
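Because the statistics come from a single protein of 7070 amino acids, the per mille values can be turned back into approximate raw counts. The sketch below assumes that frequencies were computed over overlapping adjacent pairs, so one protein of 7070 residues contributes 7069 dipeptides; this is an assumption about the method, not something the page states. The helper name approx_count is hypothetical.

```python
N_RESIDUES = 7070
# Assumption: overlapping pairs within one protein, so n - 1 dipeptides.
N_PAIRS = N_RESIDUES - 1


def approx_count(per_mille: float, n_pairs: int = N_PAIRS) -> int:
    """Approximate number of occurrences behind a per mille value (hypothetical helper)."""
    return round(per_mille / 1000 * n_pairs)


# Values copied from the table on this page.
count_ala_ala = approx_count(6.366)
count_leu_ala = approx_count(8.629)
count_his_trp = approx_count(0.141)
```

Under this assumption, the smallest non-zero value in the table (0.141‰) corresponds to a dipeptide seen once, and the other values are close to integer multiples of it.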

Note: The error has been estimated by bootstrapping (x100) at the protein level
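A minimal sketch of protein-level bootstrapping is given below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed on each resample, and that the reported error is the standard deviation over 100 rounds; the page does not spell out these details, and the function names are hypothetical. It also shows why a proteome with a single protein yields an error of ± 0.0: every resample is the same protein.

```python
import random
import statistics


def pair_per_mille(proteins: list[str], pair: str) -> float:
    """Per mille frequency of one dipeptide over all overlapping pairs in the given proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins: list[str], pair: str, rounds: int = 100, seed: int = 0) -> float:
    """Standard deviation of the frequency over protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences in one-letter code, not real data.
single_protein_error = bootstrap_error(["MALLAVLA"], "LA")
multi_protein_error = bootstrap_error(["MALLA", "MKVVV", "LALALA"], "LA")
```

With one protein the resampled estimates are all identical, so the spread is exactly zero; with several proteins the estimates vary between rounds and the error becomes positive.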

The above dipeptide statistics (among other stats for this proteome) can be downloaded as a CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski