Amino acid dipeptide frequency for Milolii virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
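As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping residue pairs in a sequence and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the choice of overlapping windows are assumptions made for illustration; the exact procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequence: str) -> dict[str, float]:
    """Return the frequency of each overlapping dipeptide in per mille.

    Hypothetical helper: assumes one-letter amino acid codes and a
    sliding window of width 2 that advances one residue at a time.
    """
    seq = sequence.upper()
    # A sequence of length n contains n - 1 overlapping dipeptides.
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = len(pairs)
    if total == 0:
        return {}
    counts = Counter(pairs)
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequence (not taken from the Milolii virus proteome).
freqs = dipeptide_per_mille("MAAALK")
```

For the toy sequence the five dipeptides are MA, AA, AA, AL and LK, so `freqs["AA"]` is 400 per mille and the values sum to 1000.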

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
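The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and using the uniform expectation as the exact threshold for the red and blue marking is an assumption; the page does not state the precise cutoff.

```python
def uniform_expectation_per_mille(n_residue_types: int) -> float:
    """Expected per-mille frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille: float, expected_per_mille: float) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_per_mille > expected_per_mille:
        return "over-represented"
    if observed_per_mille < expected_per_mille:
        return "under-represented"
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 dipeptides, Xaa included

ala_ala = classify(8.464, expected_20)  # AlaAla value from the table
ala_cys = classify(0.605, expected_20)  # AlaCys value from the table
```

With 20 residue types the expectation is 2.5 per mille (0.25%); with Xaa included it drops to about 2.27 per mille. Under this assumed threshold AlaAla counts as over-represented and AlaCys as under-represented.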

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; the 0.25% expectation corresponds to 2.5‰.
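A short sketch of the unit conversion, using hypothetical helper names, may make the scaling explicit:

```python
def per_mille_to_fraction(value: float) -> float:
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value: float) -> float:
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(8.464)  # AlaAla value from the table
ala_ala_percent = per_mille_to_percent(8.464)
```

Here the AlaAla entry of 8.464‰ corresponds to a fraction of about 0.008464, or about 0.8464%.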

For more information see sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Values are in ‰; the estimated error of every value is ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  8.464  0.605  4.837  4.232  3.023  4.837  1.209  5.139  2.721  9.069  1.511  3.325   3.93  2.721  3.325  6.651  6.348  5.744  2.116  3.325    0.0
Cys  1.209  0.302  0.907  0.605    0.0  0.302    0.0  0.302  0.605  1.209  0.302    0.0  0.907  0.605  0.907  0.907  1.209  1.209  0.302  0.907    0.0
Asp  3.325  1.511  3.628  3.325   3.93  3.325  3.325  1.814  2.418  5.441  1.511  1.814  3.325  2.721   3.93   3.93   3.93  3.325  0.907  2.418    0.0
Glu  3.628  0.605  2.116  3.628  2.721  2.116  1.511  1.814  2.116  5.744  0.907  0.605  1.511  0.907  3.628  2.721  2.418  2.116  0.605  2.116    0.0
Phe  2.721  0.907  1.814  1.511  0.907  2.418  0.907   3.93  0.907  3.023  0.605  2.721  2.418  1.511  2.418  4.837   3.93  3.325  0.302  0.907    0.0
Gly  4.232  1.511  3.325  1.814  2.418  3.023  0.907  3.023  3.325   3.93  0.302  2.116  3.628  2.418   3.93  3.023  3.325  5.441  0.907  3.628    0.0
His  0.907  0.605  1.511  1.209  0.907  0.605  0.907  1.209  0.907  1.814  0.907  0.907  2.116  0.605  2.418  2.418  1.209  2.418  0.302  0.907    0.0
Ile  3.628    0.0  4.232  2.418  2.721  2.418  1.814  2.116  1.814  5.744  0.605  1.511  2.418  0.907  5.441  4.232  6.348  4.232  0.302  3.023    0.0
Lys  4.534  0.907  2.721  0.605  1.814  0.605  1.814  2.418  0.302  3.628  1.511  2.116  2.418  2.418  2.418  1.814  1.511  1.814  0.605  0.907    0.0
Leu  6.651  0.302  3.628  1.814  3.325  6.651  3.023  3.628  3.628  8.767  1.511  4.837  8.162  3.628  5.441  9.371  6.046  4.837  0.302   3.93    0.0
Met  1.814  0.605  1.814  1.209  0.605  0.907  0.302  1.814  0.605  1.511    0.0  1.209  1.209  0.605  1.814  0.605  1.209  0.907    0.0  0.605    0.0
Asn   3.93    0.0  2.721  3.325  2.116  2.418  0.907   3.93  1.511   3.93  1.209  1.511  3.023  2.116  0.605  3.325  2.116  1.814  0.605  1.209    0.0
Pro  6.348  0.605  3.325  1.814  2.116  5.139  1.511  2.721  2.418  6.953  1.511   3.93  3.023  0.605  3.325  5.441  4.837  2.721  0.302  2.418    0.0
Gln  2.721  0.605  1.814  1.209  2.418  2.116  0.605  2.116  1.511  1.209    0.0  1.511  3.628  1.511  2.116   3.93  1.511  3.628  0.302  2.418    0.0
Arg  3.023  0.605  6.046  4.534  0.605  2.418  1.814  2.721  0.907  6.046  2.418  3.023  4.534   3.93  3.023  5.744  3.325  5.441  0.907  2.721    0.0
Ser  6.348  0.605   3.93  3.023  4.232  6.953  1.209  4.534  5.139  7.255  1.511  3.023  4.837  2.418  5.441  6.046  7.255  3.628  0.605  2.116    0.0
Thr  7.255  0.605  3.628  2.116  3.628  2.116  1.511  5.744  2.721  4.837  0.302  2.418  6.348  2.721  5.139  7.255  5.744  4.837  0.605  2.721    0.0
Val  6.953  1.209  2.721  3.023  1.209  5.441  0.605  4.232  1.814  6.651  1.511  3.628  1.209  1.209  5.139  4.534  7.255  3.628  0.907  2.721    0.0
Trp  0.907    0.0  1.814  0.302  0.907    0.0    0.0  0.302    0.0  0.907  0.605    0.0  0.605  0.907  0.907  0.605  0.605  1.209    0.0  0.605    0.0
Tyr  4.837  0.605  3.325  1.814  2.418  2.418  0.907  2.418  1.511  1.814  0.302  2.116  1.814  2.418  2.418  2.721  2.116  3.325  0.302  3.325    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (3309 amino acids)
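Given the protein and residue totals, a per-mille value can be turned back into an approximate dipeptide count. The sketch below assumes that the denominator is the number of overlapping dipeptides, i.e. residues minus proteins (3309 - 1 = 3308); this is inferred from the tabulated values rather than stated on the page, and `approx_count` is a hypothetical helper.

```python
def approx_count(per_mille: float, n_residues: int = 3309, n_proteins: int = 1) -> int:
    """Estimate how many times a dipeptide occurs from its per-mille frequency.

    Assumption: each protein of length n contributes n - 1 overlapping
    dipeptides, so the total is n_residues - n_proteins.
    """
    n_dipeptides = n_residues - n_proteins
    return round(per_mille / 1000.0 * n_dipeptides)


leu_ser = approx_count(9.371)  # highest value in the table
ala_ala = approx_count(8.464)
cys_cys = approx_count(0.302)  # smallest non-zero value in the table
```

Under this assumption the smallest non-zero entry, 0.302‰, corresponds to a single occurrence, AlaAla to 28 occurrences and LeuSer to 31.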

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski