Amino acid dipeptide frequency for Beihai sobemo-like virus 15

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
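The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify` are hypothetical, and the sketch assumes that pairs are counted within each protein (never across protein boundaries), that any non-standard letter is mapped to Xaa (written `X`), and that frequencies are normalized by the total number of counted pairs. The source does not state which normalization the database uses.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs inside one protein.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"   # would be marked red in the table
    if value_permille < expected:
        return "under"  # would be marked blue in the table
    return "expected"
```

For example, `classify(8.606)` (the AlaLys value in the table) gives `"over"`, while `classify(0.861)` gives `"under"`.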

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
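As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a fraction or a percentage. The helper names are hypothetical and exist only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0
```

With these helpers, the LeuSer value of 11.188‰ corresponds to a fraction of about 0.011188, and the uniform expectation of 2.5‰ corresponds to 0.25%.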

For more information, see the sequence space article on Wikipedia.

Each entry below gives the dipeptide (first residue, then second residue), its frequency, and the estimated error, in ‰.
Second residue: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.582 ± 0.092
AlaCys: 0.861 ± 0.917
AlaAsp: 2.582 ± 1.422
AlaGlu: 4.303 ± 1.926
AlaPhe: 4.303 ± 0.733
AlaGly: 4.303 ± 0.733
AlaHis: 1.721 ± 0.825
AlaIle: 4.303 ± 0.597
AlaLys: 8.606 ± 1.466
AlaLeu: 5.164 ± 2.475
AlaMet: 2.582 ± 1.238
AlaAsn: 4.303 ± 0.733
AlaPro: 6.024 ± 1.558
AlaGln: 3.442 ± 0.32
AlaArg: 0.861 ± 0.917
AlaSer: 4.303 ± 0.733
AlaThr: 4.303 ± 0.733
AlaVal: 3.442 ± 2.339
AlaTrp: 2.582 ± 1.422
AlaTyr: 0.861 ± 0.413
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.721 ± 0.505
CysCys: 0.861 ± 0.413
CysAsp: 0.861 ± 0.917
CysGlu: 0.0 ± 0.0
CysPhe: 0.861 ± 0.917
CysGly: 2.582 ± 0.092
CysHis: 0.861 ± 0.917
CysIle: 0.861 ± 0.413
CysLys: 1.721 ± 0.825
CysLeu: 0.861 ± 0.413
CysMet: 0.861 ± 0.389
CysAsn: 0.861 ± 0.413
CysPro: 1.721 ± 0.505
CysGln: 2.582 ± 0.092
CysArg: 2.582 ± 0.092
CysSer: 1.721 ± 0.505
CysThr: 0.861 ± 0.413
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.024 ± 2.888
AspCys: 0.0 ± 0.0
AspAsp: 3.442 ± 0.32
AspGlu: 5.164 ± 1.145
AspPhe: 1.721 ± 0.825
AspGly: 4.303 ± 1.926
AspHis: 0.861 ± 0.413
AspIle: 0.0 ± 0.0
AspLys: 2.582 ± 2.751
AspLeu: 4.303 ± 0.597
AspMet: 2.582 ± 1.238
AspAsn: 1.721 ± 0.505
AspPro: 1.721 ± 0.825
AspGln: 0.861 ± 0.917
AspArg: 2.582 ± 1.238
AspSer: 1.721 ± 0.825
AspThr: 4.303 ± 0.733
AspVal: 0.861 ± 0.413
AspTrp: 1.721 ± 0.505
AspTyr: 1.721 ± 0.505
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.582 ± 0.092
GluCys: 3.442 ± 0.32
GluAsp: 0.861 ± 0.413
GluGlu: 9.466 ± 3.208
GluPhe: 2.582 ± 0.092
GluGly: 3.442 ± 0.32
GluHis: 0.861 ± 0.413
GluIle: 5.164 ± 0.184
GluLys: 4.303 ± 0.597
GluLeu: 1.721 ± 0.825
GluMet: 3.442 ± 1.009
GluAsn: 5.164 ± 1.514
GluPro: 9.466 ± 0.781
GluGln: 2.582 ± 1.238
GluArg: 0.0 ± 0.0
GluSer: 6.024 ± 2.888
GluThr: 1.721 ± 0.505
GluVal: 1.721 ± 1.834
GluTrp: 0.0 ± 0.0
GluTyr: 2.582 ± 0.092
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.582 ± 1.238
PheCys: 2.582 ± 1.422
PheAsp: 0.0 ± 0.0
PheGlu: 3.442 ± 1.009
PhePhe: 2.582 ± 1.238
PheGly: 0.861 ± 0.917
PheHis: 1.721 ± 0.825
PheIle: 3.442 ± 2.339
PheLys: 1.721 ± 0.825
PheLeu: 3.442 ± 2.339
PheMet: 0.0 ± 0.0
PheAsn: 3.442 ± 1.65
PhePro: 0.861 ± 0.413
PheGln: 0.861 ± 0.917
PheArg: 2.582 ± 1.422
PheSer: 1.721 ± 0.825
PheThr: 2.582 ± 1.238
PheVal: 0.861 ± 0.917
PheTrp: 0.0 ± 0.0
PheTyr: 0.861 ± 0.413
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.303 ± 0.597
GlyCys: 0.861 ± 0.413
GlyAsp: 3.442 ± 1.009
GlyGlu: 2.582 ± 1.238
GlyPhe: 2.582 ± 1.422
GlyGly: 4.303 ± 0.597
GlyHis: 0.0 ± 0.0
GlyIle: 4.303 ± 0.733
GlyLys: 2.582 ± 1.422
GlyLeu: 6.024 ± 2.431
GlyMet: 1.721 ± 0.505
GlyAsn: 5.164 ± 1.514
GlyPro: 1.721 ± 0.505
GlyGln: 0.861 ± 0.413
GlyArg: 2.582 ± 0.092
GlySer: 1.721 ± 0.825
GlyThr: 2.582 ± 1.422
GlyVal: 5.164 ± 2.844
GlyTrp: 1.721 ± 1.834
GlyTyr: 4.303 ± 3.256
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.861 ± 0.917
HisCys: 0.0 ± 0.0
HisAsp: 1.721 ± 0.505
HisGlu: 0.861 ± 0.413
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.861 ± 0.413
HisLys: 0.861 ± 0.917
HisLeu: 3.442 ± 1.009
HisMet: 0.0 ± 0.0
HisAsn: 2.582 ± 1.238
HisPro: 0.861 ± 0.413
HisGln: 0.0 ± 0.0
HisArg: 0.861 ± 0.917
HisSer: 0.861 ± 0.413
HisThr: 0.861 ± 0.917
HisVal: 2.582 ± 1.422
HisTrp: 0.861 ± 0.917
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 7.745 ± 0.276
IleCys: 0.0 ± 0.0
IleAsp: 5.164 ± 1.145
IleGlu: 3.442 ± 0.32
IlePhe: 1.721 ± 0.505
IleGly: 2.582 ± 0.092
IleHis: 0.0 ± 0.0
IleIle: 0.861 ± 0.917
IleLys: 4.303 ± 0.597
IleLeu: 6.024 ± 2.888
IleMet: 1.721 ± 2.309
IleAsn: 0.861 ± 0.413
IlePro: 2.582 ± 0.092
IleGln: 2.582 ± 0.092
IleArg: 2.582 ± 0.092
IleSer: 4.303 ± 0.597
IleThr: 4.303 ± 0.597
IleVal: 0.0 ± 0.0
IleTrp: 0.0 ± 0.0
IleTyr: 0.861 ± 0.413
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.442 ± 0.32
LysCys: 2.582 ± 1.238
LysAsp: 4.303 ± 0.733
LysGlu: 3.442 ± 0.32
LysPhe: 0.0 ± 0.0
LysGly: 2.582 ± 1.238
LysHis: 1.721 ± 0.505
LysIle: 4.303 ± 0.597
LysLys: 4.303 ± 0.733
LysLeu: 10.327 ± 0.368
LysMet: 0.861 ± 0.413
LysAsn: 0.861 ± 0.413
LysPro: 5.164 ± 0.184
LysGln: 6.024 ± 1.558
LysArg: 3.442 ± 0.32
LysSer: 6.024 ± 0.228
LysThr: 4.303 ± 0.733
LysVal: 3.442 ± 0.32
LysTrp: 0.861 ± 0.413
LysTyr: 2.582 ± 0.092
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.885 ± 1.97
LeuCys: 1.721 ± 1.834
LeuAsp: 0.861 ± 0.917
LeuGlu: 9.466 ± 2.111
LeuPhe: 1.721 ± 1.834
LeuGly: 6.024 ± 3.761
LeuHis: 0.0 ± 0.0
LeuIle: 4.303 ± 0.597
LeuLys: 8.606 ± 2.795
LeuLeu: 7.745 ± 2.383
LeuMet: 1.721 ± 0.825
LeuAsn: 3.442 ± 1.65
LeuPro: 4.303 ± 2.063
LeuGln: 3.442 ± 1.65
LeuArg: 5.164 ± 0.184
LeuSer: 11.188 ± 1.374
LeuThr: 10.327 ± 0.368
LeuVal: 4.303 ± 2.063
LeuTrp: 1.721 ± 0.825
LeuTyr: 3.442 ± 2.339
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.582 ± 1.422
MetCys: 1.721 ± 0.825
MetAsp: 1.721 ± 0.825
MetGlu: 0.861 ± 0.413
MetPhe: 0.861 ± 0.413
MetGly: 1.721 ± 0.505
MetHis: 0.861 ± 0.413
MetIle: 0.0 ± 0.0
MetLys: 0.861 ± 0.917
MetLeu: 2.582 ± 0.092
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.861 ± 0.413
MetGln: 2.582 ± 1.422
MetArg: 0.0 ± 0.0
MetSer: 3.442 ± 1.009
MetThr: 1.721 ± 0.505
MetVal: 0.861 ± 0.413
MetTrp: 0.0 ± 0.0
MetTyr: 0.861 ± 0.917
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.164 ± 0.184
AsnCys: 1.721 ± 0.825
AsnAsp: 2.582 ± 1.238
AsnGlu: 0.0 ± 0.0
AsnPhe: 3.442 ± 1.65
AsnGly: 1.721 ± 0.505
AsnHis: 1.721 ± 1.834
AsnIle: 3.442 ± 0.32
AsnLys: 2.582 ± 1.238
AsnLeu: 4.303 ± 0.733
AsnMet: 0.861 ± 0.917
AsnAsn: 2.582 ± 0.092
AsnPro: 1.721 ± 0.505
AsnGln: 1.721 ± 0.825
AsnArg: 0.861 ± 0.413
AsnSer: 2.582 ± 1.422
AsnThr: 3.442 ± 1.65
AsnVal: 1.721 ± 0.825
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.861 ± 0.413
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.442 ± 0.32
ProCys: 1.721 ± 0.505
ProAsp: 2.582 ± 0.092
ProGlu: 7.745 ± 0.276
ProPhe: 2.582 ± 2.751
ProGly: 6.024 ± 2.431
ProHis: 0.861 ± 0.917
ProIle: 1.721 ± 0.505
ProLys: 7.745 ± 1.053
ProLeu: 6.885 ± 1.97
ProMet: 1.721 ± 0.505
ProAsn: 2.582 ± 1.238
ProPro: 6.024 ± 0.228
ProGln: 6.024 ± 2.888
ProArg: 0.861 ± 0.413
ProSer: 3.442 ± 1.65
ProThr: 5.164 ± 1.145
ProVal: 2.582 ± 0.092
ProTrp: 2.582 ± 1.422
ProTyr: 0.861 ± 0.413
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.024 ± 2.431
GlnCys: 0.861 ± 0.413
GlnAsp: 1.721 ± 0.825
GlnGlu: 3.442 ± 0.32
GlnPhe: 1.721 ± 0.825
GlnGly: 1.721 ± 0.825
GlnHis: 0.861 ± 0.917
GlnIle: 2.582 ± 1.238
GlnLys: 4.303 ± 2.063
GlnLeu: 7.745 ± 1.053
GlnMet: 0.0 ± 0.0
GlnAsn: 0.861 ± 0.917
GlnPro: 3.442 ± 0.32
GlnGln: 1.721 ± 0.505
GlnArg: 2.582 ± 1.422
GlnSer: 4.303 ± 2.063
GlnThr: 4.303 ± 2.063
GlnVal: 4.303 ± 2.063
GlnTrp: 0.861 ± 0.917
GlnTyr: 1.721 ± 0.505
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.442 ± 0.32
ArgCys: 0.0 ± 0.0
ArgAsp: 5.164 ± 1.145
ArgGlu: 1.721 ± 0.505
ArgPhe: 0.861 ± 0.413
ArgGly: 1.721 ± 1.834
ArgHis: 1.721 ± 0.505
ArgIle: 0.861 ± 0.413
ArgLys: 2.582 ± 0.092
ArgLeu: 3.442 ± 1.009
ArgMet: 0.861 ± 0.917
ArgAsn: 0.861 ± 0.413
ArgPro: 4.303 ± 0.733
ArgGln: 2.582 ± 0.092
ArgArg: 4.303 ± 0.597
ArgSer: 0.0 ± 0.0
ArgThr: 1.721 ± 0.825
ArgVal: 0.0 ± 0.0
ArgTrp: 0.861 ± 0.917
ArgTyr: 0.861 ± 0.917
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.303 ± 0.733
SerCys: 0.0 ± 0.0
SerAsp: 1.721 ± 0.825
SerGlu: 6.885 ± 1.97
SerPhe: 2.582 ± 0.092
SerGly: 5.164 ± 2.844
SerHis: 0.861 ± 0.917
SerIle: 3.442 ± 0.32
SerLys: 8.606 ± 2.795
SerLeu: 6.885 ± 1.97
SerMet: 0.0 ± 0.0
SerAsn: 3.442 ± 0.32
SerPro: 5.164 ± 2.475
SerGln: 3.442 ± 1.65
SerArg: 1.721 ± 0.825
SerSer: 6.885 ± 3.3
SerThr: 2.582 ± 1.422
SerVal: 3.442 ± 1.65
SerTrp: 0.0 ± 0.0
SerTyr: 1.721 ± 0.505
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.721 ± 0.825
ThrCys: 1.721 ± 0.505
ThrAsp: 4.303 ± 0.733
ThrGlu: 0.861 ± 0.413
ThrPhe: 1.721 ± 1.834
ThrGly: 4.303 ± 0.597
ThrHis: 1.721 ± 0.505
ThrIle: 6.024 ± 0.228
ThrLys: 2.582 ± 1.238
ThrLeu: 3.442 ± 0.32
ThrMet: 2.582 ± 1.238
ThrAsn: 2.582 ± 0.092
ThrPro: 8.606 ± 1.193
ThrGln: 5.164 ± 2.475
ThrArg: 0.861 ± 0.413
ThrSer: 4.303 ± 0.733
ThrThr: 10.327 ± 3.62
ThrVal: 3.442 ± 2.339
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.721 ± 0.825
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.442 ± 0.32
ValCys: 1.721 ± 0.505
ValAsp: 2.582 ± 1.238
ValGlu: 1.721 ± 0.505
ValPhe: 1.721 ± 0.505
ValGly: 2.582 ± 1.422
ValHis: 0.861 ± 0.413
ValIle: 4.303 ± 0.733
ValLys: 0.0 ± 0.0
ValLeu: 7.745 ± 2.936
ValMet: 0.861 ± 0.917
ValAsn: 0.861 ± 0.413
ValPro: 3.442 ± 1.009
ValGln: 5.164 ± 1.514
ValArg: 1.721 ± 0.505
ValSer: 3.442 ± 0.32
ValThr: 0.0 ± 0.0
ValVal: 2.582 ± 0.092
ValTrp: 0.0 ± 0.0
ValTyr: 1.721 ± 0.825
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.721 ± 0.505
TrpCys: 0.0 ± 0.0
TrpAsp: 1.721 ± 1.834
TrpGlu: 1.721 ± 0.505
TrpPhe: 0.861 ± 0.413
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.861 ± 0.917
TrpLys: 1.721 ± 0.505
TrpLeu: 0.861 ± 0.917
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.721 ± 0.505
TrpGln: 0.0 ± 0.0
TrpArg: 1.721 ± 0.505
TrpSer: 0.0 ± 0.0
TrpThr: 0.861 ± 0.917
TrpVal: 0.861 ± 0.917
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.861 ± 0.413
TyrCys: 0.0 ± 0.0
TyrAsp: 0.861 ± 0.413
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.721 ± 0.825
TyrGly: 2.582 ± 1.422
TyrHis: 0.861 ± 0.917
TyrIle: 0.861 ± 0.413
TyrLys: 0.0 ± 0.0
TyrLeu: 3.442 ± 0.32
TyrMet: 0.861 ± 0.917
TyrAsn: 0.861 ± 0.413
TyrPro: 3.442 ± 2.339
TyrGln: 3.442 ± 1.009
TyrArg: 0.0 ± 0.0
TyrSer: 0.861 ± 0.413
TyrThr: 1.721 ± 0.505
TyrVal: 4.303 ± 0.597
TyrTrp: 0.861 ± 0.917
TyrTyr: 0.861 ± 0.917
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (1163 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
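One plausible reading of "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption for illustration only. The source does not give the exact procedure, the function names `pair_permille` and `bootstrap_error` are hypothetical, and the choice of the population standard deviation as the spread measure is also an assumption.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

With only 2 proteins, as in this proteome, each resample can contain only a few distinct combinations of proteins, so error estimates obtained this way are necessarily coarse.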

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski