Amino acid dipeptide frequency for Beihai sobemo-like virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
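As a rough illustration of the calculation described above, the following sketch counts overlapping residue pairs in a set of protein sequences and reports each dipeptide frequency in per mille, together with the uniform expectation of 1/400. The function names (`dipeptide_permille`, `classify`) and the toy sequences are hypothetical, and the pooling and normalisation choices are assumptions; the exact procedure used by Proteome-pI may differ.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands for Xaa.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = AMINO_ACIDS + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_UNIFORM = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Assumption: pairs are counted with overlapping windows inside each
    protein and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            # Fold any non-standard or unknown residue into X (Xaa).
            a = seq[i] if seq[i] in AMINO_ACIDS else "X"
            b = seq[i + 1] if seq[i + 1] in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACD"]  # hypothetical sequences, not the viral proteome
    freqs = dipeptide_permille(toy)
    print(freqs["AA"], classify(freqs["AA"]))
```

With this convention, a table value such as 12.766‰ for AlaAla would be classified as "over" and 0.709‰ as "under" relative to the 2.5‰ uniform expectation.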

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 12.766 ± 4.677 | 4.255 ± 0.658 | 2.837 ± 0.011 | 5.674 ± 1.328 | 5.674 ± 2.679 | 9.22 ± 5.029 | 2.128 ± 1.022 | 3.546 ± 0.999 | 7.092 ± 0.704 | 9.22 ± 3.678 | 0.709 ± 0.335 | 4.255 ± 2.009 | 3.546 ± 0.352 | 0.0 ± 0.0 | 4.965 ± 1.033 | 5.674 ± 1.374 | 4.965 ± 1.033 | 9.929 ± 0.635 | 0.0 ± 0.0 | 2.837 ± 0.011 | 0.0 ± 0.0
Cys | 1.418 ± 0.681 | 2.128 ± 0.329 | 1.418 ± 0.681 | 2.837 ± 0.011 | 1.418 ± 0.681 | 1.418 ± 0.681 | 0.0 ± 0.0 | 0.709 ± 0.341 | 2.837 ± 0.011 | 0.709 ± 0.341 | 0.709 ± 0.341 | 0.0 ± 0.0 | 0.709 ± 0.341 | 0.0 ± 0.0 | 2.128 ± 1.022 | 4.965 ± 0.318 | 0.709 ± 0.341 | 1.418 ± 0.681 | 0.709 ± 1.01 | 0.709 ± 0.341 | 0.0 ± 0.0
Asp | 9.22 ± 2.327 | 2.128 ± 1.022 | 4.965 ± 2.385 | 4.255 ± 2.044 | 5.674 ± 2.725 | 3.546 ± 2.35 | 0.709 ± 1.01 | 2.128 ± 0.329 | 2.128 ± 1.022 | 3.546 ± 1.703 | 2.837 ± 1.363 | 0.709 ± 0.341 | 3.546 ± 0.352 | 1.418 ± 0.681 | 2.128 ± 1.022 | 3.546 ± 1.703 | 2.128 ± 0.329 | 4.255 ± 0.693 | 1.418 ± 0.681 | 3.546 ± 1.703 | 0.0 ± 0.0
Glu | 4.255 ± 0.693 | 0.709 ± 0.341 | 2.837 ± 1.363 | 2.128 ± 0.329 | 0.0 ± 0.0 | 4.255 ± 0.693 | 0.709 ± 0.341 | 1.418 ± 0.681 | 4.965 ± 1.033 | 4.255 ± 0.658 | 1.418 ± 0.681 | 1.418 ± 0.681 | 4.255 ± 2.044 | 0.0 ± 0.0 | 4.255 ± 2.044 | 6.383 ± 0.364 | 4.965 ± 1.033 | 1.418 ± 0.681 | 1.418 ± 0.67 | 3.546 ± 0.352 | 0.0 ± 0.0
Phe | 4.255 ± 0.693 | 0.0 ± 0.0 | 5.674 ± 1.374 | 1.418 ± 0.67 | 1.418 ± 0.681 | 4.965 ± 0.318 | 0.709 ± 0.341 | 0.0 ± 0.0 | 1.418 ± 0.681 | 1.418 ± 0.681 | 1.418 ± 0.681 | 2.128 ± 0.329 | 1.418 ± 0.681 | 0.709 ± 0.341 | 1.418 ± 0.681 | 4.965 ± 1.669 | 3.546 ± 0.352 | 3.546 ± 0.352 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Gly | 7.801 ± 1.657 | 1.418 ± 0.681 | 3.546 ± 0.352 | 5.674 ± 2.725 | 4.255 ± 0.693 | 4.965 ± 0.318 | 1.418 ± 0.681 | 2.837 ± 0.011 | 4.255 ± 0.693 | 4.255 ± 2.044 | 2.128 ± 1.022 | 3.546 ± 2.35 | 2.128 ± 0.329 | 3.546 ± 0.352 | 4.255 ± 2.009 | 4.965 ± 1.669 | 2.837 ± 1.363 | 9.929 ± 3.338 | 0.709 ± 0.341 | 1.418 ± 0.67 | 0.0 ± 0.0
His | 3.546 ± 0.352 | 0.0 ± 0.0 | 0.709 ± 0.341 | 0.709 ± 0.341 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.418 ± 0.681 | 1.418 ± 0.681 | 3.546 ± 0.999 | 1.418 ± 0.67 | 0.709 ± 1.01 | 0.709 ± 0.341 | 2.128 ± 1.68 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.418 ± 0.681 | 0.709 ± 1.01 | 2.837 ± 1.363 | 0.709 ± 0.341 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 5.674 ± 1.328 | 2.128 ± 0.329 | 4.255 ± 0.693 | 2.837 ± 1.363 | 1.418 ± 0.67 | 2.837 ± 0.011 | 0.709 ± 0.341 | 0.709 ± 0.341 | 1.418 ± 0.681 | 1.418 ± 0.67 | 1.418 ± 0.67 | 1.418 ± 0.681 | 0.709 ± 1.01 | 0.709 ± 1.01 | 0.709 ± 1.01 | 1.418 ± 0.67 | 2.128 ± 1.022 | 1.418 ± 0.681 | 2.128 ± 0.329 | 0.709 ± 0.341 | 0.0 ± 0.0
Lys | 6.383 ± 0.987 | 0.709 ± 0.341 | 2.128 ± 1.022 | 4.255 ± 0.693 | 1.418 ± 0.67 | 3.546 ± 1.703 | 2.128 ± 0.329 | 0.709 ± 0.341 | 1.418 ± 0.681 | 7.092 ± 0.704 | 0.709 ± 0.341 | 4.255 ± 0.658 | 4.255 ± 0.693 | 2.128 ± 1.022 | 5.674 ± 1.374 | 0.709 ± 0.341 | 0.709 ± 0.341 | 2.128 ± 0.329 | 0.0 ± 0.0 | 3.546 ± 0.352 | 0.0 ± 0.0
Leu | 7.092 ± 3.349 | 1.418 ± 0.681 | 5.674 ± 0.023 | 2.837 ± 1.363 | 2.837 ± 1.34 | 4.255 ± 0.693 | 2.128 ± 1.022 | 4.255 ± 2.044 | 4.255 ± 0.693 | 2.837 ± 0.011 | 2.128 ± 1.022 | 1.418 ± 0.681 | 1.418 ± 0.67 | 3.546 ± 0.999 | 1.418 ± 0.67 | 7.801 ± 1.045 | 4.255 ± 0.658 | 7.092 ± 0.647 | 2.128 ± 1.022 | 1.418 ± 0.681 | 0.0 ± 0.0
Met | 0.709 ± 0.341 | 0.709 ± 0.341 | 0.709 ± 0.341 | 0.709 ± 0.341 | 1.418 ± 0.681 | 2.128 ± 1.022 | 1.418 ± 0.681 | 1.418 ± 0.67 | 0.709 ± 0.341 | 1.418 ± 0.681 | 0.0 ± 0.0 | 1.418 ± 0.67 | 2.128 ± 0.329 | 0.709 ± 1.01 | 2.128 ± 1.022 | 1.418 ± 0.67 | 0.709 ± 0.341 | 0.0 ± 0.0 | 1.418 ± 0.681 | 1.418 ± 0.681 | 0.0 ± 0.0
Asn | 2.128 ± 1.022 | 1.418 ± 0.681 | 0.709 ± 0.341 | 0.709 ± 0.341 | 0.709 ± 1.01 | 4.965 ± 0.318 | 0.0 ± 0.0 | 2.128 ± 1.022 | 1.418 ± 0.67 | 3.546 ± 0.999 | 0.709 ± 1.01 | 1.418 ± 2.021 | 4.965 ± 7.073 | 0.0 ± 0.0 | 3.546 ± 2.35 | 2.128 ± 0.329 | 4.255 ± 0.658 | 1.418 ± 0.681 | 0.0 ± 0.0 | 2.128 ± 1.022 | 0.0 ± 0.0
Pro | 3.546 ± 0.999 | 0.0 ± 0.0 | 4.255 ± 0.658 | 5.674 ± 1.374 | 1.418 ± 0.681 | 3.546 ± 1.703 | 1.418 ± 0.67 | 2.837 ± 1.34 | 3.546 ± 0.999 | 3.546 ± 1.703 | 0.0 ± 0.0 | 1.418 ± 2.021 | 2.128 ± 1.022 | 1.418 ± 0.681 | 2.128 ± 3.031 | 3.546 ± 2.35 | 1.418 ± 0.67 | 4.965 ± 0.318 | 0.0 ± 0.0 | 1.418 ± 2.021 | 0.0 ± 0.0
Gln | 2.837 ± 1.363 | 2.837 ± 1.363 | 3.546 ± 1.703 | 0.709 ± 1.01 | 0.0 ± 0.0 | 1.418 ± 0.67 | 0.709 ± 1.01 | 2.128 ± 3.031 | 1.418 ± 0.681 | 2.837 ± 0.011 | 1.418 ± 0.681 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.709 ± 1.01 | 1.418 ± 0.67 | 0.0 ± 0.0 | 1.418 ± 0.67 | 2.128 ± 1.022 | 1.418 ± 0.67 | 0.0 ± 0.0 | 0.0 ± 0.0
Arg | 6.383 ± 0.364 | 1.418 ± 0.67 | 3.546 ± 1.703 | 1.418 ± 0.681 | 4.255 ± 0.693 | 1.418 ± 0.67 | 0.0 ± 0.0 | 1.418 ± 0.67 | 3.546 ± 0.352 | 3.546 ± 1.703 | 0.709 ± 0.341 | 2.128 ± 1.022 | 2.128 ± 0.329 | 1.418 ± 0.681 | 7.092 ± 3.349 | 2.837 ± 0.011 | 4.255 ± 0.658 | 7.092 ± 0.704 | 2.837 ± 2.691 | 2.128 ± 0.329 | 0.0 ± 0.0
Ser | 8.511 ± 1.317 | 0.709 ± 0.341 | 4.965 ± 1.669 | 4.255 ± 0.693 | 2.837 ± 1.363 | 5.674 ± 1.328 | 2.837 ± 0.011 | 1.418 ± 0.67 | 2.128 ± 1.022 | 2.837 ± 0.011 | 2.837 ± 0.011 | 3.546 ± 3.701 | 4.965 ± 0.318 | 0.709 ± 0.341 | 4.965 ± 1.033 | 7.801 ± 1.657 | 4.965 ± 0.318 | 5.674 ± 0.023 | 0.709 ± 1.01 | 2.128 ± 0.329 | 0.0 ± 0.0
Thr | 4.965 ± 3.02 | 2.837 ± 1.363 | 2.128 ± 1.022 | 4.255 ± 0.693 | 2.128 ± 1.022 | 3.546 ± 0.352 | 0.0 ± 0.0 | 1.418 ± 0.67 | 1.418 ± 0.681 | 5.674 ± 1.374 | 1.418 ± 0.681 | 1.418 ± 2.021 | 3.546 ± 0.352 | 0.709 ± 0.341 | 2.837 ± 1.363 | 4.965 ± 1.669 | 6.383 ± 2.339 | 4.255 ± 0.693 | 1.418 ± 2.021 | 1.418 ± 0.681 | 0.0 ± 0.0
Val | 6.383 ± 0.364 | 1.418 ± 0.67 | 4.255 ± 0.693 | 3.546 ± 1.703 | 2.837 ± 1.363 | 5.674 ± 2.725 | 2.128 ± 1.68 | 4.255 ± 2.044 | 2.837 ± 0.011 | 8.511 ± 0.034 | 0.709 ± 0.341 | 3.546 ± 1.703 | 4.255 ± 2.009 | 6.383 ± 0.987 | 3.546 ± 0.352 | 4.965 ± 0.318 | 4.965 ± 0.318 | 4.255 ± 2.044 | 0.0 ± 0.0 | 3.546 ± 0.999 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.418 ± 0.67 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.128 ± 1.68 | 0.709 ± 1.01 | 0.709 ± 0.341 | 2.128 ± 0.329 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.418 ± 0.67 | 0.0 ± 0.0 | 2.128 ± 1.68 | 2.128 ± 0.329 | 1.418 ± 0.67 | 0.0 ± 0.0 | 1.418 ± 0.681 | 0.0 ± 0.0 | 1.418 ± 0.681 | 0.0 ± 0.0
Tyr | 2.128 ± 3.031 | 0.709 ± 0.341 | 4.255 ± 2.044 | 0.709 ± 0.341 | 0.709 ± 0.341 | 6.383 ± 1.715 | 0.709 ± 0.341 | 0.709 ± 1.01 | 1.418 ± 0.681 | 2.128 ± 1.68 | 0.0 ± 0.0 | 2.128 ± 0.329 | 0.0 ± 0.0 | 0.709 ± 0.341 | 2.837 ± 1.363 | 3.546 ± 0.999 | 1.418 ± 0.681 | 2.837 ± 1.363 | 0.0 ± 0.0 | 1.418 ± 0.67 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2 proteins (1411 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
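One possible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. This is only an illustration under that assumption; the helper names (`frequency_permille`, `bootstrap_error`), the use of the population standard deviation, and the fixed seed are hypothetical choices, not a description of the Proteome-pI implementation.

```python
import random
import statistics


def frequency_permille(sequences, dipeptide):
    """Frequency of one dipeptide, in per mille, pooled over the sequences."""
    count = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each resample draws len(sequences) whole proteins with replacement,
    so the protein (not the residue) is the unit of resampling.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(frequency_permille(sample, dipeptide))
    # Spread of the resampled estimates (population standard deviation).
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    proteins = ["AAAA", "CCCC"]  # hypothetical toy proteome
    print(frequency_permille(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

With only a few proteins, as in a small viral proteome, each resample can contain the same protein more than once, so the estimated error can be large relative to the frequency itself.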

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski