Amino acid dipeptide frequency for Beihai sobemo-like virus 25

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
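The counting described above can be sketched as follows. This is a minimal illustration, not the code used by Proteome-pI: it assumes one-letter amino acid codes, overlapping windows of two residues, and pairs counted only within a protein. The function names, the toy sequences, and the use of the uniform 2.5‰ value as the over/under cutoff are all assumptions made for the example.

```python
from collections import Counter

# Uniform expectation from the text: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions: overlapping windows of two residues, counted within each
    protein (no pair spans two proteins), one-letter amino acid codes.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_per_mille, expected=UNIFORM_EXPECTED_PER_MILLE):
    """Classify a frequency against the (assumed) uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not the viral proteome.
toy = ["AAK", "KA"]
freqs = dipeptide_per_mille(toy)
```

With the table values, `label(8.17)` would classify AlaAla as overrepresented and `label(0.817)` would classify CysAla as underrepresented, under the assumed uniform cutoff.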

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
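As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


ala_ala = 8.17  # AlaAla frequency from the table, in per mille
ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.00817
ala_ala_percent = per_mille_to_percent(ala_ala)    # about 0.817%
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.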

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.17 ± 3.548
AlaCys: 0.0 ± 0.0
AlaAsp: 3.268 ± 0.251
AlaGlu: 4.085 ± 0.865
AlaPhe: 1.634 ± 0.567
AlaGly: 6.536 ± 0.636
AlaHis: 1.634 ± 0.713
AlaIle: 0.0 ± 0.0
AlaLys: 1.634 ± 0.872
AlaLeu: 7.353 ± 0.284
AlaMet: 0.817 ± 0.638
AlaAsn: 5.719 ± 0.996
AlaPro: 5.719 ± 1.924
AlaGln: 3.268 ± 1.281
AlaArg: 4.902 ± 1.663
AlaSer: 8.17 ± 0.974
AlaThr: 4.085 ± 0.487
AlaVal: 4.902 ± 1.663
AlaTrp: 1.634 ± 0.872
AlaTyr: 3.268 ± 1.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.817 ± 0.721
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.817 ± 0.534
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.634 ± 1.069
CysLys: 0.0 ± 0.0
CysLeu: 1.634 ± 1.069
CysMet: 0.817 ± 0.463
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.817 ± 0.721
CysArg: 0.0 ± 0.0
CysSer: 0.817 ± 0.534
CysThr: 0.0 ± 0.0
CysVal: 0.817 ± 0.721
CysTrp: 0.0 ± 0.0
CysTyr: 0.817 ± 0.721
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.902 ± 2.728
AspCys: 0.0 ± 0.0
AspAsp: 5.719 ± 2.811
AspGlu: 1.634 ± 0.713
AspPhe: 2.451 ± 0.834
AspGly: 3.268 ± 1.135
AspHis: 0.817 ± 0.534
AspIle: 5.719 ± 2.255
AspLys: 2.451 ± 1.603
AspLeu: 3.268 ± 1.385
AspMet: 1.634 ± 0.713
AspAsn: 1.634 ± 1.442
AspPro: 3.268 ± 1.117
AspGln: 0.0 ± 0.0
AspArg: 2.451 ± 0.834
AspSer: 3.268 ± 1.425
AspThr: 1.634 ± 0.713
AspVal: 1.634 ± 0.567
AspTrp: 0.817 ± 0.721
AspTyr: 3.268 ± 1.135
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.902 ± 1.285
GluCys: 0.817 ± 0.534
GluAsp: 3.268 ± 0.251
GluGlu: 7.353 ± 2.89
GluPhe: 1.634 ± 0.713
GluGly: 4.902 ± 1.285
GluHis: 3.268 ± 2.137
GluIle: 2.451 ± 0.834
GluLys: 4.902 ± 2.728
GluLeu: 2.451 ± 1.523
GluMet: 0.817 ± 0.534
GluAsn: 2.451 ± 1.603
GluPro: 2.451 ± 0.963
GluGln: 1.634 ± 1.069
GluArg: 5.719 ± 2.294
GluSer: 3.268 ± 0.251
GluThr: 0.0 ± 0.0
GluVal: 2.451 ± 0.963
GluTrp: 1.634 ± 0.567
GluTyr: 0.817 ± 0.534
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.536 ± 2.645
PheCys: 0.0 ± 0.0
PheAsp: 0.817 ± 0.534
PheGlu: 1.634 ± 1.069
PhePhe: 1.634 ± 0.567
PheGly: 3.268 ± 1.117
PheHis: 0.817 ± 0.721
PheIle: 0.0 ± 0.0
PheLys: 3.268 ± 0.885
PheLeu: 1.634 ± 0.567
PheMet: 0.817 ± 0.534
PheAsn: 1.634 ± 0.567
PhePro: 2.451 ± 1.603
PheGln: 0.817 ± 0.534
PheArg: 0.817 ± 0.812
PheSer: 4.085 ± 0.756
PheThr: 1.634 ± 1.442
PheVal: 2.451 ± 1.182
PheTrp: 0.817 ± 0.812
PheTyr: 1.634 ± 1.442
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.902 ± 1.663
GlyCys: 0.817 ± 0.534
GlyAsp: 4.085 ± 0.487
GlyGlu: 2.451 ± 0.963
GlyPhe: 5.719 ± 0.996
GlyGly: 5.719 ± 3.538
GlyHis: 1.634 ± 1.069
GlyIle: 1.634 ± 0.713
GlyLys: 5.719 ± 1.817
GlyLeu: 2.451 ± 2.435
GlyMet: 1.634 ± 1.069
GlyAsn: 4.902 ± 3.86
GlyPro: 6.536 ± 1.114
GlyGln: 1.634 ± 0.567
GlyArg: 2.451 ± 1.523
GlySer: 6.536 ± 2.905
GlyThr: 3.268 ± 2.883
GlyVal: 4.085 ± 2.672
GlyTrp: 0.817 ± 0.534
GlyTyr: 2.451 ± 0.834
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.634 ± 1.069
HisCys: 0.0 ± 0.0
HisAsp: 1.634 ± 1.623
HisGlu: 1.634 ± 0.713
HisPhe: 0.0 ± 0.0
HisGly: 0.817 ± 0.534
HisHis: 0.817 ± 0.534
HisIle: 3.268 ± 2.137
HisLys: 0.817 ± 0.534
HisLeu: 2.451 ± 1.603
HisMet: 0.817 ± 0.534
HisAsn: 0.0 ± 0.0
HisPro: 0.817 ± 0.534
HisGln: 0.817 ± 0.534
HisArg: 1.634 ± 0.713
HisSer: 1.634 ± 0.567
HisThr: 1.634 ± 0.872
HisVal: 3.268 ± 2.137
HisTrp: 0.0 ± 0.0
HisTyr: 1.634 ± 0.567
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.268 ± 1.281
IleCys: 0.0 ± 0.0
IleAsp: 1.634 ± 1.069
IleGlu: 3.268 ± 2.137
IlePhe: 0.0 ± 0.0
IleGly: 3.268 ± 0.885
IleHis: 0.817 ± 0.534
IleIle: 2.451 ± 0.834
IleLys: 3.268 ± 1.425
IleLeu: 3.268 ± 0.885
IleMet: 1.634 ± 0.567
IleAsn: 2.451 ± 1.182
IlePro: 2.451 ± 0.355
IleGln: 1.634 ± 1.442
IleArg: 3.268 ± 0.885
IleSer: 0.817 ± 0.721
IleThr: 3.268 ± 1.874
IleVal: 3.268 ± 1.385
IleTrp: 0.0 ± 0.0
IleTyr: 3.268 ± 1.135
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.451 ± 0.963
LysCys: 0.817 ± 0.721
LysAsp: 4.902 ± 1.873
LysGlu: 6.536 ± 2.296
LysPhe: 3.268 ± 1.117
LysGly: 2.451 ± 1.603
LysHis: 2.451 ± 1.603
LysIle: 2.451 ± 1.431
LysLys: 8.987 ± 3.727
LysLeu: 8.17 ± 3.563
LysMet: 1.634 ± 0.567
LysAsn: 2.451 ± 1.431
LysPro: 4.085 ± 1.608
LysGln: 4.085 ± 1.608
LysArg: 6.536 ± 3.273
LysSer: 3.268 ± 2.137
LysThr: 6.536 ± 0.636
LysVal: 3.268 ± 1.117
LysTrp: 0.0 ± 0.0
LysTyr: 2.451 ± 0.963
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.268 ± 0.251
LeuCys: 1.634 ± 0.567
LeuAsp: 3.268 ± 0.251
LeuGlu: 6.536 ± 1.423
LeuPhe: 4.085 ± 1.219
LeuGly: 6.536 ± 1.61
LeuHis: 2.451 ± 1.431
LeuIle: 0.0 ± 0.0
LeuLys: 4.902 ± 2.138
LeuLeu: 5.719 ± 1.545
LeuMet: 1.634 ± 0.567
LeuAsn: 3.268 ± 0.251
LeuPro: 6.536 ± 1.77
LeuGln: 2.451 ± 1.431
LeuArg: 4.085 ± 0.487
LeuSer: 5.719 ± 0.996
LeuThr: 2.451 ± 1.379
LeuVal: 8.17 ± 2.172
LeuTrp: 0.817 ± 0.534
LeuTyr: 2.451 ± 0.963
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.451 ± 2.163
MetCys: 0.0 ± 0.0
MetAsp: 1.634 ± 0.567
MetGlu: 1.634 ± 1.069
MetPhe: 1.634 ± 1.442
MetGly: 0.817 ± 0.534
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.634 ± 0.713
MetLeu: 0.817 ± 0.721
MetMet: 0.817 ± 0.534
MetAsn: 0.817 ± 0.812
MetPro: 1.634 ± 0.567
MetGln: 1.634 ± 1.069
MetArg: 0.0 ± 0.0
MetSer: 1.634 ± 1.069
MetThr: 2.451 ± 1.603
MetVal: 0.817 ± 0.812
MetTrp: 0.0 ± 0.0
MetTyr: 0.817 ± 0.534
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.451 ± 1.182
AsnCys: 0.0 ± 0.0
AsnAsp: 0.817 ± 0.812
AsnGlu: 1.634 ± 0.567
AsnPhe: 0.817 ± 0.534
AsnGly: 2.451 ± 0.355
AsnHis: 1.634 ± 0.872
AsnIle: 3.268 ± 2.02
AsnLys: 0.817 ± 0.721
AsnLeu: 4.085 ± 1.608
AsnMet: 0.0 ± 0.0
AsnAsn: 3.268 ± 1.744
AsnPro: 2.451 ± 0.355
AsnGln: 1.634 ± 0.567
AsnArg: 0.817 ± 0.812
AsnSer: 6.536 ± 2.645
AsnThr: 2.451 ± 1.182
AsnVal: 2.451 ± 0.355
AsnTrp: 2.451 ± 0.355
AsnTyr: 3.268 ± 1.117
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.719 ± 1.817
ProCys: 0.0 ± 0.0
ProAsp: 3.268 ± 0.251
ProGlu: 1.634 ± 1.069
ProPhe: 2.451 ± 0.355
ProGly: 5.719 ± 0.304
ProHis: 0.817 ± 0.721
ProIle: 3.268 ± 0.885
ProLys: 6.536 ± 2.554
ProLeu: 2.451 ± 1.431
ProMet: 0.817 ± 0.812
ProAsn: 1.634 ± 0.872
ProPro: 3.268 ± 2.215
ProGln: 1.634 ± 0.872
ProArg: 3.268 ± 1.135
ProSer: 4.085 ± 1.709
ProThr: 7.353 ± 1.325
ProVal: 5.719 ± 0.996
ProTrp: 0.817 ± 0.812
ProTyr: 0.817 ± 0.534
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.634 ± 0.567
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.634 ± 1.623
GlnPhe: 1.634 ± 1.442
GlnGly: 0.817 ± 0.721
GlnHis: 0.817 ± 0.534
GlnIle: 0.0 ± 0.0
GlnLys: 1.634 ± 0.713
GlnLeu: 0.817 ± 0.534
GlnMet: 0.817 ± 0.721
GlnAsn: 0.817 ± 0.812
GlnPro: 1.634 ± 0.713
GlnGln: 0.817 ± 0.534
GlnArg: 2.451 ± 0.963
GlnSer: 5.719 ± 1.14
GlnThr: 1.634 ± 1.442
GlnVal: 4.902 ± 2.37
GlnTrp: 2.451 ± 1.182
GlnTyr: 4.085 ± 1.866
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.268 ± 1.117
ArgCys: 1.634 ± 1.069
ArgAsp: 1.634 ± 0.567
ArgGlu: 1.634 ± 1.069
ArgPhe: 0.0 ± 0.0
ArgGly: 3.268 ± 1.135
ArgHis: 1.634 ± 1.069
ArgIle: 0.0 ± 0.0
ArgLys: 8.987 ± 6.782
ArgLeu: 6.536 ± 0.636
ArgMet: 0.817 ± 0.721
ArgAsn: 0.817 ± 0.721
ArgPro: 2.451 ± 0.963
ArgGln: 4.902 ± 1.659
ArgArg: 3.268 ± 1.281
ArgSer: 3.268 ± 1.385
ArgThr: 2.451 ± 1.603
ArgVal: 3.268 ± 2.02
ArgTrp: 1.634 ± 1.069
ArgTyr: 3.268 ± 1.117
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.536 ± 1.77
SerCys: 0.0 ± 0.0
SerAsp: 8.987 ± 1.834
SerGlu: 1.634 ± 1.069
SerPhe: 0.817 ± 0.721
SerGly: 6.536 ± 0.636
SerHis: 1.634 ± 0.713
SerIle: 5.719 ± 2.094
SerLys: 4.902 ± 0.905
SerLeu: 4.902 ± 0.905
SerMet: 2.451 ± 0.834
SerAsn: 3.268 ± 0.885
SerPro: 5.719 ± 1.14
SerGln: 2.451 ± 1.182
SerArg: 3.268 ± 1.425
SerSer: 6.536 ± 1.662
SerThr: 5.719 ± 4.012
SerVal: 7.353 ± 2.451
SerTrp: 0.0 ± 0.0
SerTyr: 3.268 ± 1.135
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.451 ± 1.379
ThrCys: 1.634 ± 0.567
ThrAsp: 1.634 ± 0.567
ThrGlu: 2.451 ± 0.834
ThrPhe: 3.268 ± 1.874
ThrGly: 3.268 ± 1.744
ThrHis: 0.817 ± 0.534
ThrIle: 6.536 ± 1.77
ThrLys: 4.902 ± 1.285
ThrLeu: 5.719 ± 2.998
ThrMet: 0.0 ± 0.0
ThrAsn: 4.085 ± 1.709
ThrPro: 4.085 ± 2.582
ThrGln: 2.451 ± 1.379
ThrArg: 1.634 ± 1.069
ThrSer: 7.353 ± 3.583
ThrThr: 4.085 ± 2.702
ThrVal: 3.268 ± 1.135
ThrTrp: 0.817 ± 0.534
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.536 ± 0.502
ValCys: 0.0 ± 0.0
ValAsp: 1.634 ± 1.069
ValGlu: 4.085 ± 3.014
ValPhe: 2.451 ± 1.603
ValGly: 6.536 ± 2.7
ValHis: 2.451 ± 1.603
ValIle: 3.268 ± 1.135
ValLys: 6.536 ± 2.771
ValLeu: 4.902 ± 1.702
ValMet: 1.634 ± 0.567
ValAsn: 3.268 ± 1.874
ValPro: 2.451 ± 0.963
ValGln: 0.817 ± 0.812
ValArg: 4.902 ± 1.204
ValSer: 5.719 ± 1.857
ValThr: 6.536 ± 1.77
ValVal: 4.085 ± 1.575
ValTrp: 0.817 ± 0.534
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.268 ± 2.279
TrpCys: 0.0 ± 0.0
TrpAsp: 0.817 ± 0.534
TrpGlu: 2.451 ± 0.834
TrpPhe: 0.817 ± 0.534
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.817 ± 0.534
TrpLys: 1.634 ± 0.567
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.817 ± 0.721
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.817 ± 0.534
TrpSer: 0.817 ± 0.812
TrpThr: 3.268 ± 1.135
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.451 ± 0.355
TyrCys: 1.634 ± 0.567
TyrAsp: 0.817 ± 0.721
TyrGlu: 3.268 ± 1.281
TyrPhe: 1.634 ± 0.567
TyrGly: 3.268 ± 1.425
TyrHis: 0.817 ± 0.534
TyrIle: 0.817 ± 0.721
TyrLys: 3.268 ± 1.117
TyrLeu: 7.353 ± 0.991
TyrMet: 0.817 ± 0.591
TyrAsn: 0.0 ± 0.0
TyrPro: 3.268 ± 1.281
TyrGln: 0.817 ± 0.534
TyrArg: 2.451 ± 1.603
TyrSer: 2.451 ± 0.355
TyrThr: 0.0 ± 0.0
TyrVal: 2.451 ± 0.355
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.817 ± 0.721
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1225 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
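The page does not describe the bootstrap procedure in detail. One plausible reading, sketched below, resamples whole proteins with replacement 100 times, recomputes the dipeptide frequency for each resample, and reports the standard deviation of those estimates as the error. The function name, the seed, the choice of the population standard deviation, and the toy sequences are assumptions for illustration, not the Proteome-pI implementation.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Assumption: each round draws len(proteins) proteins with replacement
    and counts overlapping residue pairs within each drawn protein.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    # Population standard deviation of the resampled frequencies.
    return statistics.pstdev(estimates)


# Hypothetical toy sequences in one-letter code, not the viral proteome.
toy = ["AAKA", "KKAA", "AKAK"]
error_aa = bootstrap_error(toy, "AA")
```

With only 3 proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with the relatively large errors in the table.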

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski