Amino acid dipepetide frequency for Beihai sea slater virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.943AlaAla: 2.943 ± 1.138
0.268AlaCys: 0.268 ± 0.144
3.21AlaAsp: 3.21 ± 0.386
4.013AlaGlu: 4.013 ± 0.818
2.943AlaPhe: 2.943 ± 0.69
2.675AlaGly: 2.675 ± 0.351
1.07AlaHis: 1.07 ± 0.129
2.943AlaIle: 2.943 ± 0.655
3.21AlaLys: 3.21 ± 0.386
3.745AlaLeu: 3.745 ± 0.674
0.268AlaMet: 0.268 ± 0.144
2.943AlaAsn: 2.943 ± 0.69
1.605AlaPro: 1.605 ± 0.928
1.338AlaGln: 1.338 ± 0.176
3.478AlaArg: 3.478 ± 0.082
2.943AlaSer: 2.943 ± 1.104
2.943AlaThr: 2.943 ± 0.655
3.478AlaVal: 3.478 ± 0.53
0.268AlaTrp: 0.268 ± 0.144
2.14AlaTyr: 2.14 ± 1.154
0.0AlaXaa: 0.0 ± 0.0
Cys
0.268CysAla: 0.268 ± 0.304
0.535CysCys: 0.535 ± 0.16
1.873CysAsp: 1.873 ± 0.561
0.803CysGlu: 0.803 ± 0.433
1.605CysPhe: 1.605 ± 0.928
1.605CysGly: 1.605 ± 0.031
0.535CysHis: 0.535 ± 0.16
0.268CysIle: 0.268 ± 0.144
3.478CysLys: 3.478 ± 0.082
1.338CysLeu: 1.338 ± 0.176
0.268CysMet: 0.268 ± 0.144
0.268CysAsn: 0.268 ± 0.144
1.07CysPro: 1.07 ± 0.577
0.535CysGln: 0.535 ± 0.608
0.535CysArg: 0.535 ± 0.288
3.478CysSer: 3.478 ± 0.53
0.268CysThr: 0.268 ± 0.304
1.07CysVal: 1.07 ± 0.577
0.0CysTrp: 0.0 ± 0.0
0.803CysTyr: 0.803 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
2.14AspAla: 2.14 ± 0.706
4.013AspCys: 4.013 ± 1.267
2.943AspAsp: 2.943 ± 0.207
6.421AspGlu: 6.421 ± 1.22
4.548AspPhe: 4.548 ± 0.238
5.083AspGly: 5.083 ± 2.64
1.07AspHis: 1.07 ± 0.32
4.013AspIle: 4.013 ± 0.37
4.013AspLys: 4.013 ± 0.078
7.758AspLeu: 7.758 ± 1.044
1.605AspMet: 1.605 ± 0.48
3.478AspAsn: 3.478 ± 0.082
2.14AspPro: 2.14 ± 0.257
1.605AspGln: 1.605 ± 0.417
3.21AspArg: 3.21 ± 0.386
4.28AspSer: 4.28 ± 0.831
4.28AspThr: 4.28 ± 1.279
2.675AspVal: 2.675 ± 0.097
0.803AspTrp: 0.803 ± 0.464
2.675AspTyr: 2.675 ± 0.994
0.0AspXaa: 0.0 ± 0.0
Glu
3.21GluAla: 3.21 ± 0.834
0.803GluCys: 0.803 ± 0.016
5.35GluAsp: 5.35 ± 1.54
8.293GluGlu: 8.293 ± 2.23
2.675GluPhe: 2.675 ± 0.351
1.605GluGly: 1.605 ± 0.417
0.803GluHis: 0.803 ± 0.016
3.21GluIle: 3.21 ± 1.408
6.153GluLys: 6.153 ± 1.972
3.21GluLeu: 3.21 ± 0.386
2.675GluMet: 2.675 ± 0.546
2.943GluAsn: 2.943 ± 0.655
2.408GluPro: 2.408 ± 0.047
3.478GluGln: 3.478 ± 0.53
4.013GluArg: 4.013 ± 0.818
4.28GluSer: 4.28 ± 0.066
3.478GluThr: 3.478 ± 0.978
4.013GluVal: 4.013 ± 1.715
2.14GluTrp: 2.14 ± 0.706
3.745GluTyr: 3.745 ± 0.223
0.0GluXaa: 0.0 ± 0.0
Phe
3.21PheAla: 3.21 ± 0.834
0.268PheCys: 0.268 ± 0.144
3.21PheAsp: 3.21 ± 0.386
2.408PheGlu: 2.408 ± 1.298
2.408PhePhe: 2.408 ± 0.85
4.28PheGly: 4.28 ± 0.831
1.338PheHis: 1.338 ± 0.273
2.14PheIle: 2.14 ± 1.088
3.21PheLys: 3.21 ± 0.386
2.14PheLeu: 2.14 ± 0.706
1.605PheMet: 1.605 ± 0.417
5.083PheAsn: 5.083 ± 0.499
2.14PhePro: 2.14 ± 0.191
1.07PheGln: 1.07 ± 0.129
1.605PheArg: 1.605 ± 0.417
2.408PheSer: 2.408 ± 0.944
3.478PheThr: 3.478 ± 0.53
5.083PheVal: 5.083 ± 0.499
0.535PheTrp: 0.535 ± 0.288
2.408PheTyr: 2.408 ± 0.401
0.0PheXaa: 0.0 ± 0.0
Gly
2.943GlyAla: 2.943 ± 1.552
0.535GlyCys: 0.535 ± 0.608
3.745GlyAsp: 3.745 ± 0.671
2.408GlyGlu: 2.408 ± 0.944
2.943GlyPhe: 2.943 ± 0.207
1.605GlyGly: 1.605 ± 0.48
1.338GlyHis: 1.338 ± 0.176
4.815GlyIle: 4.815 ± 0.354
4.548GlyLys: 4.548 ± 0.659
4.548GlyLeu: 4.548 ± 0.238
1.07GlyMet: 1.07 ± 0.129
4.815GlyAsn: 4.815 ± 0.542
1.605GlyPro: 1.605 ± 0.928
1.07GlyGln: 1.07 ± 0.129
2.408GlyArg: 2.408 ± 0.495
2.675GlySer: 2.675 ± 0.351
2.943GlyThr: 2.943 ± 1.104
2.408GlyVal: 2.408 ± 0.495
0.0GlyTrp: 0.0 ± 0.0
2.943GlyTyr: 2.943 ± 0.655
0.0GlyXaa: 0.0 ± 0.0
His
0.535HisAla: 0.535 ± 0.288
0.535HisCys: 0.535 ± 0.288
0.268HisAsp: 0.268 ± 0.144
1.07HisGlu: 1.07 ± 0.577
1.338HisPhe: 1.338 ± 0.624
1.07HisGly: 1.07 ± 0.129
0.535HisHis: 0.535 ± 0.608
1.338HisIle: 1.338 ± 0.273
1.338HisLys: 1.338 ± 0.176
2.14HisLeu: 2.14 ± 0.706
0.535HisMet: 0.535 ± 0.288
0.803HisAsn: 0.803 ± 0.016
0.803HisPro: 0.803 ± 0.433
1.07HisGln: 1.07 ± 0.768
0.268HisArg: 0.268 ± 0.144
2.675HisSer: 2.675 ± 1.248
0.803HisThr: 0.803 ± 0.016
2.408HisVal: 2.408 ± 0.495
0.268HisTrp: 0.268 ± 0.144
0.268HisTyr: 0.268 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
1.873IleAla: 1.873 ± 0.113
1.338IleCys: 1.338 ± 0.176
5.618IleAsp: 5.618 ± 0.11
1.338IleGlu: 1.338 ± 0.721
2.408IlePhe: 2.408 ± 1.298
2.675IleGly: 2.675 ± 1.248
0.803IleHis: 0.803 ± 0.464
2.14IleIle: 2.14 ± 0.64
3.745IleLys: 3.745 ± 0.674
4.013IleLeu: 4.013 ± 0.37
1.07IleMet: 1.07 ± 0.129
3.745IleAsn: 3.745 ± 0.226
2.14IlePro: 2.14 ± 1.536
2.14IleGln: 2.14 ± 0.191
1.338IleArg: 1.338 ± 0.176
5.083IleSer: 5.083 ± 2.192
4.28IleThr: 4.28 ± 2.176
7.223IleVal: 7.223 ± 0.756
0.268IleTrp: 0.268 ± 0.144
1.873IleTyr: 1.873 ± 0.335
0.0IleXaa: 0.0 ± 0.0
Lys
2.943LysAla: 2.943 ± 0.69
1.07LysCys: 1.07 ± 0.577
4.548LysAsp: 4.548 ± 1.555
6.956LysGlu: 6.956 ± 1.957
4.815LysPhe: 4.815 ± 0.354
4.28LysGly: 4.28 ± 0.963
1.338LysHis: 1.338 ± 0.721
4.28LysIle: 4.28 ± 0.514
6.421LysLys: 6.421 ± 2.117
5.886LysLeu: 5.886 ± 2.277
1.338LysMet: 1.338 ± 0.624
4.548LysAsn: 4.548 ± 1.107
3.745LysPro: 3.745 ± 1.119
1.338LysGln: 1.338 ± 0.273
2.14LysArg: 2.14 ± 0.706
4.815LysSer: 4.815 ± 2.596
3.21LysThr: 3.21 ± 0.386
6.153LysVal: 6.153 ± 0.179
0.803LysTrp: 0.803 ± 0.016
4.013LysTyr: 4.013 ± 0.37
0.0LysXaa: 0.0 ± 0.0
Leu
4.28LeuAla: 4.28 ± 0.514
1.338LeuCys: 1.338 ± 0.721
7.758LeuAsp: 7.758 ± 1.493
6.153LeuGlu: 6.153 ± 1.166
3.478LeuPhe: 3.478 ± 1.427
4.815LeuGly: 4.815 ± 0.094
1.873LeuHis: 1.873 ± 0.561
4.548LeuIle: 4.548 ± 0.238
8.561LeuLys: 8.561 ± 2.822
6.956LeuLeu: 6.956 ± 1.508
1.07LeuMet: 1.07 ± 0.32
2.408LeuAsn: 2.408 ± 0.047
4.013LeuPro: 4.013 ± 1.423
0.803LeuGln: 0.803 ± 0.016
2.943LeuArg: 2.943 ± 0.69
7.223LeuSer: 7.223 ± 0.307
4.013LeuThr: 4.013 ± 1.267
4.548LeuVal: 4.548 ± 2.48
1.338LeuTrp: 1.338 ± 0.273
3.745LeuTyr: 3.745 ± 1.568
0.0LeuXaa: 0.0 ± 0.0
Met
1.338MetAla: 1.338 ± 0.176
0.268MetCys: 0.268 ± 0.144
1.07MetAsp: 1.07 ± 0.32
2.408MetGlu: 2.408 ± 0.047
1.07MetPhe: 1.07 ± 0.129
0.268MetGly: 0.268 ± 0.144
0.803MetHis: 0.803 ± 0.016
0.268MetIle: 0.268 ± 0.144
1.873MetLys: 1.873 ± 0.113
1.605MetLeu: 1.605 ± 0.928
0.268MetMet: 0.268 ± 0.144
0.803MetAsn: 0.803 ± 0.433
2.408MetPro: 2.408 ± 0.944
1.338MetGln: 1.338 ± 0.624
1.338MetArg: 1.338 ± 0.273
1.07MetSer: 1.07 ± 0.577
1.605MetThr: 1.605 ± 0.48
1.07MetVal: 1.07 ± 0.32
0.268MetTrp: 0.268 ± 0.144
1.605MetTyr: 1.605 ± 0.48
0.0MetXaa: 0.0 ± 0.0
Asn
1.873AsnAla: 1.873 ± 0.113
2.408AsnCys: 2.408 ± 0.047
1.873AsnAsp: 1.873 ± 0.784
1.338AsnGlu: 1.338 ± 0.176
2.408AsnPhe: 2.408 ± 0.047
2.675AsnGly: 2.675 ± 1.248
0.268AsnHis: 0.268 ± 0.144
2.675AsnIle: 2.675 ± 1.248
3.745AsnLys: 3.745 ± 1.571
6.153AsnLeu: 6.153 ± 0.179
0.535AsnMet: 0.535 ± 0.288
2.943AsnAsn: 2.943 ± 1.104
4.013AsnPro: 4.013 ± 0.818
2.943AsnGln: 2.943 ± 0.242
3.21AsnArg: 3.21 ± 0.063
4.815AsnSer: 4.815 ± 0.094
1.605AsnThr: 1.605 ± 0.48
4.28AsnVal: 4.28 ± 0.514
1.07AsnTrp: 1.07 ± 0.577
3.21AsnTyr: 3.21 ± 0.959
0.0AsnXaa: 0.0 ± 0.0
Pro
2.14ProAla: 2.14 ± 1.088
0.803ProCys: 0.803 ± 0.912
1.605ProAsp: 1.605 ± 0.48
2.14ProGlu: 2.14 ± 0.706
3.21ProPhe: 3.21 ± 0.063
2.675ProGly: 2.675 ± 2.145
1.338ProHis: 1.338 ± 0.273
2.675ProIle: 2.675 ± 0.799
2.675ProLys: 2.675 ± 0.351
4.28ProLeu: 4.28 ± 0.831
0.268ProMet: 0.268 ± 0.144
2.408ProAsn: 2.408 ± 0.047
4.013ProPro: 4.013 ± 2.164
1.605ProGln: 1.605 ± 0.48
2.14ProArg: 2.14 ± 0.257
4.28ProSer: 4.28 ± 1.728
2.675ProThr: 2.675 ± 0.351
3.745ProVal: 3.745 ± 0.223
0.803ProTrp: 0.803 ± 0.464
2.408ProTyr: 2.408 ± 0.944
0.0ProXaa: 0.0 ± 0.0
Gln
1.338GlnAla: 1.338 ± 0.273
0.535GlnCys: 0.535 ± 0.288
2.943GlnAsp: 2.943 ± 1.552
2.675GlnGlu: 2.675 ± 0.994
1.07GlnPhe: 1.07 ± 0.577
1.338GlnGly: 1.338 ± 0.721
0.268GlnHis: 0.268 ± 0.144
1.873GlnIle: 1.873 ± 0.113
2.675GlnLys: 2.675 ± 0.994
2.675GlnLeu: 2.675 ± 0.799
1.338GlnMet: 1.338 ± 0.176
0.803GlnAsn: 0.803 ± 0.433
0.535GlnPro: 0.535 ± 0.608
0.535GlnGln: 0.535 ± 0.288
1.07GlnArg: 1.07 ± 0.32
2.675GlnSer: 2.675 ± 0.351
1.873GlnThr: 1.873 ± 1.232
2.675GlnVal: 2.675 ± 0.799
0.268GlnTrp: 0.268 ± 0.304
0.535GlnTyr: 0.535 ± 0.288
0.0GlnXaa: 0.0 ± 0.0
Arg
2.14ArgAla: 2.14 ± 0.706
1.07ArgCys: 1.07 ± 0.129
2.14ArgAsp: 2.14 ± 0.191
1.07ArgGlu: 1.07 ± 0.32
2.14ArgPhe: 2.14 ± 0.706
1.873ArgGly: 1.873 ± 0.561
2.14ArgHis: 2.14 ± 0.257
2.408ArgIle: 2.408 ± 0.944
3.478ArgLys: 3.478 ± 0.978
3.21ArgLeu: 3.21 ± 0.834
2.14ArgMet: 2.14 ± 0.191
1.338ArgAsn: 1.338 ± 0.721
2.14ArgPro: 2.14 ± 0.257
1.07ArgGln: 1.07 ± 0.577
2.14ArgArg: 2.14 ± 0.706
2.14ArgSer: 2.14 ± 1.154
1.873ArgThr: 1.873 ± 0.113
3.478ArgVal: 3.478 ± 0.082
0.803ArgTrp: 0.803 ± 0.464
3.21ArgTyr: 3.21 ± 0.959
0.0ArgXaa: 0.0 ± 0.0
Ser
4.815SerAla: 4.815 ± 0.991
1.873SerCys: 1.873 ± 0.784
4.548SerAsp: 4.548 ± 0.21
4.548SerGlu: 4.548 ± 0.659
2.943SerPhe: 2.943 ± 0.242
4.815SerGly: 4.815 ± 0.542
2.14SerHis: 2.14 ± 0.64
3.478SerIle: 3.478 ± 0.53
5.083SerLys: 5.083 ± 1.395
6.688SerLeu: 6.688 ± 0.019
0.803SerMet: 0.803 ± 0.016
3.21SerAsn: 3.21 ± 0.511
2.675SerPro: 2.675 ± 1.248
2.943SerGln: 2.943 ± 0.207
2.408SerArg: 2.408 ± 0.401
7.758SerSer: 7.758 ± 1.646
5.35SerThr: 5.35 ± 0.702
5.618SerVal: 5.618 ± 1.455
0.268SerTrp: 0.268 ± 0.144
3.478SerTyr: 3.478 ± 0.082
0.0SerXaa: 0.0 ± 0.0
Thr
4.013ThrAla: 4.013 ± 0.078
0.535ThrCys: 0.535 ± 0.16
3.745ThrAsp: 3.745 ± 2.016
5.083ThrGlu: 5.083 ± 0.05
2.675ThrPhe: 2.675 ± 0.097
2.408ThrGly: 2.408 ± 0.944
0.535ThrHis: 0.535 ± 0.16
3.478ThrIle: 3.478 ± 0.082
2.943ThrLys: 2.943 ± 0.207
4.013ThrLeu: 4.013 ± 1.423
1.07ThrMet: 1.07 ± 1.067
2.675ThrAsn: 2.675 ± 0.546
3.745ThrPro: 3.745 ± 1.119
1.338ThrGln: 1.338 ± 0.176
2.675ThrArg: 2.675 ± 0.097
4.28ThrSer: 4.28 ± 0.514
4.013ThrThr: 4.013 ± 0.527
5.35ThrVal: 5.35 ± 0.195
0.0ThrTrp: 0.0 ± 0.0
1.338ThrTyr: 1.338 ± 0.176
0.0ThrXaa: 0.0 ± 0.0
Val
3.745ValAla: 3.745 ± 1.123
1.338ValCys: 1.338 ± 0.176
7.491ValAsp: 7.491 ± 0.003
5.886ValGlu: 5.886 ± 1.38
2.675ValPhe: 2.675 ± 0.546
2.408ValGly: 2.408 ± 0.047
1.07ValHis: 1.07 ± 0.129
4.013ValIle: 4.013 ± 0.078
4.548ValLys: 4.548 ± 1.555
5.35ValLeu: 5.35 ± 0.195
2.408ValMet: 2.408 ± 0.436
4.013ValAsn: 4.013 ± 2.32
5.083ValPro: 5.083 ± 1.295
1.605ValGln: 1.605 ± 0.928
3.478ValArg: 3.478 ± 0.53
4.28ValSer: 4.28 ± 0.831
3.745ValThr: 3.745 ± 0.671
5.083ValVal: 5.083 ± 0.499
0.535ValTrp: 0.535 ± 0.16
4.28ValTyr: 4.28 ± 0.382
0.0ValXaa: 0.0 ± 0.0
Trp
1.07TrpAla: 1.07 ± 0.129
0.535TrpCys: 0.535 ± 0.288
0.803TrpAsp: 0.803 ± 0.464
0.535TrpGlu: 0.535 ± 0.288
0.268TrpPhe: 0.268 ± 0.144
0.803TrpGly: 0.803 ± 0.016
0.268TrpHis: 0.268 ± 0.304
0.803TrpIle: 0.803 ± 0.016
0.803TrpLys: 0.803 ± 0.016
1.338TrpLeu: 1.338 ± 0.721
0.0TrpMet: 0.0 ± 0.0
1.07TrpAsn: 1.07 ± 0.32
0.535TrpPro: 0.535 ± 0.16
0.268TrpGln: 0.268 ± 0.144
0.268TrpArg: 0.268 ± 0.304
1.07TrpSer: 1.07 ± 0.129
0.535TrpThr: 0.535 ± 0.288
0.268TrpVal: 0.268 ± 0.304
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.605TyrAla: 1.605 ± 0.031
0.535TyrCys: 0.535 ± 0.16
4.013TyrAsp: 4.013 ± 0.818
2.943TyrGlu: 2.943 ± 0.655
2.408TyrPhe: 2.408 ± 0.401
2.408TyrGly: 2.408 ± 0.495
0.535TyrHis: 0.535 ± 0.16
3.745TyrIle: 3.745 ± 0.223
2.14TyrLys: 2.14 ± 0.706
4.815TyrLeu: 4.815 ± 0.991
2.14TyrMet: 2.14 ± 1.088
3.745TyrAsn: 3.745 ± 1.119
1.07TyrPro: 1.07 ± 0.32
1.605TyrGln: 1.605 ± 0.417
1.338TyrArg: 1.338 ± 0.273
3.21TyrSer: 3.21 ± 0.386
2.943TyrThr: 2.943 ± 0.207
2.675TyrVal: 2.675 ± 0.351
0.803TyrTrp: 0.803 ± 0.464
4.013TyrTyr: 4.013 ± 2.32
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3739 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski