Amino acid dipepetide frequency for Beihai sea slater virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.576AlaAla: 7.576 ± 3.091
0.842AlaCys: 0.842 ± 0.202
3.788AlaAsp: 3.788 ± 1.864
5.892AlaGlu: 5.892 ± 2.08
2.525AlaPhe: 2.525 ± 1.063
3.367AlaGly: 3.367 ± 1.327
1.263AlaHis: 1.263 ± 0.809
3.367AlaIle: 3.367 ± 1.327
3.788AlaLys: 3.788 ± 1.564
5.892AlaLeu: 5.892 ± 2.024
3.367AlaMet: 3.367 ± 0.792
2.525AlaAsn: 2.525 ± 1.063
2.946AlaPro: 2.946 ± 1.329
2.104AlaGln: 2.104 ± 0.8
5.892AlaArg: 5.892 ± 1.189
7.576AlaSer: 7.576 ± 3.927
4.63AlaThr: 4.63 ± 0.63
4.63AlaVal: 4.63 ± 2.401
1.684AlaTrp: 1.684 ± 0.866
2.525AlaTyr: 2.525 ± 0.383
0.0AlaXaa: 0.0 ± 0.0
Cys
1.263CysAla: 1.263 ± 0.764
0.0CysCys: 0.0 ± 0.0
0.842CysAsp: 0.842 ± 0.539
0.842CysGlu: 0.842 ± 0.202
1.263CysPhe: 1.263 ± 1.088
0.842CysGly: 0.842 ± 0.725
0.0CysHis: 0.0 ± 0.0
1.263CysIle: 1.263 ± 1.558
0.421CysLys: 0.421 ± 0.363
0.842CysLeu: 0.842 ± 0.725
0.0CysMet: 0.0 ± 0.0
0.421CysAsn: 0.421 ± 0.363
0.0CysPro: 0.0 ± 0.0
1.263CysGln: 1.263 ± 0.309
1.263CysArg: 1.263 ± 1.031
0.842CysSer: 0.842 ± 0.75
0.0CysThr: 0.0 ± 0.0
0.421CysVal: 0.421 ± 0.829
0.421CysTrp: 0.421 ± 0.363
1.263CysTyr: 1.263 ± 0.764
0.0CysXaa: 0.0 ± 0.0
Asp
5.471AspAla: 5.471 ± 1.03
0.421AspCys: 0.421 ± 0.363
4.209AspAsp: 4.209 ± 1.009
4.209AspGlu: 4.209 ± 0.373
5.471AspPhe: 5.471 ± 1.311
2.946AspGly: 2.946 ± 0.362
0.421AspHis: 0.421 ± 0.27
3.367AspIle: 3.367 ± 0.771
4.209AspLys: 4.209 ± 1.405
7.155AspLeu: 7.155 ± 1.986
0.842AspMet: 0.842 ± 0.725
5.892AspAsn: 5.892 ± 1.189
3.788AspPro: 3.788 ± 0.502
0.842AspGln: 0.842 ± 0.725
0.842AspArg: 0.842 ± 0.725
3.788AspSer: 3.788 ± 1.09
2.104AspThr: 2.104 ± 0.447
3.367AspVal: 3.367 ± 1.75
1.684AspTrp: 1.684 ± 0.875
2.104AspTyr: 2.104 ± 0.702
0.0AspXaa: 0.0 ± 0.0
Glu
2.104GluAla: 2.104 ± 0.8
1.263GluCys: 1.263 ± 1.558
2.525GluAsp: 2.525 ± 1.043
2.946GluGlu: 2.946 ± 0.62
2.104GluPhe: 2.104 ± 0.702
2.104GluGly: 2.104 ± 1.031
0.842GluHis: 0.842 ± 0.863
2.946GluIle: 2.946 ± 1.04
2.525GluLys: 2.525 ± 0.605
2.104GluLeu: 2.104 ± 0.702
0.0GluMet: 0.0 ± 0.0
3.367GluAsn: 3.367 ± 0.771
3.367GluPro: 3.367 ± 2.998
2.946GluGln: 2.946 ± 1.04
5.051GluArg: 5.051 ± 1.681
4.209GluSer: 4.209 ± 1.869
2.525GluThr: 2.525 ± 1.063
4.209GluVal: 4.209 ± 0.893
0.421GluTrp: 0.421 ± 0.363
2.104GluTyr: 2.104 ± 0.447
0.0GluXaa: 0.0 ± 0.0
Phe
4.209PheAla: 4.209 ± 1.405
0.0PheCys: 0.0 ± 0.0
5.051PheAsp: 5.051 ± 2.625
2.946PheGlu: 2.946 ± 1.811
0.842PhePhe: 0.842 ± 0.75
2.525PheGly: 2.525 ± 0.682
1.263PheHis: 1.263 ± 0.309
1.263PheIle: 1.263 ± 0.521
2.525PheLys: 2.525 ± 0.383
1.263PheLeu: 1.263 ± 0.309
0.421PheMet: 0.421 ± 0.27
2.946PheAsn: 2.946 ± 1.04
2.525PhePro: 2.525 ± 0.75
0.842PheGln: 0.842 ± 0.863
2.525PheArg: 2.525 ± 1.595
5.051PheSer: 5.051 ± 1.235
2.946PheThr: 2.946 ± 0.62
5.471PheVal: 5.471 ± 1.481
0.842PheTrp: 0.842 ± 0.539
1.684PheTyr: 1.684 ± 0.875
0.0PheXaa: 0.0 ± 0.0
Gly
2.525GlyAla: 2.525 ± 0.682
0.842GlyCys: 0.842 ± 0.539
5.892GlyAsp: 5.892 ± 1.429
2.525GlyGlu: 2.525 ± 0.682
3.367GlyPhe: 3.367 ± 1.565
4.63GlyGly: 4.63 ± 0.913
0.421GlyHis: 0.421 ± 0.829
3.788GlyIle: 3.788 ± 0.926
4.209GlyLys: 4.209 ± 1.601
5.471GlyLeu: 5.471 ± 0.723
1.263GlyMet: 1.263 ± 0.309
3.367GlyAsn: 3.367 ± 2.157
2.104GlyPro: 2.104 ± 1.348
2.946GlyGln: 2.946 ± 1.29
2.946GlyArg: 2.946 ± 2.118
4.209GlySer: 4.209 ± 1.88
5.892GlyThr: 5.892 ± 1.189
4.63GlyVal: 4.63 ± 1.221
0.421GlyTrp: 0.421 ± 0.27
2.104GlyTyr: 2.104 ± 0.447
0.0GlyXaa: 0.0 ± 0.0
His
0.842HisAla: 0.842 ± 0.863
0.0HisCys: 0.0 ± 0.0
1.263HisAsp: 1.263 ± 0.521
0.421HisGlu: 0.421 ± 0.363
0.421HisPhe: 0.421 ± 0.363
1.263HisGly: 1.263 ± 1.088
0.0HisHis: 0.0 ± 0.0
0.842HisIle: 0.842 ± 0.202
0.421HisLys: 0.421 ± 0.27
1.684HisLeu: 1.684 ± 0.543
0.421HisMet: 0.421 ± 0.27
0.0HisAsn: 0.0 ± 0.0
0.421HisPro: 0.421 ± 0.27
0.421HisGln: 0.421 ± 0.27
1.684HisArg: 1.684 ± 0.56
0.842HisSer: 0.842 ± 0.863
1.263HisThr: 1.263 ± 0.809
0.421HisVal: 0.421 ± 0.27
0.0HisTrp: 0.0 ± 0.0
1.263HisTyr: 1.263 ± 1.088
0.0HisXaa: 0.0 ± 0.0
Ile
5.471IleAla: 5.471 ± 1.451
1.684IleCys: 1.684 ± 0.404
3.367IleAsp: 3.367 ± 2.494
2.104IleGlu: 2.104 ± 1.016
2.525IlePhe: 2.525 ± 0.605
1.684IleGly: 1.684 ± 0.543
0.842IleHis: 0.842 ± 0.725
4.63IleIle: 4.63 ± 1.8
0.842IleLys: 0.842 ± 0.202
4.209IleLeu: 4.209 ± 0.841
0.842IleMet: 0.842 ± 0.539
3.788IleAsn: 3.788 ± 0.807
5.051IlePro: 5.051 ± 0.925
3.367IleGln: 3.367 ± 1.694
3.788IleArg: 3.788 ± 1.329
3.367IleSer: 3.367 ± 1.221
4.209IleThr: 4.209 ± 1.914
3.788IleVal: 3.788 ± 0.926
0.421IleTrp: 0.421 ± 0.27
1.263IleTyr: 1.263 ± 1.088
0.0IleXaa: 0.0 ± 0.0
Lys
1.263LysAla: 1.263 ± 0.764
0.421LysCys: 0.421 ± 0.27
2.946LysAsp: 2.946 ± 0.893
2.525LysGlu: 2.525 ± 2.063
3.367LysPhe: 3.367 ± 1.221
1.684LysGly: 1.684 ± 0.543
2.525LysHis: 2.525 ± 1.309
5.051LysIle: 5.051 ± 0.95
3.788LysLys: 3.788 ± 1.564
6.734LysLeu: 6.734 ± 2.956
0.421LysMet: 0.421 ± 0.363
4.209LysAsn: 4.209 ± 1.914
0.842LysPro: 0.842 ± 0.202
0.842LysGln: 0.842 ± 0.75
2.946LysArg: 2.946 ± 2.23
3.367LysSer: 3.367 ± 1.565
2.946LysThr: 2.946 ± 0.62
4.209LysVal: 4.209 ± 0.893
1.684LysTrp: 1.684 ± 0.875
2.104LysTyr: 2.104 ± 0.447
0.0LysXaa: 0.0 ± 0.0
Leu
7.155LeuAla: 7.155 ± 0.681
2.525LeuCys: 2.525 ± 1.595
6.313LeuAsp: 6.313 ± 0.884
3.788LeuGlu: 3.788 ± 0.733
3.367LeuPhe: 3.367 ± 1.221
5.892LeuGly: 5.892 ± 0.85
0.421LeuHis: 0.421 ± 0.363
5.051LeuIle: 5.051 ± 1.211
7.155LeuLys: 7.155 ± 2.76
6.734LeuLeu: 6.734 ± 0.852
1.684LeuMet: 1.684 ± 0.543
4.63LeuAsn: 4.63 ± 1.741
2.104LeuPro: 2.104 ± 0.447
3.788LeuGln: 3.788 ± 1.864
7.155LeuArg: 7.155 ± 1.058
7.155LeuSer: 7.155 ± 1.235
8.418LeuThr: 8.418 ± 0.866
2.946LeuVal: 2.946 ± 0.842
0.842LeuTrp: 0.842 ± 0.202
2.525LeuTyr: 2.525 ± 0.617
0.0LeuXaa: 0.0 ± 0.0
Met
1.263MetAla: 1.263 ± 0.809
0.421MetCys: 0.421 ± 0.27
1.684MetAsp: 1.684 ± 0.783
0.842MetGlu: 0.842 ± 0.539
0.842MetPhe: 0.842 ± 0.202
1.263MetGly: 1.263 ± 0.309
0.0MetHis: 0.0 ± 0.0
0.842MetIle: 0.842 ± 0.539
1.263MetLys: 1.263 ± 0.521
2.946MetLeu: 2.946 ± 0.893
0.0MetMet: 0.0 ± 0.0
0.842MetAsn: 0.842 ± 0.539
0.842MetPro: 0.842 ± 0.202
1.263MetGln: 1.263 ± 0.309
0.842MetArg: 0.842 ± 0.539
1.684MetSer: 1.684 ± 0.875
1.263MetThr: 1.263 ± 0.809
0.842MetVal: 0.842 ± 0.539
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.63AsnAla: 4.63 ± 0.63
0.0AsnCys: 0.0 ± 0.0
3.367AsnAsp: 3.367 ± 1.596
2.525AsnGlu: 2.525 ± 0.605
2.946AsnPhe: 2.946 ± 0.893
2.946AsnGly: 2.946 ± 1.457
0.842AsnHis: 0.842 ± 0.725
5.051AsnIle: 5.051 ± 1.551
2.946AsnLys: 2.946 ± 1.04
6.734AsnLeu: 6.734 ± 0.511
0.421AsnMet: 0.421 ± 0.27
2.104AsnAsn: 2.104 ± 0.702
3.788AsnPro: 3.788 ± 1.415
2.525AsnGln: 2.525 ± 1.063
2.104AsnArg: 2.104 ± 0.702
3.367AsnSer: 3.367 ± 1.596
5.892AsnThr: 5.892 ± 1.88
2.525AsnVal: 2.525 ± 0.605
1.684AsnTrp: 1.684 ± 0.404
3.367AsnTyr: 3.367 ± 1.087
0.0AsnXaa: 0.0 ± 0.0
Pro
3.367ProAla: 3.367 ± 1.327
0.0ProCys: 0.0 ± 0.0
1.684ProAsp: 1.684 ± 1.45
0.842ProGlu: 0.842 ± 0.75
0.421ProPhe: 0.421 ± 0.27
4.63ProGly: 4.63 ± 1.837
0.842ProHis: 0.842 ± 0.202
3.367ProIle: 3.367 ± 1.221
2.104ProLys: 2.104 ± 0.563
2.946ProLeu: 2.946 ± 0.62
0.842ProMet: 0.842 ± 0.539
3.788ProAsn: 3.788 ± 1.938
2.946ProPro: 2.946 ± 1.29
2.104ProGln: 2.104 ± 0.8
2.525ProArg: 2.525 ± 1.233
7.155ProSer: 7.155 ± 2.731
3.367ProThr: 3.367 ± 0.245
5.051ProVal: 5.051 ± 1.63
0.421ProTrp: 0.421 ± 0.27
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
6.313GlnAla: 6.313 ± 2.551
0.421GlnCys: 0.421 ± 0.829
1.263GlnAsp: 1.263 ± 0.809
0.421GlnGlu: 0.421 ± 0.27
1.684GlnPhe: 1.684 ± 0.543
2.946GlnGly: 2.946 ± 1.329
0.421GlnHis: 0.421 ± 0.363
0.842GlnIle: 0.842 ± 0.202
1.263GlnLys: 1.263 ± 0.521
3.788GlnLeu: 3.788 ± 0.502
1.263GlnMet: 1.263 ± 0.309
2.104GlnAsn: 2.104 ± 0.8
1.684GlnPro: 1.684 ± 0.866
0.421GlnGln: 0.421 ± 0.27
0.842GlnArg: 0.842 ± 0.863
2.104GlnSer: 2.104 ± 0.563
1.684GlnThr: 1.684 ± 0.543
4.209GlnVal: 4.209 ± 1.585
0.0GlnTrp: 0.0 ± 0.0
0.842GlnTyr: 0.842 ± 0.539
0.0GlnXaa: 0.0 ± 0.0
Arg
2.946ArgAla: 2.946 ± 2.118
1.684ArgCys: 1.684 ± 1.725
4.63ArgAsp: 4.63 ± 2.593
2.104ArgGlu: 2.104 ± 0.702
3.367ArgPhe: 3.367 ± 1.565
5.051ArgGly: 5.051 ± 2.348
0.842ArgHis: 0.842 ± 0.202
4.209ArgIle: 4.209 ± 0.893
2.946ArgLys: 2.946 ± 1.631
7.576ArgLeu: 7.576 ± 0.872
0.842ArgMet: 0.842 ± 0.539
3.788ArgAsn: 3.788 ± 0.164
3.367ArgPro: 3.367 ± 1.219
0.421ArgGln: 0.421 ± 0.27
3.367ArgArg: 3.367 ± 2.998
1.684ArgSer: 1.684 ± 0.56
2.525ArgThr: 2.525 ± 1.233
2.525ArgVal: 2.525 ± 0.605
1.684ArgTrp: 1.684 ± 0.875
0.421ArgTyr: 0.421 ± 0.363
0.0ArgXaa: 0.0 ± 0.0
Ser
7.997SerAla: 7.997 ± 2.918
0.842SerCys: 0.842 ± 0.75
5.051SerAsp: 5.051 ± 1.898
3.788SerGlu: 3.788 ± 1.335
3.367SerPhe: 3.367 ± 1.219
7.997SerGly: 7.997 ± 2.945
0.842SerHis: 0.842 ± 0.539
3.367SerIle: 3.367 ± 0.245
3.367SerLys: 3.367 ± 1.967
5.471SerLeu: 5.471 ± 1.311
0.842SerMet: 0.842 ± 0.459
6.313SerAsn: 6.313 ± 1.382
3.788SerPro: 3.788 ± 1.938
3.788SerGln: 3.788 ± 1.341
5.471SerArg: 5.471 ± 3.722
8.418SerSer: 8.418 ± 5.761
5.471SerThr: 5.471 ± 0.581
2.104SerVal: 2.104 ± 0.8
0.842SerTrp: 0.842 ± 0.202
0.421SerTyr: 0.421 ± 0.363
0.0SerXaa: 0.0 ± 0.0
Thr
5.892ThrAla: 5.892 ± 0.373
0.421ThrCys: 0.421 ± 0.363
2.525ThrAsp: 2.525 ± 0.605
5.471ThrGlu: 5.471 ± 0.695
2.104ThrPhe: 2.104 ± 0.447
5.051ThrGly: 5.051 ± 0.894
1.263ThrHis: 1.263 ± 0.809
2.525ThrIle: 2.525 ± 0.383
4.209ThrLys: 4.209 ± 1.108
7.576ThrLeu: 7.576 ± 1.816
1.684ThrMet: 1.684 ± 0.45
3.788ThrAsn: 3.788 ± 0.926
2.104ThrPro: 2.104 ± 0.447
2.104ThrGln: 2.104 ± 1.49
3.788ThrArg: 3.788 ± 0.807
7.155ThrSer: 7.155 ± 1.113
4.209ThrThr: 4.209 ± 1.601
5.051ThrVal: 5.051 ± 1.235
0.842ThrTrp: 0.842 ± 0.202
1.263ThrTyr: 1.263 ± 0.809
0.0ThrXaa: 0.0 ± 0.0
Val
2.946ValAla: 2.946 ± 0.842
1.263ValCys: 1.263 ± 0.521
3.788ValAsp: 3.788 ± 1.087
4.63ValGlu: 4.63 ± 1.891
4.209ValPhe: 4.209 ± 0.893
4.209ValGly: 4.209 ± 2.132
0.842ValHis: 0.842 ± 0.202
2.946ValIle: 2.946 ± 1.329
2.946ValLys: 2.946 ± 1.956
5.892ValLeu: 5.892 ± 2.659
2.104ValMet: 2.104 ± 0.8
2.946ValAsn: 2.946 ± 0.62
4.63ValPro: 4.63 ± 1.383
2.104ValGln: 2.104 ± 0.8
1.684ValArg: 1.684 ± 0.404
5.471ValSer: 5.471 ± 0.251
6.313ValThr: 6.313 ± 1.34
2.946ValVal: 2.946 ± 0.62
0.0ValTrp: 0.0 ± 0.0
1.684ValTyr: 1.684 ± 0.543
0.0ValXaa: 0.0 ± 0.0
Trp
0.421TrpAla: 0.421 ± 0.363
0.0TrpCys: 0.0 ± 0.0
0.421TrpAsp: 0.421 ± 0.27
0.842TrpGlu: 0.842 ± 0.539
0.842TrpPhe: 0.842 ± 0.725
0.421TrpGly: 0.421 ± 0.27
0.0TrpHis: 0.0 ± 0.0
1.684TrpIle: 1.684 ± 0.404
0.421TrpLys: 0.421 ± 0.829
0.842TrpLeu: 0.842 ± 0.725
1.263TrpMet: 1.263 ± 0.521
1.263TrpAsn: 1.263 ± 0.809
0.421TrpPro: 0.421 ± 0.27
0.421TrpGln: 0.421 ± 0.27
0.842TrpArg: 0.842 ± 0.202
1.263TrpSer: 1.263 ± 0.309
0.842TrpThr: 0.842 ± 0.202
1.684TrpVal: 1.684 ± 1.45
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.525TyrAla: 2.525 ± 1.063
0.421TyrCys: 0.421 ± 0.829
2.525TyrAsp: 2.525 ± 1.043
0.0TyrGlu: 0.0 ± 0.0
2.104TyrPhe: 2.104 ± 0.702
2.104TyrGly: 2.104 ± 1.348
0.0TyrHis: 0.0 ± 0.0
0.842TyrIle: 0.842 ± 0.725
2.525TyrLys: 2.525 ± 1.043
3.788TyrLeu: 3.788 ± 1.101
0.0TyrMet: 0.0 ± 0.0
1.684TyrAsn: 1.684 ± 0.875
1.684TyrPro: 1.684 ± 1.078
0.0TyrGln: 0.0 ± 0.0
0.421TyrArg: 0.421 ± 0.27
1.263TyrSer: 1.263 ± 0.809
2.525TyrThr: 2.525 ± 0.605
2.525TyrVal: 2.525 ± 0.617
0.0TyrTrp: 0.0 ± 0.0
1.684TyrTyr: 1.684 ± 0.875
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2377 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski