Amino acid dipeptide frequency for Beihai shrimp virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
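As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and counting overlapping pairs that never span two proteins is an assumption, since this page does not describe the exact counting procedure.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein, and never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical, not the real proteome).
freqs = dipeptide_per_mille(["MKLLS", "LLSA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The two toy sequences contain seven pairs in total, so the reported values sum to 1000 per mille.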

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
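The arithmetic behind the expected value can be sketched as follows. The two observed values are taken from the table (LeuSer and TrpTrp). The `classify` helper is hypothetical, and using the uniform expectation over 400 dipeptides as the threshold for over- and underrepresentation is an assumption; the exact rule used for the colouring is not stated here.

```python
STANDARD = 20   # standard amino acids
WITH_XAA = 21   # plus Xaa for non-standard or unknown residues

n_dipeptides = STANDARD ** 2        # 400 ordered pairs
n_dipeptides_xaa = WITH_XAA ** 2    # 441 ordered pairs

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / n_dipeptides


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Observed frequencies in per mille, copied from the table.
examples = {"LeuSer": 10.406, "TrpTrp": 0.353}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_per_mille, labels)
```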

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
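A minimal sketch of the unit conversion, using the LeuLeu value from the table as the example. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


leu_leu = 9.524  # LeuLeu frequency from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)
percent = per_mille_to_percent(leu_leu)
print(f"{leu_leu} per mille = {fraction:.6f} = {percent:.4f} %")
```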

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 7.055 ± 4.163 | 1.587 ± 0.574 | 3.88 ± 0.822 | 3.704 ± 0.995 | 2.646 ± 0.271 | 4.233 ± 1.564 | 1.235 ± 0.711 | 3.527 ± 1.216 | 2.646 ± 0.218 | 5.996 ± 1.748 | 2.116 ± 1.516 | 2.822 ± 0.297 | 4.056 ± 4.84 | 1.235 ± 0.578 | 3.527 ± 1.476 | 4.586 ± 0.752 | 5.291 ± 3.574 | 6.349 ± 0.933 | 1.235 ± 0.944 | 2.293 ± 0.584 | 0.0 ± 0.0
Cys | 0.882 ± 0.776 | 0.705 ± 0.453 | 2.116 ± 0.959 | 1.058 ± 0.385 | 1.058 ± 0.732 | 1.411 ± 1.158 | 0.529 ± 0.366 | 0.705 ± 0.476 | 1.94 ± 1.217 | 3.704 ± 0.923 | 0.529 ± 0.328 | 0.353 ± 0.584 | 1.058 ± 0.887 | 0.882 ± 0.302 | 1.587 ± 0.574 | 2.116 ± 0.758 | 1.235 ± 0.944 | 1.058 ± 0.318 | 0.176 ± 0.376 | 1.411 ± 0.671 | 0.0 ± 0.0
Asp | 2.822 ± 0.678 | 1.94 ± 0.748 | 3.88 ± 1.164 | 3.527 ± 1.068 | 2.116 ± 0.501 | 3.175 ± 1.411 | 1.058 ± 0.277 | 3.175 ± 0.793 | 2.822 ± 0.532 | 4.586 ± 0.986 | 1.764 ± 0.754 | 2.822 ± 0.887 | 2.293 ± 0.3 | 1.058 ± 0.48 | 4.586 ± 0.861 | 4.409 ± 1.326 | 2.998 ± 0.548 | 3.175 ± 0.633 | 0.529 ± 0.159 | 1.411 ± 0.417 | 0.0 ± 0.0
Glu | 3.88 ± 0.906 | 2.116 ± 0.959 | 4.762 ± 1.177 | 5.115 ± 1.68 | 2.998 ± 0.806 | 4.762 ± 1.177 | 1.587 ± 0.625 | 3.175 ± 0.793 | 3.351 ± 0.826 | 7.407 ± 1.625 | 2.469 ± 0.399 | 2.116 ± 0.641 | 2.998 ± 0.667 | 1.058 ± 0.318 | 2.998 ± 0.614 | 4.409 ± 0.921 | 4.586 ± 1.044 | 4.586 ± 0.572 | 0.705 ± 0.32 | 1.411 ± 0.617 | 0.0 ± 0.0
Phe | 1.94 ± 0.468 | 0.529 ± 0.647 | 1.764 ± 0.397 | 2.646 ± 0.721 | 1.411 ± 0.417 | 1.764 ± 0.393 | 0.705 ± 0.254 | 1.411 ± 0.312 | 3.527 ± 0.448 | 3.704 ± 0.445 | 0.705 ± 0.407 | 1.411 ± 0.616 | 1.058 ± 0.455 | 1.764 ± 0.487 | 2.646 ± 0.515 | 2.469 ± 0.268 | 1.94 ± 0.574 | 3.175 ± 0.424 | 0.176 ± 0.216 | 0.529 ± 0.526 | 0.0 ± 0.0
Gly | 3.88 ± 1.752 | 0.529 ± 0.558 | 3.527 ± 0.445 | 4.056 ± 0.633 | 1.94 ± 0.574 | 3.704 ± 1.115 | 1.411 ± 0.671 | 3.527 ± 0.86 | 2.646 ± 0.47 | 3.88 ± 0.516 | 1.058 ± 0.277 | 3.351 ± 0.613 | 1.411 ± 0.446 | 1.587 ± 0.586 | 1.764 ± 0.604 | 6.526 ± 1.376 | 3.704 ± 0.917 | 5.467 ± 0.844 | 1.058 ± 0.48 | 1.764 ± 0.782 | 0.0 ± 0.0
His | 0.705 ± 0.316 | 0.353 ± 0.16 | 0.882 ± 0.602 | 0.529 ± 0.159 | 1.235 ± 0.68 | 1.411 ± 0.456 | 1.235 ± 0.331 | 1.058 ± 0.379 | 1.058 ± 0.348 | 3.175 ± 1.074 | 1.587 ± 0.196 | 1.235 ± 0.514 | 0.882 ± 0.508 | 0.529 ± 0.305 | 0.353 ± 0.203 | 1.411 ± 0.174 | 0.705 ± 0.373 | 1.058 ± 0.277 | 0.176 ± 0.216 | 0.705 ± 0.407 | 0.0 ± 0.0
Ile | 3.88 ± 1.277 | 1.764 ± 0.193 | 1.94 ± 0.341 | 2.822 ± 0.719 | 1.587 ± 0.196 | 1.587 ± 1.075 | 0.529 ± 0.302 | 1.94 ± 0.468 | 4.938 ± 0.739 | 5.115 ± 0.827 | 1.235 ± 0.475 | 2.116 ± 0.58 | 2.293 ± 0.817 | 0.882 ± 0.615 | 2.646 ± 0.817 | 3.88 ± 0.87 | 3.351 ± 0.351 | 3.704 ± 0.816 | 0.705 ± 0.308 | 0.705 ± 0.863 | 0.0 ± 0.0
Lys | 4.762 ± 1.169 | 1.411 ± 0.887 | 2.998 ± 0.356 | 4.409 ± 0.611 | 2.646 ± 0.515 | 2.293 ± 0.921 | 1.235 ± 0.363 | 2.998 ± 0.779 | 5.115 ± 1.195 | 8.289 ± 1.817 | 1.235 ± 0.578 | 2.116 ± 0.227 | 2.646 ± 0.63 | 2.116 ± 0.227 | 2.646 ± 0.975 | 3.527 ± 0.69 | 4.056 ± 0.513 | 3.704 ± 1.416 | 0.529 ± 0.159 | 1.764 ± 0.852 | 0.0 ± 0.0
Leu | 5.644 ± 0.813 | 1.587 ± 0.625 | 4.233 ± 0.682 | 7.055 ± 2.404 | 3.704 ± 1.086 | 5.467 ± 1.518 | 1.411 ± 0.427 | 4.586 ± 1.013 | 7.584 ± 1.852 | 9.524 ± 1.696 | 1.94 ± 0.185 | 4.409 ± 2.278 | 4.056 ± 0.547 | 2.822 ± 0.407 | 5.82 ± 1.498 | 10.406 ± 1.429 | 5.82 ± 0.962 | 5.644 ± 0.56 | 0.882 ± 0.302 | 3.88 ± 0.109 | 0.0 ± 0.0
Met | 2.469 ± 1.192 | 1.058 ± 0.277 | 0.705 ± 0.316 | 1.587 ± 0.477 | 1.411 ± 0.374 | 1.058 ± 0.305 | 0.529 ± 0.345 | 1.587 ± 0.323 | 1.94 ± 0.341 | 2.822 ± 1.074 | 1.235 ± 0.609 | 0.882 ± 0.294 | 0.882 ± 0.359 | 0.529 ± 0.302 | 1.587 ± 0.614 | 2.646 ± 0.41 | 2.116 ± 0.85 | 1.764 ± 0.804 | 0.353 ± 0.203 | 1.235 ± 0.569 | 0.0 ± 0.0
Asn | 2.469 ± 1.254 | 1.235 ± 0.208 | 2.293 ± 0.344 | 2.469 ± 0.798 | 1.411 ± 0.174 | 1.94 ± 0.795 | 1.235 ± 0.457 | 2.116 ± 1.236 | 2.646 ± 0.619 | 3.351 ± 0.447 | 1.058 ± 0.385 | 2.116 ± 0.553 | 2.646 ± 0.749 | 1.411 ± 0.312 | 2.293 ± 1.052 | 4.056 ± 1.731 | 1.94 ± 0.54 | 2.469 ± 0.763 | 0.529 ± 0.345 | 1.411 ± 1.092 | 0.0 ± 0.0
Pro | 3.88 ± 2.756 | 1.235 ± 0.736 | 2.998 ± 0.706 | 3.351 ± 0.245 | 1.587 ± 0.585 | 3.175 ± 1.37 | 0.529 ± 0.159 | 1.235 ± 0.363 | 1.94 ± 0.778 | 2.469 ± 0.415 | 0.705 ± 1.051 | 1.235 ± 0.351 | 2.822 ± 1.487 | 2.116 ± 1.257 | 1.764 ± 0.627 | 4.586 ± 1.804 | 2.822 ± 1.117 | 3.175 ± 1.057 | 0.0 ± 0.0 | 1.411 ± 0.631 | 0.0 ± 0.0
Gln | 1.764 ± 0.905 | 0.529 ± 0.305 | 0.529 ± 0.159 | 2.116 ± 0.976 | 1.411 ± 0.427 | 1.235 ± 0.514 | 0.705 ± 0.308 | 1.764 ± 0.518 | 1.587 ± 0.323 | 1.764 ± 0.269 | 0.882 ± 0.432 | 1.235 ± 0.689 | 2.293 ± 0.828 | 2.469 ± 2.58 | 2.646 ± 0.565 | 1.058 ± 0.199 | 1.764 ± 0.589 | 2.822 ± 1.058 | 0.705 ± 0.407 | 0.705 ± 0.214 | 0.0 ± 0.0
Arg | 4.233 ± 0.863 | 0.705 ± 0.373 | 5.291 ± 1.207 | 3.88 ± 1.185 | 0.882 ± 0.508 | 2.998 ± 0.706 | 0.705 ± 0.373 | 2.293 ± 0.3 | 3.527 ± 1.068 | 5.82 ± 0.439 | 2.116 ± 0.227 | 2.469 ± 0.431 | 1.058 ± 0.506 | 1.764 ± 0.589 | 3.351 ± 0.826 | 4.762 ± 1.358 | 3.704 ± 0.824 | 4.056 ± 1.522 | 0.176 ± 0.102 | 1.587 ± 0.477 | 0.0 ± 0.0
Ser | 6.702 ± 0.894 | 3.175 ± 2.058 | 2.646 ± 0.907 | 6.702 ± 1.512 | 2.293 ± 0.667 | 6.173 ± 0.975 | 1.587 ± 0.672 | 3.704 ± 0.993 | 3.527 ± 1.043 | 8.995 ± 1.584 | 1.94 ± 0.678 | 2.646 ± 0.394 | 1.94 ± 0.585 | 2.822 ± 1.091 | 5.996 ± 1.297 | 9.171 ± 0.84 | 6.349 ± 1.068 | 7.76 ± 1.117 | 1.058 ± 0.318 | 3.351 ± 0.633 | 0.0 ± 0.0
Thr | 5.644 ± 1.594 | 0.882 ± 0.776 | 3.527 ± 0.683 | 4.233 ± 0.889 | 1.94 ± 0.518 | 3.704 ± 1.086 | 1.587 ± 0.979 | 3.351 ± 0.99 | 3.527 ± 0.815 | 5.996 ± 1.029 | 2.822 ± 0.719 | 3.175 ± 0.944 | 4.586 ± 0.993 | 1.587 ± 0.396 | 2.998 ± 0.678 | 5.82 ± 0.658 | 3.704 ± 1.324 | 4.586 ± 1.48 | 0.705 ± 0.308 | 0.882 ± 0.294 | 0.0 ± 0.0
Val | 5.291 ± 2.789 | 1.94 ± 0.612 | 4.233 ± 0.288 | 5.467 ± 0.616 | 1.235 ± 0.315 | 4.762 ± 2.201 | 1.235 ± 0.618 | 3.527 ± 0.896 | 4.409 ± 0.808 | 4.938 ± 0.861 | 1.94 ± 0.426 | 2.822 ± 0.726 | 2.646 ± 2.012 | 1.764 ± 0.258 | 4.056 ± 0.538 | 8.466 ± 1.393 | 5.644 ± 1.841 | 5.82 ± 1.196 | 0.882 ± 0.405 | 2.646 ± 0.974 | 0.0 ± 0.0
Trp | 0.705 ± 0.214 | 0.705 ± 0.863 | 0.705 ± 0.32 | 0.529 ± 0.715 | 0.176 ± 0.376 | 0.882 ± 0.302 | 0.176 ± 0.102 | 0.176 ± 0.216 | 0.529 ± 0.366 | 0.882 ± 0.508 | 0.176 ± 0.102 | 0.176 ± 0.102 | 0.353 ± 0.203 | 0.705 ± 0.476 | 0.176 ± 0.102 | 1.411 ± 0.456 | 0.705 ± 0.579 | 1.235 ± 0.315 | 0.353 ± 0.424 | 0.353 ± 0.16 | 0.0 ± 0.0
Tyr | 1.587 ± 0.247 | 0.529 ± 0.159 | 1.235 ± 0.569 | 1.235 ± 0.331 | 1.587 ± 0.632 | 1.058 ± 0.348 | 1.058 ± 0.61 | 1.764 ± 0.718 | 1.235 ± 0.457 | 4.056 ± 0.313 | 0.705 ± 0.316 | 1.587 ± 0.449 | 1.058 ± 0.506 | 0.882 ± 0.668 | 1.94 ± 1.071 | 2.822 ± 0.719 | 2.822 ± 0.542 | 2.116 ± 0.501 | 0.176 ± 0.216 | 1.94 ± 0.533 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
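
The residues in the table are listed by three-letter code, in the alphabetical order of their one-letter codes, with Xaa last. The sketch below regenerates the 441 dipeptide labels in that order and converts a label to one-letter notation, which may help when matching the table against sequence data. The names `THREE_TO_ONE`, `ORDER` and `to_one_letter` are hypothetical, and using X for Xaa is a common convention rather than something stated on this page.

```python
# Three-letter to one-letter codes, in the same order as the table.
THREE_TO_ONE = {
    "Ala": "A", "Cys": "C", "Asp": "D", "Glu": "E", "Phe": "F",
    "Gly": "G", "His": "H", "Ile": "I", "Lys": "K", "Leu": "L",
    "Met": "M", "Asn": "N", "Pro": "P", "Gln": "Q", "Arg": "R",
    "Ser": "S", "Thr": "T", "Val": "V", "Trp": "W", "Tyr": "Y",
    "Xaa": "X",  # assumed symbol for non-standard or unknown residues
}

ORDER = list(THREE_TO_ONE)  # dicts keep insertion order in Python 3.7+

# Row-major labels: first residue varies slowest, as in the table.
labels = [first + second for first in ORDER for second in ORDER]


def to_one_letter(label):
    """Convert a six-character label such as 'LeuSer' to 'LS'."""
    return THREE_TO_ONE[label[:3]] + THREE_TO_ONE[label[3:]]


print(len(labels), labels[0], labels[-1], to_one_letter("LeuSer"))
```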
Statistics based on 4 proteins (5671 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
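
One plausible reading of bootstrapping at the protein level is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. This is an assumption about the general technique, not a description of the database's actual code; the function names, the fixed seed, the use of the sample standard deviation and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a pair's per mille frequency.

    Each round draws len(proteins) proteins with replacement and
    recomputes the frequency of `pair` over the drawn set.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(per_protein) for _ in proteins]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteins in one-letter code (hypothetical, not the real proteome).
proteins = ["MKLLSLLA", "LLSAGK", "MSSLLK", "AKLSLL"]
mean_ll, err_ll = bootstrap_error(proteins, "LL")
print(f"LL: {mean_ll:.3f} ± {err_ll:.3f}")
```

With only a handful of proteins, as here, each resample can differ strongly from the full set, so relatively large errors are to be expected. A pair that never occurs yields 0.0 ± 0.0, as in the Xaa entries.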

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski