Amino acid dipeptide frequency for Wenzhou pacific spadenose shark paramyxovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
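The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names, the one-letter sequence input, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq, expected=EXPECTED_UNIFORM):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the proteome.
    toy = ["AAAC", "AC"]
    for dp, f in sorted(dipeptide_per_mille(toy).items()):
        print(dp, round(f, 3), classify(f))
```

With this convention, a table value such as LeuLeu at 10.187‰ would be labelled "over", and AlaCys at 0.926‰ "under".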

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.63 ± 1.062
AlaCys: 0.926 ± 0.475
AlaAsp: 2.161 ± 0.576
AlaGlu: 2.006 ± 0.773
AlaPhe: 1.852 ± 0.523
AlaGly: 2.933 ± 1.132
AlaHis: 1.698 ± 0.583
AlaIle: 3.087 ± 0.45
AlaLys: 2.933 ± 0.715
AlaLeu: 6.482 ± 1.22
AlaMet: 2.161 ± 0.434
AlaAsn: 3.087 ± 0.708
AlaPro: 3.087 ± 0.856
AlaGln: 1.852 ± 0.51
AlaArg: 3.087 ± 0.775
AlaSer: 5.711 ± 1.248
AlaThr: 3.396 ± 0.65
AlaVal: 3.241 ± 0.523
AlaTrp: 0.772 ± 0.248
AlaTyr: 1.389 ± 0.325
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.309 ± 0.15
CysCys: 0.309 ± 0.197
CysAsp: 1.235 ± 0.38
CysGlu: 1.235 ± 0.362
CysPhe: 0.309 ± 0.245
CysGly: 0.463 ± 0.365
CysHis: 1.235 ± 0.426
CysIle: 1.08 ± 0.344
CysLys: 0.617 ± 0.212
CysLeu: 0.926 ± 0.4
CysMet: 0.309 ± 0.222
CysAsn: 0.772 ± 0.42
CysPro: 0.926 ± 0.339
CysGln: 1.235 ± 0.286
CysArg: 0.309 ± 0.168
CysSer: 2.315 ± 0.612
CysThr: 0.617 ± 0.281
CysVal: 0.772 ± 0.365
CysTrp: 0.154 ± 0.102
CysTyr: 0.617 ± 0.294
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.698 ± 0.527
AspCys: 0.617 ± 0.297
AspAsp: 3.55 ± 1.519
AspGlu: 4.013 ± 0.969
AspPhe: 2.006 ± 0.447
AspGly: 2.624 ± 0.565
AspHis: 1.08 ± 0.378
AspIle: 2.47 ± 0.685
AspLys: 2.778 ± 0.636
AspLeu: 6.637 ± 1.023
AspMet: 0.926 ± 0.294
AspAsn: 1.698 ± 0.631
AspPro: 4.322 ± 0.994
AspGln: 2.624 ± 0.643
AspArg: 3.241 ± 0.706
AspSer: 3.396 ± 0.646
AspThr: 2.933 ± 0.56
AspVal: 2.47 ± 0.514
AspTrp: 0.617 ± 0.251
AspTyr: 2.778 ± 0.611
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.704 ± 0.765
GluCys: 0.309 ± 0.343
GluAsp: 3.859 ± 1.097
GluGlu: 5.248 ± 1.092
GluPhe: 2.006 ± 0.559
GluGly: 3.396 ± 0.465
GluHis: 1.235 ± 0.492
GluIle: 5.093 ± 0.992
GluLys: 2.47 ± 0.896
GluLeu: 5.556 ± 1.164
GluMet: 0.617 ± 0.396
GluAsn: 1.698 ± 0.456
GluPro: 2.47 ± 0.489
GluGln: 3.087 ± 0.723
GluArg: 3.55 ± 0.67
GluSer: 5.865 ± 0.907
GluThr: 4.013 ± 0.844
GluVal: 3.241 ± 0.638
GluTrp: 0.309 ± 0.197
GluTyr: 1.08 ± 0.57
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.006 ± 0.468
PheCys: 0.617 ± 0.249
PheAsp: 1.698 ± 0.362
PheGlu: 2.161 ± 0.407
PhePhe: 1.235 ± 0.472
PheGly: 2.47 ± 0.692
PheHis: 0.926 ± 0.256
PheIle: 2.006 ± 0.371
PheLys: 1.852 ± 0.429
PheLeu: 3.859 ± 0.872
PheMet: 1.235 ± 0.417
PheAsn: 1.543 ± 0.502
PhePro: 1.235 ± 0.323
PheGln: 1.389 ± 0.397
PheArg: 1.698 ± 0.311
PheSer: 3.396 ± 0.635
PheThr: 2.315 ± 0.587
PheVal: 1.852 ± 0.418
PheTrp: 0.772 ± 0.221
PheTyr: 1.389 ± 0.373
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.55 ± 0.968
GlyCys: 1.235 ± 0.339
GlyAsp: 3.241 ± 0.613
GlyGlu: 2.624 ± 0.634
GlyPhe: 2.315 ± 0.552
GlyGly: 4.167 ± 1.033
GlyHis: 1.389 ± 0.429
GlyIle: 4.939 ± 0.721
GlyLys: 2.933 ± 0.759
GlyLeu: 5.556 ± 0.838
GlyMet: 0.772 ± 0.404
GlyAsn: 2.933 ± 0.636
GlyPro: 2.624 ± 0.482
GlyGln: 1.698 ± 0.613
GlyArg: 2.778 ± 0.683
GlySer: 4.939 ± 0.836
GlyThr: 4.167 ± 0.896
GlyVal: 3.087 ± 0.68
GlyTrp: 0.772 ± 0.286
GlyTyr: 1.698 ± 0.453
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.926 ± 0.319
HisCys: 0.309 ± 0.153
HisAsp: 0.926 ± 0.303
HisGlu: 0.617 ± 0.198
HisPhe: 1.08 ± 0.585
HisGly: 1.389 ± 0.255
HisHis: 0.463 ± 0.247
HisIle: 1.852 ± 0.321
HisLys: 0.926 ± 0.373
HisLeu: 3.55 ± 0.828
HisMet: 0.617 ± 0.256
HisAsn: 0.926 ± 0.409
HisPro: 2.624 ± 0.787
HisGln: 1.08 ± 0.354
HisArg: 1.698 ± 0.518
HisSer: 1.389 ± 0.437
HisThr: 0.463 ± 0.428
HisVal: 1.543 ± 0.483
HisTrp: 0.0 ± 0.0
HisTyr: 1.235 ± 0.247
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.63 ± 0.835
IleCys: 1.543 ± 0.562
IleAsp: 2.624 ± 0.293
IleGlu: 4.476 ± 0.812
IlePhe: 2.006 ± 0.378
IleGly: 2.315 ± 0.769
IleHis: 1.698 ± 0.574
IleIle: 5.865 ± 1.265
IleLys: 5.865 ± 0.667
IleLeu: 5.865 ± 1.174
IleMet: 1.543 ± 0.448
IleAsn: 2.315 ± 0.497
IlePro: 4.322 ± 1.025
IleGln: 3.859 ± 0.665
IleArg: 4.167 ± 0.527
IleSer: 5.093 ± 0.935
IleThr: 5.556 ± 1.257
IleVal: 5.556 ± 1.173
IleTrp: 0.617 ± 0.212
IleTyr: 3.396 ± 0.409
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.013 ± 0.682
LysCys: 0.772 ± 0.375
LysAsp: 3.241 ± 0.444
LysGlu: 3.55 ± 1.073
LysPhe: 1.08 ± 0.253
LysGly: 3.241 ± 0.758
LysHis: 1.08 ± 0.409
LysIle: 4.322 ± 0.713
LysLys: 2.624 ± 1.35
LysLeu: 4.785 ± 0.734
LysMet: 1.08 ± 0.32
LysAsn: 1.852 ± 0.715
LysPro: 4.013 ± 0.695
LysGln: 1.852 ± 0.526
LysArg: 4.013 ± 1.263
LysSer: 4.785 ± 1.383
LysThr: 3.396 ± 1.079
LysVal: 3.55 ± 0.711
LysTrp: 0.154 ± 0.149
LysTyr: 2.161 ± 0.698
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.174 ± 0.753
LeuCys: 1.698 ± 0.485
LeuAsp: 5.556 ± 0.442
LeuGlu: 5.711 ± 0.662
LeuPhe: 3.396 ± 0.903
LeuGly: 6.482 ± 1.128
LeuHis: 2.161 ± 0.395
LeuIle: 8.643 ± 1.492
LeuLys: 4.939 ± 1.166
LeuLeu: 10.187 ± 1.362
LeuMet: 3.396 ± 0.607
LeuAsn: 4.476 ± 0.778
LeuPro: 6.019 ± 0.992
LeuGln: 3.859 ± 0.699
LeuArg: 5.093 ± 0.991
LeuSer: 8.952 ± 1.58
LeuThr: 6.019 ± 1.051
LeuVal: 5.865 ± 0.989
LeuTrp: 1.08 ± 0.414
LeuTyr: 3.396 ± 0.958
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.698 ± 0.662
MetCys: 0.463 ± 0.292
MetAsp: 2.006 ± 0.779
MetGlu: 1.698 ± 0.449
MetPhe: 0.926 ± 0.305
MetGly: 0.309 ± 0.15
MetHis: 0.772 ± 0.265
MetIle: 2.006 ± 0.732
MetLys: 2.006 ± 0.456
MetLeu: 2.161 ± 0.336
MetMet: 0.463 ± 0.212
MetAsn: 0.617 ± 0.286
MetPro: 1.543 ± 0.344
MetGln: 0.154 ± 0.102
MetArg: 0.926 ± 0.381
MetSer: 2.006 ± 0.558
MetThr: 1.235 ± 0.402
MetVal: 1.389 ± 0.465
MetTrp: 0.309 ± 0.205
MetTyr: 1.235 ± 0.564
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.389 ± 0.391
AsnCys: 1.08 ± 0.353
AsnAsp: 1.543 ± 0.83
AsnGlu: 1.698 ± 0.748
AsnPhe: 1.08 ± 0.319
AsnGly: 2.006 ± 0.355
AsnHis: 0.772 ± 0.357
AsnIle: 3.859 ± 0.783
AsnLys: 1.543 ± 0.586
AsnLeu: 5.248 ± 0.597
AsnMet: 1.389 ± 0.533
AsnAsn: 1.543 ± 0.542
AsnPro: 2.624 ± 0.597
AsnGln: 2.161 ± 0.305
AsnArg: 1.543 ± 0.574
AsnSer: 2.778 ± 1.196
AsnThr: 1.389 ± 0.443
AsnVal: 1.852 ± 0.534
AsnTrp: 0.463 ± 0.228
AsnTyr: 1.08 ± 0.278
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.006 ± 0.483
ProCys: 0.926 ± 0.381
ProAsp: 3.241 ± 0.626
ProGlu: 3.396 ± 0.756
ProPhe: 3.396 ± 0.896
ProGly: 3.241 ± 0.983
ProHis: 1.235 ± 0.387
ProIle: 4.63 ± 0.838
ProLys: 4.167 ± 1.243
ProLeu: 5.556 ± 1.087
ProMet: 1.235 ± 0.348
ProAsn: 1.08 ± 0.357
ProPro: 2.933 ± 0.785
ProGln: 1.543 ± 0.591
ProArg: 2.315 ± 0.314
ProSer: 7.1 ± 1.317
ProThr: 4.013 ± 0.817
ProVal: 2.933 ± 0.84
ProTrp: 0.154 ± 0.176
ProTyr: 1.543 ± 0.408
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.087 ± 0.688
GlnCys: 1.235 ± 0.546
GlnAsp: 2.161 ± 0.408
GlnGlu: 2.624 ± 0.744
GlnPhe: 1.698 ± 0.325
GlnGly: 2.778 ± 0.409
GlnHis: 0.926 ± 0.398
GlnIle: 3.241 ± 0.67
GlnLys: 1.698 ± 0.803
GlnLeu: 3.704 ± 0.689
GlnMet: 0.772 ± 0.207
GlnAsn: 1.852 ± 0.488
GlnPro: 2.006 ± 0.708
GlnGln: 1.852 ± 0.453
GlnArg: 1.698 ± 0.49
GlnSer: 3.241 ± 0.962
GlnThr: 1.852 ± 0.265
GlnVal: 2.624 ± 0.797
GlnTrp: 0.154 ± 0.16
GlnTyr: 1.08 ± 0.238
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.778 ± 0.725
ArgCys: 0.309 ± 0.153
ArgAsp: 2.161 ± 0.436
ArgGlu: 4.322 ± 0.826
ArgPhe: 2.624 ± 0.345
ArgGly: 2.315 ± 0.575
ArgHis: 1.235 ± 0.648
ArgIle: 3.55 ± 0.637
ArgLys: 3.55 ± 1.319
ArgLeu: 5.248 ± 1.131
ArgMet: 1.543 ± 0.31
ArgAsn: 2.315 ± 0.488
ArgPro: 1.698 ± 0.413
ArgGln: 2.315 ± 0.338
ArgArg: 3.704 ± 0.867
ArgSer: 4.63 ± 0.847
ArgThr: 4.167 ± 0.589
ArgVal: 3.55 ± 0.598
ArgTrp: 0.463 ± 0.214
ArgTyr: 1.235 ± 0.607
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.785 ± 0.904
SerCys: 1.698 ± 0.411
SerAsp: 6.174 ± 1.787
SerGlu: 6.791 ± 0.987
SerPhe: 2.315 ± 0.525
SerGly: 6.482 ± 1.253
SerHis: 2.161 ± 0.657
SerIle: 6.328 ± 1.267
SerLys: 3.55 ± 1.207
SerLeu: 9.261 ± 0.694
SerMet: 2.006 ± 0.548
SerAsn: 2.315 ± 0.864
SerPro: 3.55 ± 0.828
SerGln: 2.933 ± 0.736
SerArg: 4.63 ± 0.877
SerSer: 8.643 ± 1.534
SerThr: 6.482 ± 1.231
SerVal: 3.396 ± 0.48
SerTrp: 1.08 ± 0.311
SerTyr: 3.55 ± 0.693
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.859 ± 0.63
ThrCys: 0.772 ± 0.302
ThrAsp: 3.087 ± 0.869
ThrGlu: 3.396 ± 0.617
ThrPhe: 1.389 ± 0.432
ThrGly: 3.55 ± 0.504
ThrHis: 1.08 ± 0.334
ThrIle: 3.55 ± 0.459
ThrLys: 3.704 ± 0.934
ThrLeu: 7.717 ± 1.179
ThrMet: 1.852 ± 0.493
ThrAsn: 2.006 ± 0.537
ThrPro: 4.63 ± 1.021
ThrGln: 2.624 ± 0.475
ThrArg: 3.55 ± 0.459
ThrSer: 4.939 ± 1.117
ThrThr: 4.63 ± 0.981
ThrVal: 4.013 ± 0.931
ThrTrp: 0.772 ± 0.432
ThrTyr: 2.933 ± 0.719
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.778 ± 0.83
ValCys: 0.463 ± 0.38
ValAsp: 2.161 ± 0.697
ValGlu: 1.698 ± 0.409
ValPhe: 2.778 ± 0.683
ValGly: 4.476 ± 0.656
ValHis: 1.08 ± 0.415
ValIle: 3.55 ± 0.73
ValLys: 4.476 ± 0.425
ValLeu: 6.482 ± 0.953
ValMet: 0.926 ± 0.393
ValAsn: 3.087 ± 0.338
ValPro: 2.624 ± 0.487
ValGln: 2.006 ± 0.507
ValArg: 2.778 ± 0.739
ValSer: 5.711 ± 1.086
ValThr: 4.322 ± 0.595
ValVal: 3.396 ± 0.97
ValTrp: 0.617 ± 0.348
ValTyr: 2.624 ± 0.598
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.154 ± 0.102
TrpCys: 0.309 ± 0.197
TrpAsp: 0.772 ± 0.355
TrpGlu: 0.154 ± 0.102
TrpPhe: 0.463 ± 0.24
TrpGly: 0.463 ± 0.293
TrpHis: 0.0 ± 0.0
TrpIle: 0.617 ± 0.256
TrpLys: 0.926 ± 0.386
TrpLeu: 1.235 ± 0.312
TrpMet: 0.309 ± 0.153
TrpAsn: 0.154 ± 0.148
TrpPro: 0.617 ± 0.242
TrpGln: 0.0 ± 0.0
TrpArg: 0.926 ± 0.404
TrpSer: 0.772 ± 0.289
TrpThr: 0.926 ± 0.307
TrpVal: 0.926 ± 0.252
TrpTrp: 0.309 ± 0.15
TrpTyr: 0.154 ± 0.176
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.315 ± 0.462
TyrCys: 0.309 ± 0.193
TyrAsp: 1.08 ± 0.322
TyrGlu: 1.389 ± 0.211
TyrPhe: 1.698 ± 0.458
TyrGly: 2.315 ± 0.619
TyrHis: 1.389 ± 0.427
TyrIle: 2.624 ± 0.737
TyrLys: 2.006 ± 0.698
TyrLeu: 3.087 ± 0.528
TyrMet: 0.617 ± 0.198
TyrAsn: 0.772 ± 0.528
TyrPro: 2.778 ± 0.732
TyrGln: 2.161 ± 0.424
TyrArg: 2.006 ± 0.53
TyrSer: 2.47 ± 0.572
TyrThr: 2.161 ± 0.654
TyrVal: 2.778 ± 0.538
TyrTrp: 0.617 ± 0.201
TyrTyr: 1.235 ± 0.219
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6480 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
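A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's own implementation: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed on each resample, and the error is taken here as the standard deviation of those estimates. The function names, the fixed seed, the use of the population standard deviation, and the counting of pairs within each protein are all assumptions.

```python
import random
from collections import Counter
from statistics import pstdev


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}; pairs are counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return pstdev(estimates)


if __name__ == "__main__":
    # Hypothetical toy sequences in one-letter code, not taken from the proteome.
    toy = ["MLLSLA", "MKLLGS", "MSSLLT", "MAAGLS"]
    print(round(bootstrap_error(toy, "LL"), 3))
```

Resampling at the protein level rather than at the residue level keeps each sequence intact, so the estimate reflects variation between proteins in the set.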

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski