Amino acid dipepetide frequency for Pata virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.929AlaAla: 7.929 ± 1.745
0.324AlaCys: 0.324 ± 0.204
4.531AlaAsp: 4.531 ± 0.544
4.854AlaGlu: 4.854 ± 0.712
2.265AlaPhe: 2.265 ± 0.582
3.074AlaGly: 3.074 ± 0.459
1.618AlaHis: 1.618 ± 0.55
4.693AlaIle: 4.693 ± 0.805
2.589AlaLys: 2.589 ± 0.488
7.929AlaLeu: 7.929 ± 1.23
1.942AlaMet: 1.942 ± 0.784
2.589AlaAsn: 2.589 ± 0.491
2.589AlaPro: 2.589 ± 0.963
2.751AlaGln: 2.751 ± 0.977
4.045AlaArg: 4.045 ± 0.954
2.751AlaSer: 2.751 ± 0.83
4.369AlaThr: 4.369 ± 0.777
4.369AlaVal: 4.369 ± 0.4
1.294AlaTrp: 1.294 ± 0.298
3.236AlaTyr: 3.236 ± 0.773
0.0AlaXaa: 0.0 ± 0.0
Cys
0.647CysAla: 0.647 ± 0.359
0.324CysCys: 0.324 ± 0.175
0.324CysAsp: 0.324 ± 0.198
0.647CysGlu: 0.647 ± 0.443
0.485CysPhe: 0.485 ± 0.341
0.971CysGly: 0.971 ± 0.282
0.162CysHis: 0.162 ± 0.133
0.809CysIle: 0.809 ± 0.269
0.485CysLys: 0.485 ± 0.206
1.456CysLeu: 1.456 ± 0.504
0.162CysMet: 0.162 ± 0.133
0.485CysAsn: 0.485 ± 0.309
0.162CysPro: 0.162 ± 0.175
0.485CysGln: 0.485 ± 0.231
0.324CysArg: 0.324 ± 0.225
0.485CysSer: 0.485 ± 0.204
0.0CysThr: 0.0 ± 0.0
0.485CysVal: 0.485 ± 0.317
0.0CysTrp: 0.0 ± 0.0
0.647CysTyr: 0.647 ± 0.348
0.0CysXaa: 0.0 ± 0.0
Asp
3.883AspAla: 3.883 ± 0.676
0.647AspCys: 0.647 ± 0.191
2.589AspAsp: 2.589 ± 0.856
5.016AspGlu: 5.016 ± 1.086
2.427AspPhe: 2.427 ± 0.82
2.913AspGly: 2.913 ± 0.486
1.133AspHis: 1.133 ± 0.536
3.722AspIle: 3.722 ± 0.794
2.265AspLys: 2.265 ± 0.703
6.472AspLeu: 6.472 ± 0.957
1.133AspMet: 1.133 ± 0.518
1.942AspAsn: 1.942 ± 0.447
1.942AspPro: 1.942 ± 0.606
1.618AspGln: 1.618 ± 0.391
3.56AspArg: 3.56 ± 0.664
2.913AspSer: 2.913 ± 0.648
2.913AspThr: 2.913 ± 0.731
5.663AspVal: 5.663 ± 0.893
0.162AspTrp: 0.162 ± 0.175
1.78AspTyr: 1.78 ± 0.711
0.0AspXaa: 0.0 ± 0.0
Glu
4.693GluAla: 4.693 ± 0.767
0.809GluCys: 0.809 ± 0.522
3.398GluAsp: 3.398 ± 0.561
9.871GluGlu: 9.871 ± 2.546
1.942GluPhe: 1.942 ± 0.831
3.722GluGly: 3.722 ± 0.561
1.294GluHis: 1.294 ± 0.473
5.825GluIle: 5.825 ± 1.216
5.178GluLys: 5.178 ± 0.845
7.282GluLeu: 7.282 ± 0.972
2.913GluMet: 2.913 ± 0.565
2.265GluAsn: 2.265 ± 0.467
2.589GluPro: 2.589 ± 0.846
4.369GluGln: 4.369 ± 1.021
6.149GluArg: 6.149 ± 0.944
4.207GluSer: 4.207 ± 0.788
4.693GluThr: 4.693 ± 0.824
5.34GluVal: 5.34 ± 1.219
0.809GluTrp: 0.809 ± 0.358
3.074GluTyr: 3.074 ± 0.707
0.0GluXaa: 0.0 ± 0.0
Phe
2.265PheAla: 2.265 ± 0.781
0.485PheCys: 0.485 ± 0.214
2.589PheAsp: 2.589 ± 0.494
2.913PheGlu: 2.913 ± 0.552
1.133PhePhe: 1.133 ± 0.423
2.751PheGly: 2.751 ± 0.524
0.971PheHis: 0.971 ± 0.295
3.56PheIle: 3.56 ± 0.855
1.78PheLys: 1.78 ± 0.582
3.398PheLeu: 3.398 ± 0.829
1.133PheMet: 1.133 ± 0.282
0.971PheAsn: 0.971 ± 0.196
1.618PhePro: 1.618 ± 0.545
1.294PheGln: 1.294 ± 0.445
3.722PheArg: 3.722 ± 0.599
2.104PheSer: 2.104 ± 0.573
2.913PheThr: 2.913 ± 0.736
2.913PheVal: 2.913 ± 0.704
0.162PheTrp: 0.162 ± 0.143
1.618PheTyr: 1.618 ± 0.641
0.0PheXaa: 0.0 ± 0.0
Gly
3.56GlyAla: 3.56 ± 0.958
0.485GlyCys: 0.485 ± 0.458
3.074GlyAsp: 3.074 ± 0.773
5.016GlyGlu: 5.016 ± 0.516
2.427GlyPhe: 2.427 ± 0.817
2.751GlyGly: 2.751 ± 0.715
1.294GlyHis: 1.294 ± 0.332
2.913GlyIle: 2.913 ± 0.562
4.854GlyLys: 4.854 ± 1.484
3.398GlyLeu: 3.398 ± 0.928
1.942GlyMet: 1.942 ± 0.535
1.618GlyAsn: 1.618 ± 0.464
2.427GlyPro: 2.427 ± 0.702
1.942GlyGln: 1.942 ± 0.575
4.369GlyArg: 4.369 ± 0.664
3.074GlySer: 3.074 ± 0.648
2.427GlyThr: 2.427 ± 0.537
4.045GlyVal: 4.045 ± 0.83
0.647GlyTrp: 0.647 ± 0.396
2.265GlyTyr: 2.265 ± 0.827
0.0GlyXaa: 0.0 ± 0.0
His
0.971HisAla: 0.971 ± 0.327
0.162HisCys: 0.162 ± 0.167
1.133HisAsp: 1.133 ± 0.335
0.809HisGlu: 0.809 ± 0.403
0.647HisPhe: 0.647 ± 0.343
1.456HisGly: 1.456 ± 0.503
0.647HisHis: 0.647 ± 0.362
1.294HisIle: 1.294 ± 0.718
0.647HisLys: 0.647 ± 0.443
2.104HisLeu: 2.104 ± 0.413
0.809HisMet: 0.809 ± 0.443
1.133HisAsn: 1.133 ± 0.24
1.133HisPro: 1.133 ± 0.425
1.133HisGln: 1.133 ± 0.414
1.942HisArg: 1.942 ± 0.537
0.971HisSer: 0.971 ± 0.224
0.809HisThr: 0.809 ± 0.387
1.133HisVal: 1.133 ± 0.314
0.162HisTrp: 0.162 ± 0.167
1.294HisTyr: 1.294 ± 0.505
0.0HisXaa: 0.0 ± 0.0
Ile
4.854IleAla: 4.854 ± 0.816
0.809IleCys: 0.809 ± 0.326
4.207IleAsp: 4.207 ± 0.748
4.854IleGlu: 4.854 ± 0.832
3.398IlePhe: 3.398 ± 0.731
3.398IleGly: 3.398 ± 0.779
1.942IleHis: 1.942 ± 0.642
3.398IleIle: 3.398 ± 0.658
3.883IleLys: 3.883 ± 0.682
6.796IleLeu: 6.796 ± 0.669
2.427IleMet: 2.427 ± 0.579
2.913IleAsn: 2.913 ± 0.731
3.56IlePro: 3.56 ± 0.903
3.236IleGln: 3.236 ± 0.537
4.207IleArg: 4.207 ± 1.001
4.854IleSer: 4.854 ± 0.389
4.531IleThr: 4.531 ± 0.853
3.236IleVal: 3.236 ± 0.909
1.456IleTrp: 1.456 ± 0.598
1.78IleTyr: 1.78 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
3.56LysAla: 3.56 ± 0.679
0.485LysCys: 0.485 ± 0.314
3.56LysAsp: 3.56 ± 0.887
4.369LysGlu: 4.369 ± 1.714
2.751LysPhe: 2.751 ± 0.582
3.722LysGly: 3.722 ± 0.906
0.971LysHis: 0.971 ± 0.381
3.883LysIle: 3.883 ± 0.856
3.722LysLys: 3.722 ± 0.948
3.883LysLeu: 3.883 ± 0.481
2.265LysMet: 2.265 ± 0.605
2.427LysAsn: 2.427 ± 0.645
1.456LysPro: 1.456 ± 0.619
1.294LysGln: 1.294 ± 0.437
4.045LysArg: 4.045 ± 0.788
1.456LysSer: 1.456 ± 0.56
3.883LysThr: 3.883 ± 1.193
4.854LysVal: 4.854 ± 1.242
0.647LysTrp: 0.647 ± 0.34
3.236LysTyr: 3.236 ± 0.559
0.0LysXaa: 0.0 ± 0.0
Leu
6.958LeuAla: 6.958 ± 1.042
0.971LeuCys: 0.971 ± 0.346
5.016LeuAsp: 5.016 ± 0.898
5.663LeuGlu: 5.663 ± 0.949
3.56LeuPhe: 3.56 ± 0.754
3.883LeuGly: 3.883 ± 0.67
2.104LeuHis: 2.104 ± 0.744
6.634LeuIle: 6.634 ± 0.865
5.34LeuLys: 5.34 ± 1.025
6.634LeuLeu: 6.634 ± 0.723
2.427LeuMet: 2.427 ± 0.51
4.854LeuAsn: 4.854 ± 0.616
4.854LeuPro: 4.854 ± 1.012
3.883LeuGln: 3.883 ± 0.781
8.9LeuArg: 8.9 ± 1.387
6.149LeuSer: 6.149 ± 1.313
5.178LeuThr: 5.178 ± 0.495
5.34LeuVal: 5.34 ± 0.691
0.809LeuTrp: 0.809 ± 0.34
2.427LeuTyr: 2.427 ± 0.674
0.0LeuXaa: 0.0 ± 0.0
Met
1.456MetAla: 1.456 ± 0.373
0.647MetCys: 0.647 ± 0.233
1.294MetAsp: 1.294 ± 0.421
2.104MetGlu: 2.104 ± 0.457
1.618MetPhe: 1.618 ± 0.604
1.294MetGly: 1.294 ± 0.415
0.971MetHis: 0.971 ± 0.414
3.722MetIle: 3.722 ± 0.697
1.78MetLys: 1.78 ± 0.572
4.207MetLeu: 4.207 ± 0.915
1.294MetMet: 1.294 ± 0.397
1.456MetAsn: 1.456 ± 0.333
0.971MetPro: 0.971 ± 0.637
1.294MetGln: 1.294 ± 0.674
2.913MetArg: 2.913 ± 0.65
2.104MetSer: 2.104 ± 0.617
1.133MetThr: 1.133 ± 0.443
1.618MetVal: 1.618 ± 0.48
0.485MetTrp: 0.485 ± 0.228
0.971MetTyr: 0.971 ± 0.594
0.0MetXaa: 0.0 ± 0.0
Asn
2.589AsnAla: 2.589 ± 0.502
0.324AsnCys: 0.324 ± 0.191
2.104AsnAsp: 2.104 ± 0.535
4.045AsnGlu: 4.045 ± 0.63
1.456AsnPhe: 1.456 ± 0.277
3.074AsnGly: 3.074 ± 0.657
0.324AsnHis: 0.324 ± 0.179
3.56AsnIle: 3.56 ± 1.192
0.971AsnLys: 0.971 ± 0.381
4.045AsnLeu: 4.045 ± 0.32
1.618AsnMet: 1.618 ± 0.522
0.971AsnAsn: 0.971 ± 0.407
2.427AsnPro: 2.427 ± 0.589
1.618AsnGln: 1.618 ± 0.429
2.913AsnArg: 2.913 ± 0.641
1.942AsnSer: 1.942 ± 0.693
2.104AsnThr: 2.104 ± 0.615
4.693AsnVal: 4.693 ± 1.19
0.647AsnTrp: 0.647 ± 0.4
0.971AsnTyr: 0.971 ± 0.555
0.0AsnXaa: 0.0 ± 0.0
Pro
2.427ProAla: 2.427 ± 0.549
0.324ProCys: 0.324 ± 0.305
2.265ProAsp: 2.265 ± 0.772
2.913ProGlu: 2.913 ± 0.618
1.78ProPhe: 1.78 ± 0.742
2.751ProGly: 2.751 ± 0.765
0.971ProHis: 0.971 ± 0.478
3.074ProIle: 3.074 ± 0.894
1.618ProLys: 1.618 ± 0.39
4.045ProLeu: 4.045 ± 0.534
1.294ProMet: 1.294 ± 0.586
1.942ProAsn: 1.942 ± 0.722
1.456ProPro: 1.456 ± 0.447
2.104ProGln: 2.104 ± 0.707
2.265ProArg: 2.265 ± 0.667
1.942ProSer: 1.942 ± 0.447
2.265ProThr: 2.265 ± 1.072
1.942ProVal: 1.942 ± 0.525
0.809ProTrp: 0.809 ± 0.242
1.456ProTyr: 1.456 ± 0.416
0.0ProXaa: 0.0 ± 0.0
Gln
2.589GlnAla: 2.589 ± 0.791
0.324GlnCys: 0.324 ± 0.305
1.133GlnAsp: 1.133 ± 0.369
3.883GlnGlu: 3.883 ± 0.79
1.294GlnPhe: 1.294 ± 0.302
2.589GlnGly: 2.589 ± 0.642
0.485GlnHis: 0.485 ± 0.219
4.207GlnIle: 4.207 ± 0.871
2.589GlnLys: 2.589 ± 0.655
3.398GlnLeu: 3.398 ± 0.448
1.942GlnMet: 1.942 ± 0.501
3.074GlnAsn: 3.074 ± 0.651
2.104GlnPro: 2.104 ± 0.605
1.133GlnGln: 1.133 ± 0.287
3.236GlnArg: 3.236 ± 1.102
1.456GlnSer: 1.456 ± 0.405
2.104GlnThr: 2.104 ± 0.59
2.427GlnVal: 2.427 ± 0.463
0.485GlnTrp: 0.485 ± 0.419
1.133GlnTyr: 1.133 ± 0.39
0.0GlnXaa: 0.0 ± 0.0
Arg
6.149ArgAla: 6.149 ± 1.083
0.162ArgCys: 0.162 ± 0.143
4.207ArgAsp: 4.207 ± 0.552
6.472ArgGlu: 6.472 ± 0.718
4.531ArgPhe: 4.531 ± 0.668
3.883ArgGly: 3.883 ± 0.795
0.647ArgHis: 0.647 ± 0.31
5.502ArgIle: 5.502 ± 0.911
4.369ArgLys: 4.369 ± 0.64
6.472ArgLeu: 6.472 ± 1.014
2.589ArgMet: 2.589 ± 0.813
3.398ArgAsn: 3.398 ± 0.835
2.104ArgPro: 2.104 ± 0.475
3.236ArgGln: 3.236 ± 0.544
5.016ArgArg: 5.016 ± 0.91
2.751ArgSer: 2.751 ± 0.825
2.427ArgThr: 2.427 ± 0.478
4.854ArgVal: 4.854 ± 1.071
1.133ArgTrp: 1.133 ± 0.465
2.104ArgTyr: 2.104 ± 0.494
0.0ArgXaa: 0.0 ± 0.0
Ser
3.883SerAla: 3.883 ± 0.615
0.0SerCys: 0.0 ± 0.0
2.751SerAsp: 2.751 ± 0.875
5.016SerGlu: 5.016 ± 0.822
2.751SerPhe: 2.751 ± 0.476
3.398SerGly: 3.398 ± 0.709
0.971SerHis: 0.971 ± 0.47
2.751SerIle: 2.751 ± 0.644
3.236SerLys: 3.236 ± 0.987
4.693SerLeu: 4.693 ± 0.695
1.618SerMet: 1.618 ± 0.576
2.913SerAsn: 2.913 ± 0.914
2.589SerPro: 2.589 ± 0.668
2.265SerGln: 2.265 ± 0.373
3.398SerArg: 3.398 ± 0.89
3.236SerSer: 3.236 ± 0.743
2.913SerThr: 2.913 ± 0.753
2.751SerVal: 2.751 ± 0.538
1.133SerTrp: 1.133 ± 0.38
2.589SerTyr: 2.589 ± 0.74
0.0SerXaa: 0.0 ± 0.0
Thr
3.883ThrAla: 3.883 ± 0.915
0.647ThrCys: 0.647 ± 0.348
2.751ThrAsp: 2.751 ± 1.02
3.56ThrGlu: 3.56 ± 0.727
0.809ThrPhe: 0.809 ± 0.392
3.398ThrGly: 3.398 ± 0.671
1.294ThrHis: 1.294 ± 0.429
3.722ThrIle: 3.722 ± 0.663
4.045ThrLys: 4.045 ± 0.893
5.178ThrLeu: 5.178 ± 0.922
1.294ThrMet: 1.294 ± 0.338
2.104ThrAsn: 2.104 ± 0.576
0.971ThrPro: 0.971 ± 0.297
2.751ThrGln: 2.751 ± 0.483
3.398ThrArg: 3.398 ± 0.595
3.074ThrSer: 3.074 ± 0.764
3.56ThrThr: 3.56 ± 0.74
3.398ThrVal: 3.398 ± 1.016
0.162ThrTrp: 0.162 ± 0.168
2.589ThrTyr: 2.589 ± 0.797
0.0ThrXaa: 0.0 ± 0.0
Val
4.045ValAla: 4.045 ± 1.444
0.809ValCys: 0.809 ± 0.39
3.883ValAsp: 3.883 ± 0.745
3.883ValGlu: 3.883 ± 0.662
3.074ValPhe: 3.074 ± 0.445
2.913ValGly: 2.913 ± 0.554
1.294ValHis: 1.294 ± 0.603
3.56ValIle: 3.56 ± 0.674
4.207ValLys: 4.207 ± 0.956
5.825ValLeu: 5.825 ± 0.95
3.236ValMet: 3.236 ± 0.867
2.427ValAsn: 2.427 ± 0.537
3.074ValPro: 3.074 ± 0.298
3.398ValGln: 3.398 ± 0.509
5.016ValArg: 5.016 ± 1.029
5.34ValSer: 5.34 ± 0.624
2.104ValThr: 2.104 ± 0.631
3.56ValVal: 3.56 ± 0.696
0.809ValTrp: 0.809 ± 0.418
3.236ValTyr: 3.236 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
0.647TrpAla: 0.647 ± 0.173
0.0TrpCys: 0.0 ± 0.0
1.133TrpAsp: 1.133 ± 0.293
1.133TrpGlu: 1.133 ± 0.413
0.647TrpPhe: 0.647 ± 0.248
0.485TrpGly: 0.485 ± 0.201
0.809TrpHis: 0.809 ± 0.427
1.133TrpIle: 1.133 ± 0.42
1.133TrpLys: 1.133 ± 0.289
0.971TrpLeu: 0.971 ± 0.411
0.0TrpMet: 0.0 ± 0.0
0.647TrpAsn: 0.647 ± 0.395
0.162TrpPro: 0.162 ± 0.175
0.162TrpGln: 0.162 ± 0.175
0.809TrpArg: 0.809 ± 0.239
0.647TrpSer: 0.647 ± 0.19
0.485TrpThr: 0.485 ± 0.211
0.647TrpVal: 0.647 ± 0.237
0.324TrpTrp: 0.324 ± 0.191
0.324TrpTyr: 0.324 ± 0.24
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.751TyrAla: 2.751 ± 0.763
0.809TyrCys: 0.809 ± 0.252
2.751TyrAsp: 2.751 ± 0.799
3.398TyrGlu: 3.398 ± 0.804
1.294TyrPhe: 1.294 ± 0.476
2.104TyrGly: 2.104 ± 0.571
0.485TyrHis: 0.485 ± 0.204
1.78TyrIle: 1.78 ± 0.438
1.78TyrLys: 1.78 ± 0.374
3.398TyrLeu: 3.398 ± 0.593
1.133TyrMet: 1.133 ± 0.442
2.265TyrAsn: 2.265 ± 0.585
1.456TyrPro: 1.456 ± 0.565
1.618TyrGln: 1.618 ± 0.611
1.78TyrArg: 1.78 ± 0.521
3.398TyrSer: 3.398 ± 0.794
1.942TyrThr: 1.942 ± 0.524
2.427TyrVal: 2.427 ± 0.524
0.162TyrTrp: 0.162 ± 0.167
1.456TyrTyr: 1.456 ± 0.538
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6181 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski