Amino acid dipepetide frequency for Bat paramyxovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.922AlaAla: 2.922 ± 0.341
0.73AlaCys: 0.73 ± 0.457
2.557AlaAsp: 2.557 ± 0.942
2.374AlaGlu: 2.374 ± 0.59
1.278AlaPhe: 1.278 ± 0.253
3.287AlaGly: 3.287 ± 1.077
0.913AlaHis: 0.913 ± 0.236
5.844AlaIle: 5.844 ± 1.528
3.104AlaLys: 3.104 ± 0.616
5.844AlaLeu: 5.844 ± 1.192
2.009AlaMet: 2.009 ± 0.555
3.652AlaAsn: 3.652 ± 0.414
1.644AlaPro: 1.644 ± 0.636
2.009AlaGln: 2.009 ± 0.347
1.826AlaArg: 1.826 ± 0.441
3.835AlaSer: 3.835 ± 0.63
2.374AlaThr: 2.374 ± 0.551
2.374AlaVal: 2.374 ± 0.604
0.548AlaTrp: 0.548 ± 0.246
1.644AlaTyr: 1.644 ± 0.613
0.0AlaXaa: 0.0 ± 0.0
Cys
0.365CysAla: 0.365 ± 0.176
0.365CysCys: 0.365 ± 0.209
0.73CysAsp: 0.73 ± 0.23
0.548CysGlu: 0.548 ± 0.217
0.73CysPhe: 0.73 ± 0.33
0.548CysGly: 0.548 ± 0.245
0.0CysHis: 0.0 ± 0.0
1.461CysIle: 1.461 ± 0.632
0.548CysLys: 0.548 ± 0.321
1.096CysLeu: 1.096 ± 0.406
0.365CysMet: 0.365 ± 0.306
0.913CysAsn: 0.913 ± 0.779
0.73CysPro: 0.73 ± 0.395
0.73CysGln: 0.73 ± 0.352
0.365CysArg: 0.365 ± 0.228
1.826CysSer: 1.826 ± 0.795
0.548CysThr: 0.548 ± 0.381
0.365CysVal: 0.365 ± 0.176
0.0CysTrp: 0.0 ± 0.0
0.73CysTyr: 0.73 ± 0.245
0.0CysXaa: 0.0 ± 0.0
Asp
2.191AspAla: 2.191 ± 0.741
1.096AspCys: 1.096 ± 0.361
4.565AspAsp: 4.565 ± 1.077
4.018AspGlu: 4.018 ± 1.094
3.47AspPhe: 3.47 ± 0.487
2.739AspGly: 2.739 ± 0.861
1.826AspHis: 1.826 ± 0.829
3.835AspIle: 3.835 ± 0.689
4.748AspLys: 4.748 ± 0.901
6.757AspLeu: 6.757 ± 0.856
1.278AspMet: 1.278 ± 0.462
3.287AspAsn: 3.287 ± 0.848
2.191AspPro: 2.191 ± 0.465
3.104AspGln: 3.104 ± 1.071
1.826AspArg: 1.826 ± 0.49
3.47AspSer: 3.47 ± 0.779
3.104AspThr: 3.104 ± 0.58
3.47AspVal: 3.47 ± 0.437
1.096AspTrp: 1.096 ± 0.38
2.739AspTyr: 2.739 ± 0.755
0.0AspXaa: 0.0 ± 0.0
Glu
2.374GluAla: 2.374 ± 0.528
0.365GluCys: 0.365 ± 0.295
3.104GluAsp: 3.104 ± 0.733
4.018GluGlu: 4.018 ± 1.087
3.104GluPhe: 3.104 ± 0.917
3.287GluGly: 3.287 ± 0.871
0.913GluHis: 0.913 ± 0.411
4.931GluIle: 4.931 ± 0.676
3.835GluLys: 3.835 ± 0.927
4.931GluLeu: 4.931 ± 0.687
1.278GluMet: 1.278 ± 0.235
4.748GluAsn: 4.748 ± 1.301
1.644GluPro: 1.644 ± 0.532
2.374GluGln: 2.374 ± 0.418
2.009GluArg: 2.009 ± 0.388
5.844GluSer: 5.844 ± 0.912
3.287GluThr: 3.287 ± 0.89
2.922GluVal: 2.922 ± 0.896
0.548GluTrp: 0.548 ± 0.217
2.191GluTyr: 2.191 ± 0.845
0.0GluXaa: 0.0 ± 0.0
Phe
1.461PheAla: 1.461 ± 0.494
0.548PheCys: 0.548 ± 0.343
2.922PheAsp: 2.922 ± 0.809
1.461PheGlu: 1.461 ± 0.519
1.644PhePhe: 1.644 ± 0.785
2.922PheGly: 2.922 ± 0.552
0.73PheHis: 0.73 ± 0.345
1.278PheIle: 1.278 ± 0.305
2.191PheLys: 2.191 ± 0.62
4.748PheLeu: 4.748 ± 1.317
0.913PheMet: 0.913 ± 0.284
3.652PheAsn: 3.652 ± 0.578
1.278PhePro: 1.278 ± 0.268
1.461PheGln: 1.461 ± 0.332
4.018PheArg: 4.018 ± 1.02
2.191PheSer: 2.191 ± 0.739
1.278PheThr: 1.278 ± 0.422
2.374PheVal: 2.374 ± 0.43
0.73PheTrp: 0.73 ± 0.299
0.548PheTyr: 0.548 ± 0.351
0.0PheXaa: 0.0 ± 0.0
Gly
3.287GlyAla: 3.287 ± 0.845
0.913GlyCys: 0.913 ± 0.418
3.104GlyAsp: 3.104 ± 0.908
2.009GlyGlu: 2.009 ± 1.014
1.826GlyPhe: 1.826 ± 0.699
3.47GlyGly: 3.47 ± 1.184
1.096GlyHis: 1.096 ± 0.392
3.835GlyIle: 3.835 ± 1.016
2.739GlyLys: 2.739 ± 1.077
5.113GlyLeu: 5.113 ± 0.88
1.461GlyMet: 1.461 ± 0.476
3.104GlyAsn: 3.104 ± 0.949
2.557GlyPro: 2.557 ± 0.771
1.096GlyGln: 1.096 ± 0.652
3.287GlyArg: 3.287 ± 0.614
4.565GlySer: 4.565 ± 0.503
1.826GlyThr: 1.826 ± 0.507
3.287GlyVal: 3.287 ± 1.121
0.365GlyTrp: 0.365 ± 0.218
2.009GlyTyr: 2.009 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
0.913HisAla: 0.913 ± 0.304
0.183HisCys: 0.183 ± 0.114
0.365HisAsp: 0.365 ± 0.228
0.913HisGlu: 0.913 ± 0.28
0.73HisPhe: 0.73 ± 0.33
0.913HisGly: 0.913 ± 0.372
0.365HisHis: 0.365 ± 0.194
1.826HisIle: 1.826 ± 0.559
1.461HisLys: 1.461 ± 0.225
1.826HisLeu: 1.826 ± 0.444
0.548HisMet: 0.548 ± 0.275
1.461HisAsn: 1.461 ± 0.385
1.461HisPro: 1.461 ± 0.419
0.548HisGln: 0.548 ± 0.343
0.73HisArg: 0.73 ± 0.345
1.278HisSer: 1.278 ± 0.363
0.73HisThr: 0.73 ± 0.682
2.009HisVal: 2.009 ± 0.487
0.183HisTrp: 0.183 ± 0.114
0.73HisTyr: 0.73 ± 0.293
0.0HisXaa: 0.0 ± 0.0
Ile
4.383IleAla: 4.383 ± 0.773
1.461IleCys: 1.461 ± 0.461
5.844IleAsp: 5.844 ± 0.904
4.931IleGlu: 4.931 ± 0.841
2.191IlePhe: 2.191 ± 0.759
3.835IleGly: 3.835 ± 0.981
1.278IleHis: 1.278 ± 0.554
7.122IleIle: 7.122 ± 0.836
7.305IleLys: 7.305 ± 1.008
8.218IleLeu: 8.218 ± 1.316
2.739IleMet: 2.739 ± 0.789
6.026IleAsn: 6.026 ± 1.193
4.018IlePro: 4.018 ± 0.517
2.922IleGln: 2.922 ± 0.787
3.104IleArg: 3.104 ± 0.651
7.122IleSer: 7.122 ± 0.956
5.661IleThr: 5.661 ± 1.163
3.47IleVal: 3.47 ± 0.667
0.548IleTrp: 0.548 ± 0.223
3.104IleTyr: 3.104 ± 0.858
0.0IleXaa: 0.0 ± 0.0
Lys
3.835LysAla: 3.835 ± 1.025
0.73LysCys: 0.73 ± 0.294
4.2LysAsp: 4.2 ± 0.675
4.748LysGlu: 4.748 ± 1.123
2.009LysPhe: 2.009 ± 0.669
3.47LysGly: 3.47 ± 1.504
0.913LysHis: 0.913 ± 0.439
6.757LysIle: 6.757 ± 1.023
5.478LysLys: 5.478 ± 1.136
7.305LysLeu: 7.305 ± 1.221
1.826LysMet: 1.826 ± 0.473
5.296LysAsn: 5.296 ± 0.922
2.374LysPro: 2.374 ± 0.22
0.913LysGln: 0.913 ± 0.353
4.018LysArg: 4.018 ± 0.456
4.931LysSer: 4.931 ± 1.106
2.557LysThr: 2.557 ± 0.862
4.383LysVal: 4.383 ± 1.091
0.548LysTrp: 0.548 ± 0.261
2.374LysTyr: 2.374 ± 0.415
0.0LysXaa: 0.0 ± 0.0
Leu
4.565LeuAla: 4.565 ± 0.947
0.73LeuCys: 0.73 ± 0.434
4.748LeuAsp: 4.748 ± 1.25
6.209LeuGlu: 6.209 ± 0.985
4.565LeuPhe: 4.565 ± 0.739
4.2LeuGly: 4.2 ± 0.665
1.644LeuHis: 1.644 ± 0.516
7.122LeuIle: 7.122 ± 1.186
8.583LeuLys: 8.583 ± 0.993
6.574LeuLeu: 6.574 ± 1.23
2.739LeuMet: 2.739 ± 0.874
6.574LeuAsn: 6.574 ± 1.088
2.557LeuPro: 2.557 ± 0.698
5.478LeuGln: 5.478 ± 0.606
4.2LeuArg: 4.2 ± 0.545
8.218LeuSer: 8.218 ± 1.494
6.209LeuThr: 6.209 ± 1.047
4.931LeuVal: 4.931 ± 1.201
0.73LeuTrp: 0.73 ± 0.232
3.287LeuTyr: 3.287 ± 0.861
0.0LeuXaa: 0.0 ± 0.0
Met
2.191MetAla: 2.191 ± 0.906
0.183MetCys: 0.183 ± 0.114
1.826MetAsp: 1.826 ± 0.611
1.461MetGlu: 1.461 ± 0.378
0.548MetPhe: 0.548 ± 0.217
0.73MetGly: 0.73 ± 0.341
0.365MetHis: 0.365 ± 0.228
2.009MetIle: 2.009 ± 0.359
2.009MetLys: 2.009 ± 0.542
2.191MetLeu: 2.191 ± 0.461
1.096MetMet: 1.096 ± 0.414
1.096MetAsn: 1.096 ± 0.554
0.365MetPro: 0.365 ± 0.191
0.183MetGln: 0.183 ± 0.233
1.461MetArg: 1.461 ± 0.583
2.557MetSer: 2.557 ± 0.88
2.374MetThr: 2.374 ± 0.5
2.009MetVal: 2.009 ± 0.591
0.548MetTrp: 0.548 ± 0.275
1.461MetTyr: 1.461 ± 0.419
0.0MetXaa: 0.0 ± 0.0
Asn
3.287AsnAla: 3.287 ± 1.062
0.365AsnCys: 0.365 ± 0.405
4.2AsnAsp: 4.2 ± 0.432
3.47AsnGlu: 3.47 ± 0.509
1.278AsnPhe: 1.278 ± 0.427
2.739AsnGly: 2.739 ± 0.642
1.644AsnHis: 1.644 ± 0.435
5.478AsnIle: 5.478 ± 0.737
3.835AsnLys: 3.835 ± 1.479
7.67AsnLeu: 7.67 ± 1.781
1.278AsnMet: 1.278 ± 0.696
3.287AsnAsn: 3.287 ± 0.525
4.2AsnPro: 4.2 ± 0.817
4.383AsnGln: 4.383 ± 1.075
3.47AsnArg: 3.47 ± 0.617
4.383AsnSer: 4.383 ± 0.895
3.47AsnThr: 3.47 ± 0.896
2.009AsnVal: 2.009 ± 0.508
0.913AsnTrp: 0.913 ± 0.571
3.287AsnTyr: 3.287 ± 0.827
0.0AsnXaa: 0.0 ± 0.0
Pro
1.644ProAla: 1.644 ± 0.348
0.183ProCys: 0.183 ± 0.114
1.644ProAsp: 1.644 ± 0.487
2.739ProGlu: 2.739 ± 0.722
1.826ProPhe: 1.826 ± 0.617
1.826ProGly: 1.826 ± 0.597
0.73ProHis: 0.73 ± 0.318
3.835ProIle: 3.835 ± 0.785
3.47ProLys: 3.47 ± 0.774
2.739ProLeu: 2.739 ± 0.69
1.278ProMet: 1.278 ± 0.493
2.557ProAsn: 2.557 ± 0.499
1.278ProPro: 1.278 ± 0.293
1.096ProGln: 1.096 ± 0.592
2.739ProArg: 2.739 ± 0.703
3.104ProSer: 3.104 ± 0.784
2.739ProThr: 2.739 ± 0.731
1.461ProVal: 1.461 ± 0.456
0.365ProTrp: 0.365 ± 0.295
2.009ProTyr: 2.009 ± 0.588
0.0ProXaa: 0.0 ± 0.0
Gln
1.826GlnAla: 1.826 ± 0.415
0.73GlnCys: 0.73 ± 0.283
2.739GlnAsp: 2.739 ± 0.737
3.104GlnGlu: 3.104 ± 1.044
0.548GlnPhe: 0.548 ± 0.245
2.009GlnGly: 2.009 ± 0.637
1.278GlnHis: 1.278 ± 0.263
4.018GlnIle: 4.018 ± 0.827
2.191GlnLys: 2.191 ± 0.461
3.287GlnLeu: 3.287 ± 0.946
1.461GlnMet: 1.461 ± 0.642
1.644GlnAsn: 1.644 ± 0.966
1.278GlnPro: 1.278 ± 0.48
1.096GlnGln: 1.096 ± 0.522
1.644GlnArg: 1.644 ± 0.411
3.104GlnSer: 3.104 ± 1.174
1.826GlnThr: 1.826 ± 0.609
1.644GlnVal: 1.644 ± 0.518
0.183GlnTrp: 0.183 ± 0.163
1.278GlnTyr: 1.278 ± 0.739
0.0GlnXaa: 0.0 ± 0.0
Arg
2.922ArgAla: 2.922 ± 0.58
0.73ArgCys: 0.73 ± 0.407
3.104ArgAsp: 3.104 ± 0.904
2.374ArgGlu: 2.374 ± 0.328
3.104ArgPhe: 3.104 ± 0.388
1.826ArgGly: 1.826 ± 0.599
0.73ArgHis: 0.73 ± 0.318
3.835ArgIle: 3.835 ± 0.585
3.287ArgLys: 3.287 ± 0.711
4.748ArgLeu: 4.748 ± 1.263
0.913ArgMet: 0.913 ± 0.261
2.739ArgAsn: 2.739 ± 0.58
2.739ArgPro: 2.739 ± 0.476
0.913ArgGln: 0.913 ± 0.459
4.565ArgArg: 4.565 ± 1.258
4.931ArgSer: 4.931 ± 0.741
2.922ArgThr: 2.922 ± 0.966
3.287ArgVal: 3.287 ± 0.708
1.096ArgTrp: 1.096 ± 0.355
1.096ArgTyr: 1.096 ± 0.462
0.0ArgXaa: 0.0 ± 0.0
Ser
2.922SerAla: 2.922 ± 0.659
1.278SerCys: 1.278 ± 0.552
4.565SerAsp: 4.565 ± 0.91
4.931SerGlu: 4.931 ± 0.86
4.2SerPhe: 4.2 ± 0.6
4.748SerGly: 4.748 ± 0.534
2.009SerHis: 2.009 ± 0.487
7.487SerIle: 7.487 ± 1.824
3.835SerLys: 3.835 ± 0.588
7.305SerLeu: 7.305 ± 1.019
2.009SerMet: 2.009 ± 0.639
6.209SerAsn: 6.209 ± 0.821
2.374SerPro: 2.374 ± 0.538
3.104SerGln: 3.104 ± 0.504
4.2SerArg: 4.2 ± 1.159
8.583SerSer: 8.583 ± 1.729
6.392SerThr: 6.392 ± 0.515
2.922SerVal: 2.922 ± 0.503
1.096SerTrp: 1.096 ± 0.373
2.191SerTyr: 2.191 ± 0.499
0.0SerXaa: 0.0 ± 0.0
Thr
3.652ThrAla: 3.652 ± 0.799
0.548ThrCys: 0.548 ± 0.449
5.113ThrAsp: 5.113 ± 0.496
3.47ThrGlu: 3.47 ± 0.86
1.278ThrPhe: 1.278 ± 0.493
3.104ThrGly: 3.104 ± 0.953
0.548ThrHis: 0.548 ± 0.269
5.478ThrIle: 5.478 ± 0.339
3.652ThrLys: 3.652 ± 0.575
5.478ThrLeu: 5.478 ± 0.77
0.913ThrMet: 0.913 ± 0.445
3.47ThrAsn: 3.47 ± 0.782
1.644ThrPro: 1.644 ± 0.309
2.557ThrGln: 2.557 ± 0.615
3.104ThrArg: 3.104 ± 1.054
4.565ThrSer: 4.565 ± 0.707
4.565ThrThr: 4.565 ± 1.423
3.652ThrVal: 3.652 ± 0.72
0.365ThrTrp: 0.365 ± 0.228
1.644ThrTyr: 1.644 ± 0.709
0.0ThrXaa: 0.0 ± 0.0
Val
3.287ValAla: 3.287 ± 0.59
0.548ValCys: 0.548 ± 0.275
2.922ValAsp: 2.922 ± 1.136
3.104ValGlu: 3.104 ± 0.444
1.644ValPhe: 1.644 ± 0.519
2.739ValGly: 2.739 ± 0.608
1.278ValHis: 1.278 ± 0.384
4.565ValIle: 4.565 ± 0.598
3.287ValLys: 3.287 ± 1.061
3.47ValLeu: 3.47 ± 1.023
1.096ValMet: 1.096 ± 0.345
3.104ValAsn: 3.104 ± 0.643
3.104ValPro: 3.104 ± 0.988
1.278ValGln: 1.278 ± 0.336
3.287ValArg: 3.287 ± 1.053
2.739ValSer: 2.739 ± 0.68
4.018ValThr: 4.018 ± 0.734
3.104ValVal: 3.104 ± 1.086
0.913ValTrp: 0.913 ± 0.549
2.739ValTyr: 2.739 ± 0.536
0.0ValXaa: 0.0 ± 0.0
Trp
0.73TrpAla: 0.73 ± 0.279
0.365TrpCys: 0.365 ± 0.283
0.365TrpAsp: 0.365 ± 0.228
0.913TrpGlu: 0.913 ± 0.349
0.548TrpPhe: 0.548 ± 0.343
0.365TrpGly: 0.365 ± 0.318
0.183TrpHis: 0.183 ± 0.203
0.913TrpIle: 0.913 ± 0.459
0.548TrpLys: 0.548 ± 0.282
0.913TrpLeu: 0.913 ± 0.299
0.365TrpMet: 0.365 ± 0.176
0.365TrpAsn: 0.365 ± 0.218
0.183TrpPro: 0.183 ± 0.114
0.183TrpGln: 0.183 ± 0.114
0.365TrpArg: 0.365 ± 0.218
1.826TrpSer: 1.826 ± 0.656
0.548TrpThr: 0.548 ± 0.282
0.913TrpVal: 0.913 ± 0.237
0.365TrpTrp: 0.365 ± 0.174
0.548TrpTyr: 0.548 ± 0.245
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.009TyrAla: 2.009 ± 0.71
0.913TyrCys: 0.913 ± 0.349
2.191TyrAsp: 2.191 ± 0.496
0.913TyrGlu: 0.913 ± 0.429
1.826TyrPhe: 1.826 ± 0.386
2.191TyrGly: 2.191 ± 0.431
0.73TyrHis: 0.73 ± 0.414
3.652TyrIle: 3.652 ± 0.486
2.557TyrLys: 2.557 ± 0.797
3.652TyrLeu: 3.652 ± 0.614
0.913TyrMet: 0.913 ± 0.46
1.461TyrAsn: 1.461 ± 0.225
1.826TyrPro: 1.826 ± 0.662
1.461TyrGln: 1.461 ± 0.479
1.644TyrArg: 1.644 ± 0.612
3.287TyrSer: 3.287 ± 0.608
2.374TyrThr: 2.374 ± 0.727
1.644TyrVal: 1.644 ± 0.591
0.365TyrTrp: 0.365 ± 0.209
1.644TyrTyr: 1.644 ± 0.573
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5477 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski