Amino acid dipeptide frequency for Canine distemper virus (strain Onderstepoort) (CDV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
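As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the one-letter input alphabet, the mapping of every non-standard letter to `X` (Xaa), and the choice of normalizing by the total number of counted pairs are assumptions made for this example; the exact counting and normalization used by Proteome-pI are not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as Xaa ("X").
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Frequency each standard dipeptide would have if all 400 were equally likely, in per mille.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2  # 2.5


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Pairs are overlapping and never span two different proteins.
    Assumption: frequencies are normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not CDV data: the pairs are AA, AA and AC.
    for dp, freq in sorted(dipeptide_permille(["AAAC"]).items()):
        print(f"{dp}: {freq:.3f} per mille (uniform expectation {UNIFORM_PERMILLE})")
```

Counting per protein avoids creating artificial dipeptides at the junction between two sequences.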

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
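The unit conversion can be sketched in a few lines of Python. The three values are taken from the table below; the helper names and the simple comparison against the uniform expectation of 0.25% (2.5‰) are assumptions for illustration only, since the exact rule used for the red and blue marking is not stated here and the sketch ignores the error estimates.

```python
# Uniform expectation for one of the 400 standard dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Hypothetical helper: compare a per mille value with the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 2.606, "IleLeu": 10.826, "TrpTrp": 0.0}

if __name__ == "__main__":
    for name, value in examples.items():
        fraction = permille_to_fraction(value)
        print(f"{name}: {value} per mille = {fraction:.6f} = {fraction * 100:.4f}% ({classify(value)})")
```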

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.606 ± 0.509
AlaCys: 1.403 ± 0.393
AlaAsp: 1.604 ± 0.578
AlaGlu: 2.606 ± 1.049
AlaPhe: 1.403 ± 0.315
AlaGly: 3.208 ± 0.622
AlaHis: 1.002 ± 0.414
AlaIle: 5.213 ± 1.171
AlaLys: 3.609 ± 0.912
AlaLeu: 8.621 ± 1.379
AlaMet: 1.804 ± 0.392
AlaAsn: 2.406 ± 0.42
AlaPro: 1.604 ± 0.436
AlaGln: 2.205 ± 0.979
AlaArg: 2.606 ± 0.694
AlaSer: 6.215 ± 1.19
AlaThr: 3.208 ± 1.267
AlaVal: 3.007 ± 0.226
AlaTrp: 0.2 ± 0.119
AlaTyr: 1.604 ± 0.435
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.002 ± 0.422
CysCys: 0.401 ± 0.187
CysAsp: 0.601 ± 0.265
CysGlu: 0.601 ± 0.237
CysPhe: 0.802 ± 0.301
CysGly: 1.203 ± 0.4
CysHis: 0.2 ± 0.232
CysIle: 1.203 ± 0.37
CysLys: 1.203 ± 0.689
CysLeu: 1.604 ± 0.382
CysMet: 0.0 ± 0.0
CysAsn: 1.403 ± 0.443
CysPro: 0.601 ± 0.247
CysGln: 0.802 ± 0.315
CysArg: 0.601 ± 0.344
CysSer: 1.203 ± 0.373
CysThr: 1.403 ± 0.541
CysVal: 1.203 ± 0.37
CysTrp: 0.0 ± 0.0
CysTyr: 1.403 ± 0.315
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.002 ± 0.258
AspCys: 0.601 ± 0.357
AspAsp: 3.809 ± 0.774
AspGlu: 3.408 ± 0.611
AspPhe: 1.403 ± 0.621
AspGly: 2.606 ± 1.105
AspHis: 1.604 ± 0.56
AspIle: 5.012 ± 0.989
AspLys: 2.606 ± 0.635
AspLeu: 4.812 ± 1.192
AspMet: 1.002 ± 0.52
AspAsn: 3.408 ± 0.662
AspPro: 3.609 ± 0.456
AspGln: 2.205 ± 0.601
AspArg: 3.007 ± 0.501
AspSer: 6.014 ± 1.23
AspThr: 2.406 ± 0.54
AspVal: 2.807 ± 0.652
AspTrp: 0.401 ± 0.238
AspTyr: 1.002 ± 0.417
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.01 ± 0.554
GluCys: 1.002 ± 0.867
GluAsp: 3.809 ± 1.256
GluGlu: 4.411 ± 1.66
GluPhe: 2.406 ± 0.718
GluGly: 3.208 ± 0.625
GluHis: 0.802 ± 0.477
GluIle: 5.413 ± 0.707
GluLys: 2.406 ± 0.471
GluLeu: 4.411 ± 0.769
GluMet: 1.403 ± 0.733
GluAsn: 2.406 ± 0.822
GluPro: 2.005 ± 0.458
GluGln: 1.604 ± 0.363
GluArg: 3.007 ± 0.805
GluSer: 5.814 ± 0.908
GluThr: 3.208 ± 0.812
GluVal: 3.408 ± 0.514
GluTrp: 0.802 ± 0.276
GluTyr: 1.002 ± 0.329
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.604 ± 0.266
PheCys: 0.601 ± 0.291
PheAsp: 2.005 ± 0.338
PheGlu: 1.604 ± 0.422
PhePhe: 1.403 ± 0.498
PheGly: 1.804 ± 0.678
PheHis: 1.002 ± 0.414
PheIle: 3.809 ± 1.011
PheLys: 2.807 ± 0.766
PheLeu: 2.807 ± 0.728
PheMet: 1.203 ± 0.604
PheAsn: 1.604 ± 0.266
PhePro: 0.802 ± 0.475
PheGln: 0.601 ± 0.357
PheArg: 2.205 ± 0.888
PheSer: 2.606 ± 0.61
PheThr: 1.604 ± 0.654
PheVal: 2.205 ± 0.59
PheTrp: 0.601 ± 0.236
PheTyr: 1.002 ± 0.563
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.205 ± 0.364
GlyCys: 0.601 ± 0.281
GlyAsp: 4.21 ± 0.633
GlyGlu: 3.408 ± 0.895
GlyPhe: 3.007 ± 0.763
GlyGly: 3.208 ± 0.661
GlyHis: 1.604 ± 0.475
GlyIle: 6.014 ± 0.688
GlyLys: 2.606 ± 0.585
GlyLeu: 7.017 ± 1.666
GlyMet: 2.406 ± 0.523
GlyAsn: 3.007 ± 1.193
GlyPro: 2.005 ± 0.507
GlyGln: 2.005 ± 0.645
GlyArg: 3.809 ± 1.35
GlySer: 5.413 ± 1.419
GlyThr: 3.208 ± 0.837
GlyVal: 3.809 ± 0.695
GlyTrp: 0.802 ± 0.309
GlyTyr: 2.205 ± 0.501
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.604 ± 0.542
HisCys: 0.601 ± 0.357
HisAsp: 1.203 ± 0.521
HisGlu: 1.203 ± 0.266
HisPhe: 0.601 ± 0.302
HisGly: 1.604 ± 0.382
HisHis: 0.601 ± 0.236
HisIle: 2.406 ± 0.774
HisLys: 0.601 ± 0.457
HisLeu: 1.804 ± 0.73
HisMet: 0.802 ± 0.328
HisAsn: 1.403 ± 0.532
HisPro: 1.002 ± 0.422
HisGln: 1.804 ± 0.622
HisArg: 2.005 ± 0.572
HisSer: 1.203 ± 0.266
HisThr: 0.802 ± 0.427
HisVal: 2.606 ± 0.531
HisTrp: 0.401 ± 0.294
HisTyr: 0.601 ± 0.362
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.814 ± 0.604
IleCys: 0.601 ± 0.251
IleAsp: 3.007 ± 0.427
IleGlu: 5.413 ± 0.295
IlePhe: 2.005 ± 0.477
IleGly: 4.21 ± 0.856
IleHis: 1.403 ± 0.319
IleIle: 5.814 ± 1.204
IleLys: 4.611 ± 0.851
IleLeu: 10.826 ± 1.341
IleMet: 1.403 ± 0.359
IleAsn: 4.611 ± 0.614
IlePro: 3.809 ± 0.621
IleGln: 3.208 ± 1.092
IleArg: 5.613 ± 0.792
IleSer: 6.415 ± 0.996
IleThr: 6.215 ± 1.384
IleVal: 3.609 ± 1.133
IleTrp: 0.401 ± 0.49
IleTyr: 2.406 ± 1.112
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.609 ± 0.319
LysCys: 1.002 ± 0.292
LysAsp: 4.21 ± 0.712
LysGlu: 3.408 ± 0.73
LysPhe: 1.002 ± 0.517
LysGly: 5.012 ± 1.34
LysHis: 1.002 ± 0.316
LysIle: 4.01 ± 1.097
LysLys: 2.807 ± 0.748
LysLeu: 4.411 ± 0.536
LysMet: 1.804 ± 0.678
LysAsn: 1.804 ± 0.323
LysPro: 2.005 ± 0.814
LysGln: 2.406 ± 0.577
LysArg: 3.007 ± 0.927
LysSer: 3.809 ± 0.903
LysThr: 2.005 ± 0.72
LysVal: 3.208 ± 0.647
LysTrp: 0.0 ± 0.0
LysTyr: 1.804 ± 0.871
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.816 ± 0.908
LeuCys: 1.403 ± 0.357
LeuAsp: 5.413 ± 0.816
LeuGlu: 5.814 ± 1.219
LeuPhe: 3.809 ± 0.816
LeuGly: 6.816 ± 1.711
LeuHis: 3.609 ± 0.88
LeuIle: 7.017 ± 1.043
LeuLys: 6.215 ± 1.219
LeuLeu: 10.826 ± 1.213
LeuMet: 2.606 ± 0.949
LeuAsn: 4.611 ± 0.64
LeuPro: 4.01 ± 1.502
LeuGln: 2.606 ± 0.702
LeuArg: 7.217 ± 1.618
LeuSer: 8.019 ± 1.042
LeuThr: 7.819 ± 0.993
LeuVal: 6.816 ± 0.604
LeuTrp: 1.604 ± 0.277
LeuTyr: 2.606 ± 0.979
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.203 ± 0.266
MetCys: 0.2 ± 0.232
MetAsp: 1.203 ± 0.472
MetGlu: 1.203 ± 0.694
MetPhe: 0.802 ± 0.61
MetGly: 1.804 ± 0.831
MetHis: 0.2 ± 0.194
MetIle: 2.606 ± 0.775
MetLys: 1.002 ± 0.414
MetLeu: 2.807 ± 0.695
MetMet: 0.401 ± 0.283
MetAsn: 1.002 ± 0.35
MetPro: 0.802 ± 0.234
MetGln: 0.601 ± 0.251
MetArg: 0.802 ± 0.496
MetSer: 2.205 ± 0.654
MetThr: 1.804 ± 0.426
MetVal: 2.205 ± 0.847
MetTrp: 0.2 ± 0.119
MetTyr: 1.203 ± 0.395
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.606 ± 0.933
AsnCys: 1.804 ± 0.584
AsnAsp: 1.804 ± 0.559
AsnGlu: 1.804 ± 0.382
AsnPhe: 1.804 ± 0.385
AsnGly: 2.005 ± 0.394
AsnHis: 1.604 ± 0.47
AsnIle: 3.609 ± 0.353
AsnLys: 2.005 ± 1.097
AsnLeu: 4.411 ± 0.806
AsnMet: 1.203 ± 0.411
AsnAsn: 1.002 ± 0.473
AsnPro: 3.408 ± 1.072
AsnGln: 3.408 ± 0.433
AsnArg: 1.604 ± 0.834
AsnSer: 4.21 ± 1.037
AsnThr: 1.203 ± 0.551
AsnVal: 1.604 ± 0.65
AsnTrp: 1.002 ± 0.407
AsnTyr: 1.604 ± 0.372
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.606 ± 0.723
ProCys: 0.2 ± 0.119
ProAsp: 3.208 ± 0.505
ProGlu: 1.804 ± 0.59
ProPhe: 0.802 ± 0.402
ProGly: 3.208 ± 1.326
ProHis: 1.203 ± 0.414
ProIle: 3.809 ± 0.436
ProLys: 3.007 ± 0.564
ProLeu: 4.01 ± 1.323
ProMet: 1.403 ± 0.429
ProAsn: 1.604 ± 0.495
ProPro: 3.208 ± 0.792
ProGln: 1.403 ± 0.712
ProArg: 3.208 ± 0.449
ProSer: 3.609 ± 0.418
ProThr: 1.403 ± 0.561
ProVal: 2.205 ± 0.481
ProTrp: 0.2 ± 0.307
ProTyr: 1.804 ± 0.583
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.807 ± 1.297
GlnCys: 0.802 ± 0.374
GlnAsp: 2.406 ± 0.326
GlnGlu: 2.205 ± 0.794
GlnPhe: 1.403 ± 0.606
GlnGly: 1.604 ± 0.507
GlnHis: 0.601 ± 0.581
GlnIle: 3.007 ± 0.67
GlnLys: 1.604 ± 0.783
GlnLeu: 4.21 ± 0.686
GlnMet: 0.802 ± 0.266
GlnAsn: 1.403 ± 0.591
GlnPro: 2.005 ± 0.938
GlnGln: 2.005 ± 0.754
GlnArg: 2.606 ± 0.49
GlnSer: 3.408 ± 0.659
GlnThr: 2.205 ± 0.583
GlnVal: 2.005 ± 0.811
GlnTrp: 0.401 ± 0.238
GlnTyr: 1.002 ± 0.422
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.606 ± 0.558
ArgCys: 1.002 ± 0.34
ArgAsp: 3.208 ± 1.052
ArgGlu: 2.807 ± 0.893
ArgPhe: 2.406 ± 0.648
ArgGly: 4.01 ± 0.457
ArgHis: 1.604 ± 0.785
ArgIle: 3.609 ± 0.892
ArgLys: 3.007 ± 0.403
ArgLeu: 7.418 ± 1.194
ArgMet: 1.403 ± 0.478
ArgAsn: 3.007 ± 0.972
ArgPro: 2.406 ± 0.606
ArgGln: 1.403 ± 0.543
ArgArg: 4.611 ± 1.026
ArgSer: 5.413 ± 1.166
ArgThr: 4.21 ± 1.262
ArgVal: 2.807 ± 0.79
ArgTrp: 0.401 ± 0.305
ArgTyr: 2.606 ± 0.739
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.814 ± 1.217
SerCys: 1.604 ± 0.692
SerAsp: 3.609 ± 0.354
SerGlu: 5.413 ± 1.852
SerPhe: 3.007 ± 0.966
SerGly: 7.618 ± 0.828
SerHis: 2.807 ± 0.85
SerIle: 6.014 ± 0.64
SerLys: 4.21 ± 0.972
SerLeu: 9.623 ± 0.978
SerMet: 2.005 ± 0.456
SerAsn: 3.408 ± 0.805
SerPro: 2.606 ± 0.748
SerGln: 4.01 ± 0.671
SerArg: 5.213 ± 0.189
SerSer: 6.415 ± 1.247
SerThr: 6.616 ± 1.406
SerVal: 4.812 ± 0.552
SerTrp: 1.002 ± 0.35
SerTyr: 3.007 ± 0.662
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.21 ± 0.535
ThrCys: 0.401 ± 0.307
ThrAsp: 1.604 ± 0.576
ThrGlu: 3.609 ± 0.477
ThrPhe: 1.804 ± 0.595
ThrGly: 3.208 ± 0.444
ThrHis: 1.403 ± 0.543
ThrIle: 5.613 ± 0.775
ThrLys: 4.01 ± 0.501
ThrLeu: 4.611 ± 0.632
ThrMet: 1.203 ± 0.406
ThrAsn: 2.205 ± 0.781
ThrPro: 1.604 ± 0.609
ThrGln: 2.807 ± 0.649
ThrArg: 3.007 ± 0.374
ThrSer: 6.415 ± 1.232
ThrThr: 4.01 ± 0.517
ThrVal: 3.208 ± 0.574
ThrTrp: 1.403 ± 0.47
ThrTyr: 2.005 ± 0.789
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.005 ± 0.692
ValCys: 1.002 ± 0.318
ValAsp: 2.807 ± 0.486
ValGlu: 3.809 ± 0.722
ValPhe: 2.406 ± 0.774
ValGly: 4.411 ± 0.538
ValHis: 1.804 ± 0.492
ValIle: 5.413 ± 0.742
ValLys: 2.205 ± 0.485
ValLeu: 5.613 ± 1.293
ValMet: 0.2 ± 0.119
ValAsn: 1.203 ± 0.283
ValPro: 2.606 ± 0.547
ValGln: 1.604 ± 0.544
ValArg: 4.01 ± 0.673
ValSer: 6.014 ± 0.931
ValThr: 2.606 ± 0.698
ValVal: 1.604 ± 0.437
ValTrp: 0.2 ± 0.232
ValTyr: 3.408 ± 0.705
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.802 ± 0.315
TrpCys: 0.601 ± 0.492
TrpAsp: 0.802 ± 0.315
TrpGlu: 0.2 ± 0.232
TrpPhe: 0.802 ± 0.477
TrpGly: 0.401 ± 0.2
TrpHis: 0.0 ± 0.0
TrpIle: 0.401 ± 0.49
TrpLys: 0.401 ± 0.274
TrpLeu: 1.403 ± 0.473
TrpMet: 0.2 ± 0.232
TrpAsn: 0.601 ± 0.283
TrpPro: 0.2 ± 0.119
TrpGln: 0.2 ± 0.119
TrpArg: 0.601 ± 0.344
TrpSer: 0.802 ± 0.496
TrpThr: 0.601 ± 0.357
TrpVal: 0.401 ± 0.238
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.601 ± 0.43
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.203 ± 0.433
TyrCys: 1.604 ± 0.386
TyrAsp: 2.205 ± 0.514
TyrGlu: 1.804 ± 0.584
TyrPhe: 0.802 ± 0.496
TyrGly: 1.403 ± 0.618
TyrHis: 0.802 ± 0.256
TyrIle: 1.403 ± 0.381
TyrLys: 1.403 ± 0.302
TyrLeu: 4.411 ± 1.398
TyrMet: 1.002 ± 0.328
TyrAsn: 1.604 ± 0.583
TyrPro: 3.609 ± 0.747
TyrGln: 1.604 ± 0.208
TyrArg: 1.203 ± 0.374
TyrSer: 3.609 ± 0.935
TyrThr: 1.804 ± 0.574
TyrVal: 1.403 ± 0.502
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.203 ± 0.395
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4989 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
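Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. This is only an illustration under stated assumptions; the function names, the fixed random seed, and the use of the sample standard deviation as the reported error are choices made for this example and are not confirmed details of the Proteome-pI procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    Assumption: the error is the standard deviation over n_boot resamples.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not CDV data.
    toy = ["AAAA", "CCCC", "ACAC"]
    print(f"AA error: {bootstrap_error(toy, 'AA'):.3f} per mille")
```

A dipeptide that never occurs in any protein has a frequency of zero in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.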

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski