Amino acid dipeptide frequency for Human respiratory syncytial virus B (strain B1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 1/400 = 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
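As a rough illustration of the idea above, the following Python sketch counts overlapping residue pairs and compares each frequency with the uniform expectation. It is not the code used to build this table: the function names `dipeptide_per_mille` and `classify` are hypothetical, the sequences are toy data, and normalising by the total number of pairs counted within each protein is an assumption.

```python
from collections import Counter

# One-letter codes for the 20 standard amino acids; X stands in for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping pairs are counted within each protein (never
    across protein boundaries) and normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is pooled as X (Xaa).
            first = first if first in ALPHABET else "X"
            second = second if second in ALPHABET else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, n_letters=20):
    """Label a frequency relative to the uniform expectation.

    For 20 letters the expectation is 1000 / 400 = 2.5 per mille (0.25%).
    """
    expected = 1000.0 / n_letters ** 2
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy sequences, not taken from this proteome.
toy = ["MKLLA", "LLK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With real data, `classify` would reproduce the red/blue split only if the page uses the same uniform 2.5‰ threshold, which is an assumption here.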

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
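To make the unit handling concrete, here is a small Python sketch that converts a table value from per mille to a fraction and to percent, and expresses the uniform expectation in the same unit. The helper names are hypothetical; the LeuLeu value is the one listed in the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


# LeuLeu value as listed in the table, in per mille.
leu_leu = 10.228

# Uniform expectation for 400 dipeptides, expressed in per mille (0.25%).
uniform_per_mille = 1000.0 / 400

print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
print(leu_leu / uniform_per_mille)  # fold difference from the uniform expectation
```

Working in a single unit avoids comparing the 0.25% expectation directly with table values that are in ‰.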

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.296AlaAla: 2.296 ± 1.234
0.417AlaCys: 0.417 ± 0.41
1.044AlaAsp: 1.044 ± 0.482
2.505AlaGlu: 2.505 ± 0.777
1.252AlaPhe: 1.252 ± 0.479
3.131AlaGly: 3.131 ± 1.049
0.0AlaHis: 0.0 ± 0.0
4.174AlaIle: 4.174 ± 0.811
3.34AlaLys: 3.34 ± 0.423
4.174AlaLeu: 4.174 ± 0.661
1.67AlaMet: 1.67 ± 0.943
3.131AlaAsn: 3.131 ± 0.851
1.044AlaPro: 1.044 ± 0.294
2.087AlaGln: 2.087 ± 0.597
1.044AlaArg: 1.044 ± 0.975
3.34AlaSer: 3.34 ± 0.731
2.505AlaThr: 2.505 ± 0.821
2.087AlaVal: 2.087 ± 0.942
0.0AlaTrp: 0.0 ± 0.0
1.67AlaTyr: 1.67 ± 0.612
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.252CysAsp: 1.252 ± 0.658
0.835CysGlu: 0.835 ± 0.578
0.209CysPhe: 0.209 ± 0.129
1.044CysGly: 1.044 ± 0.597
0.626CysHis: 0.626 ± 0.282
2.087CysIle: 2.087 ± 0.722
2.087CysLys: 2.087 ± 1.033
1.461CysLeu: 1.461 ± 0.367
0.209CysMet: 0.209 ± 0.129
1.879CysAsn: 1.879 ± 0.566
0.626CysPro: 0.626 ± 0.302
0.209CysGln: 0.209 ± 0.129
0.209CysArg: 0.209 ± 0.244
2.296CysSer: 2.296 ± 0.396
0.626CysThr: 0.626 ± 0.446
0.835CysVal: 0.835 ± 0.357
0.209CysTrp: 0.209 ± 0.244
0.626CysTyr: 0.626 ± 0.306
0.0CysXaa: 0.0 ± 0.0
Asp
2.505AspAla: 2.505 ± 0.746
1.044AspCys: 1.044 ± 0.335
2.505AspAsp: 2.505 ± 0.684
3.131AspGlu: 3.131 ± 0.634
0.835AspPhe: 0.835 ± 0.415
0.209AspGly: 0.209 ± 0.302
1.044AspHis: 1.044 ± 0.645
4.801AspIle: 4.801 ± 1.087
2.713AspLys: 2.713 ± 0.862
5.844AspLeu: 5.844 ± 1.492
1.461AspMet: 1.461 ± 0.431
5.009AspAsn: 5.009 ± 1.302
1.879AspPro: 1.879 ± 0.465
1.67AspGln: 1.67 ± 0.685
1.879AspArg: 1.879 ± 0.5
2.087AspSer: 2.087 ± 0.791
3.757AspThr: 3.757 ± 1.561
1.879AspVal: 1.879 ± 0.738
0.417AspTrp: 0.417 ± 0.258
1.67AspTyr: 1.67 ± 0.453
0.0AspXaa: 0.0 ± 0.0
Glu
1.67GluAla: 1.67 ± 0.847
1.044GluCys: 1.044 ± 0.446
2.296GluAsp: 2.296 ± 1.042
2.713GluGlu: 2.713 ± 1.433
2.922GluPhe: 2.922 ± 0.847
2.296GluGly: 2.296 ± 0.7
1.044GluHis: 1.044 ± 0.485
3.966GluIle: 3.966 ± 1.253
4.383GluLys: 4.383 ± 1.252
6.47GluLeu: 6.47 ± 1.593
1.252GluMet: 1.252 ± 0.784
2.296GluAsn: 2.296 ± 0.589
1.461GluPro: 1.461 ± 0.393
1.461GluGln: 1.461 ± 0.627
2.296GluArg: 2.296 ± 0.774
3.966GluSer: 3.966 ± 1.317
2.922GluThr: 2.922 ± 1.224
3.966GluVal: 3.966 ± 0.963
0.417GluTrp: 0.417 ± 0.254
1.879GluTyr: 1.879 ± 0.759
0.0GluXaa: 0.0 ± 0.0
Phe
1.044PheAla: 1.044 ± 0.551
0.626PheCys: 0.626 ± 0.363
1.461PheAsp: 1.461 ± 0.338
1.67PheGlu: 1.67 ± 0.93
0.835PhePhe: 0.835 ± 0.354
0.835PheGly: 0.835 ± 0.36
1.044PheHis: 1.044 ± 0.516
3.34PheIle: 3.34 ± 0.721
1.252PheLys: 1.252 ± 0.64
3.966PheLeu: 3.966 ± 1.103
1.044PheMet: 1.044 ± 0.377
3.34PheAsn: 3.34 ± 0.885
1.879PhePro: 1.879 ± 0.531
1.044PheGln: 1.044 ± 0.379
1.252PheArg: 1.252 ± 0.774
3.757PheSer: 3.757 ± 0.907
1.879PheThr: 1.879 ± 0.86
1.67PheVal: 1.67 ± 0.461
0.209PheTrp: 0.209 ± 0.361
2.296PheTyr: 2.296 ± 0.847
0.0PheXaa: 0.0 ± 0.0
Gly
1.252GlyAla: 1.252 ± 0.57
1.044GlyCys: 1.044 ± 0.362
2.296GlyAsp: 2.296 ± 1.046
2.505GlyGlu: 2.505 ± 0.681
1.879GlyPhe: 1.879 ± 0.555
1.461GlyGly: 1.461 ± 0.568
1.461GlyHis: 1.461 ± 0.525
3.34GlyIle: 3.34 ± 0.805
1.879GlyLys: 1.879 ± 0.589
3.966GlyLeu: 3.966 ± 1.133
1.252GlyMet: 1.252 ± 0.822
1.879GlyAsn: 1.879 ± 0.412
1.252GlyPro: 1.252 ± 0.545
0.626GlyGln: 0.626 ± 0.472
1.252GlyArg: 1.252 ± 0.486
3.34GlySer: 3.34 ± 0.735
1.461GlyThr: 1.461 ± 0.316
2.713GlyVal: 2.713 ± 1.266
0.626GlyTrp: 0.626 ± 0.446
1.044GlyTyr: 1.044 ± 0.363
0.0GlyXaa: 0.0 ± 0.0
His
1.67HisAla: 1.67 ± 0.417
0.417HisCys: 0.417 ± 0.41
0.417HisAsp: 0.417 ± 0.233
0.626HisGlu: 0.626 ± 0.525
0.835HisPhe: 0.835 ± 0.444
1.044HisGly: 1.044 ± 0.373
0.626HisHis: 0.626 ± 0.414
1.044HisIle: 1.044 ± 0.466
3.131HisLys: 3.131 ± 0.897
2.713HisLeu: 2.713 ± 0.663
1.461HisMet: 1.461 ± 0.731
1.252HisAsn: 1.252 ± 0.352
1.252HisPro: 1.252 ± 0.581
0.626HisGln: 0.626 ± 0.465
0.626HisArg: 0.626 ± 0.264
1.461HisSer: 1.461 ± 0.55
2.922HisThr: 2.922 ± 2.02
1.67HisVal: 1.67 ± 0.677
0.835HisTrp: 0.835 ± 0.392
0.417HisTyr: 0.417 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
5.009IleAla: 5.009 ± 0.811
1.67IleCys: 1.67 ± 0.76
4.592IleAsp: 4.592 ± 1.524
4.801IleGlu: 4.801 ± 0.816
2.922IlePhe: 2.922 ± 0.453
1.67IleGly: 1.67 ± 0.401
2.505IleHis: 2.505 ± 0.989
9.81IleIle: 9.81 ± 1.267
8.975IleLys: 8.975 ± 0.915
8.558IleLeu: 8.558 ± 2.727
2.713IleMet: 2.713 ± 0.8
6.888IleAsn: 6.888 ± 1.445
2.505IlePro: 2.505 ± 0.584
1.879IleGln: 1.879 ± 0.506
2.087IleArg: 2.087 ± 0.633
8.558IleSer: 8.558 ± 1.143
8.766IleThr: 8.766 ± 0.927
3.757IleVal: 3.757 ± 1.048
0.835IleTrp: 0.835 ± 0.374
1.67IleTyr: 1.67 ± 0.715
0.0IleXaa: 0.0 ± 0.0
Lys
3.548LysAla: 3.548 ± 0.675
0.835LysCys: 0.835 ± 0.439
5.009LysAsp: 5.009 ± 1.074
4.383LysGlu: 4.383 ± 1.237
4.174LysPhe: 4.174 ± 1.032
3.966LysGly: 3.966 ± 0.771
1.67LysHis: 1.67 ± 0.502
4.383LysIle: 4.383 ± 0.727
6.47LysLys: 6.47 ± 2.031
10.228LysLeu: 10.228 ± 1.981
1.252LysMet: 1.252 ± 0.295
6.888LysAsn: 6.888 ± 2.04
5.427LysPro: 5.427 ± 3.602
2.296LysGln: 2.296 ± 0.524
2.296LysArg: 2.296 ± 0.575
5.427LysSer: 5.427 ± 0.989
5.218LysThr: 5.218 ± 1.519
3.548LysVal: 3.548 ± 0.703
0.417LysTrp: 0.417 ± 0.258
3.34LysTyr: 3.34 ± 0.962
0.0LysXaa: 0.0 ± 0.0
Leu
3.34LeuAla: 3.34 ± 0.809
2.505LeuCys: 2.505 ± 0.386
5.009LeuAsp: 5.009 ± 0.6
6.888LeuGlu: 6.888 ± 1.083
2.505LeuPhe: 2.505 ± 1.153
3.966LeuGly: 3.966 ± 1.303
2.922LeuHis: 2.922 ± 0.579
9.601LeuIle: 9.601 ± 1.821
8.766LeuLys: 8.766 ± 2.486
10.228LeuLeu: 10.228 ± 2.091
2.505LeuMet: 2.505 ± 0.463
8.558LeuAsn: 8.558 ± 1.747
3.548LeuPro: 3.548 ± 1.206
2.087LeuGln: 2.087 ± 0.627
3.757LeuArg: 3.757 ± 1.027
10.645LeuSer: 10.645 ± 2.031
9.81LeuThr: 9.81 ± 0.837
3.131LeuVal: 3.131 ± 0.931
0.417LeuTrp: 0.417 ± 0.258
5.218LeuTyr: 5.218 ± 1.499
0.0LeuXaa: 0.0 ± 0.0
Met
0.835MetAla: 0.835 ± 0.573
0.209MetCys: 0.209 ± 0.129
1.252MetAsp: 1.252 ± 0.429
1.044MetGlu: 1.044 ± 0.419
1.252MetPhe: 1.252 ± 0.642
1.252MetGly: 1.252 ± 0.727
0.417MetHis: 0.417 ± 0.338
2.922MetIle: 2.922 ± 1.193
1.252MetLys: 1.252 ± 0.385
2.505MetLeu: 2.505 ± 1.19
0.417MetMet: 0.417 ± 0.258
1.67MetAsn: 1.67 ± 0.502
1.461MetPro: 1.461 ± 0.441
1.252MetGln: 1.252 ± 0.417
0.626MetArg: 0.626 ± 0.252
2.922MetSer: 2.922 ± 0.585
1.252MetThr: 1.252 ± 0.474
0.417MetVal: 0.417 ± 0.291
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.548AsnAla: 3.548 ± 1.093
1.044AsnCys: 1.044 ± 0.501
4.174AsnAsp: 4.174 ± 1.352
3.131AsnGlu: 3.131 ± 0.71
2.087AsnPhe: 2.087 ± 0.486
3.548AsnGly: 3.548 ± 1.084
3.548AsnHis: 3.548 ± 1.351
8.975AsnIle: 8.975 ± 1.158
7.932AsnLys: 7.932 ± 0.951
5.427AsnLeu: 5.427 ± 2.28
0.0AsnMet: 0.0 ± 0.0
6.053AsnAsn: 6.053 ± 0.74
2.922AsnPro: 2.922 ± 1.72
4.174AsnGln: 4.174 ± 0.74
2.713AsnArg: 2.713 ± 0.969
6.47AsnSer: 6.47 ± 0.858
5.636AsnThr: 5.636 ± 1.806
2.713AsnVal: 2.713 ± 0.809
0.417AsnTrp: 0.417 ± 0.272
3.966AsnTyr: 3.966 ± 0.697
0.0AsnXaa: 0.0 ± 0.0
Pro
2.296ProAla: 2.296 ± 0.517
1.461ProCys: 1.461 ± 0.628
1.461ProAsp: 1.461 ± 0.476
2.087ProGlu: 2.087 ± 1.172
0.835ProPhe: 0.835 ± 0.415
0.209ProGly: 0.209 ± 0.129
0.626ProHis: 0.626 ± 0.363
3.548ProIle: 3.548 ± 0.86
4.592ProLys: 4.592 ± 1.966
2.296ProLeu: 2.296 ± 0.752
1.044ProMet: 1.044 ± 0.494
2.922ProAsn: 2.922 ± 0.749
1.879ProPro: 1.879 ± 0.804
1.044ProGln: 1.044 ± 0.467
1.044ProArg: 1.044 ± 0.573
3.34ProSer: 3.34 ± 1.217
5.427ProThr: 5.427 ± 3.318
1.252ProVal: 1.252 ± 0.579
0.835ProTrp: 0.835 ± 0.516
0.835ProTyr: 0.835 ± 0.424
0.0ProXaa: 0.0 ± 0.0
Gln
1.67GlnAla: 1.67 ± 0.461
0.0GlnCys: 0.0 ± 0.0
1.461GlnAsp: 1.461 ± 0.627
0.626GlnGlu: 0.626 ± 0.264
2.087GlnPhe: 2.087 ± 0.719
0.0GlnGly: 0.0 ± 0.0
1.044GlnHis: 1.044 ± 0.504
2.296GlnIle: 2.296 ± 0.568
1.879GlnLys: 1.879 ± 0.615
2.922GlnLeu: 2.922 ± 1.022
0.626GlnMet: 0.626 ± 0.426
2.713GlnAsn: 2.713 ± 0.725
0.626GlnPro: 0.626 ± 0.414
1.67GlnGln: 1.67 ± 1.192
1.044GlnArg: 1.044 ± 0.517
4.801GlnSer: 4.801 ± 1.115
2.922GlnThr: 2.922 ± 1.931
1.67GlnVal: 1.67 ± 0.744
0.0GlnTrp: 0.0 ± 0.0
1.252GlnTyr: 1.252 ± 0.465
0.0GlnXaa: 0.0 ± 0.0
Arg
1.044ArgAla: 1.044 ± 0.294
0.626ArgCys: 0.626 ± 0.282
2.296ArgAsp: 2.296 ± 1.084
1.67ArgGlu: 1.67 ± 0.421
1.252ArgPhe: 1.252 ± 0.465
2.087ArgGly: 2.087 ± 0.746
0.417ArgHis: 0.417 ± 0.233
2.087ArgIle: 2.087 ± 0.886
1.67ArgLys: 1.67 ± 0.387
4.383ArgLeu: 4.383 ± 0.963
0.417ArgMet: 0.417 ± 0.258
1.879ArgAsn: 1.879 ± 0.576
0.626ArgPro: 0.626 ± 0.344
1.879ArgGln: 1.879 ± 0.433
1.879ArgArg: 1.879 ± 0.799
2.296ArgSer: 2.296 ± 0.688
2.296ArgThr: 2.296 ± 0.845
2.505ArgVal: 2.505 ± 0.441
0.626ArgTrp: 0.626 ± 0.274
1.461ArgTyr: 1.461 ± 0.701
0.0ArgXaa: 0.0 ± 0.0
Ser
3.966SerAla: 3.966 ± 1.063
1.044SerCys: 1.044 ± 0.413
3.548SerAsp: 3.548 ± 1.024
4.801SerGlu: 4.801 ± 0.977
1.67SerPhe: 1.67 ± 0.443
3.131SerGly: 3.131 ± 0.701
1.252SerHis: 1.252 ± 0.437
8.14SerIle: 8.14 ± 1.158
7.097SerLys: 7.097 ± 0.851
11.689SerLeu: 11.689 ± 2.231
2.296SerMet: 2.296 ± 0.72
6.888SerAsn: 6.888 ± 1.6
2.505SerPro: 2.505 ± 0.811
1.461SerGln: 1.461 ± 0.656
3.131SerArg: 3.131 ± 0.786
6.053SerSer: 6.053 ± 1.215
8.14SerThr: 8.14 ± 3.753
5.009SerVal: 5.009 ± 1.156
0.626SerTrp: 0.626 ± 0.387
3.548SerTyr: 3.548 ± 0.97
0.0SerXaa: 0.0 ± 0.0
Thr
2.922ThrAla: 2.922 ± 1.119
1.044ThrCys: 1.044 ± 0.541
3.131ThrAsp: 3.131 ± 0.955
3.966ThrGlu: 3.966 ± 0.504
2.296ThrPhe: 2.296 ± 0.884
2.296ThrGly: 2.296 ± 0.623
1.67ThrHis: 1.67 ± 0.529
7.305ThrIle: 7.305 ± 1.332
7.305ThrLys: 7.305 ± 3.025
6.262ThrLeu: 6.262 ± 1.777
1.461ThrMet: 1.461 ± 0.557
6.679ThrAsn: 6.679 ± 2.248
4.592ThrPro: 4.592 ± 2.221
3.548ThrGln: 3.548 ± 2.072
1.252ThrArg: 1.252 ± 0.347
9.184ThrSer: 9.184 ± 3.069
13.15ThrThr: 13.15 ± 7.511
3.548ThrVal: 3.548 ± 1.249
0.626ThrTrp: 0.626 ± 0.31
3.757ThrTyr: 3.757 ± 1.004
0.0ThrXaa: 0.0 ± 0.0
Val
0.835ValAla: 0.835 ± 0.581
1.252ValCys: 1.252 ± 0.618
1.67ValAsp: 1.67 ± 0.492
1.67ValGlu: 1.67 ± 0.763
2.922ValPhe: 2.922 ± 0.499
1.879ValGly: 1.879 ± 0.457
0.626ValHis: 0.626 ± 0.274
3.757ValIle: 3.757 ± 1.185
2.922ValLys: 2.922 ± 1.176
6.888ValLeu: 6.888 ± 1.001
0.417ValMet: 0.417 ± 0.389
4.801ValAsn: 4.801 ± 1.196
1.461ValPro: 1.461 ± 1.184
2.087ValGln: 2.087 ± 0.788
2.087ValArg: 2.087 ± 0.546
3.966ValSer: 3.966 ± 2.155
3.757ValThr: 3.757 ± 1.136
3.757ValVal: 3.757 ± 0.945
0.0ValTrp: 0.0 ± 0.0
2.087ValTyr: 2.087 ± 0.856
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.209TrpCys: 0.209 ± 0.129
0.209TrpAsp: 0.209 ± 0.297
0.209TrpGlu: 0.209 ± 0.253
0.417TrpPhe: 0.417 ± 0.258
0.417TrpGly: 0.417 ± 0.233
0.209TrpHis: 0.209 ± 0.129
0.835TrpIle: 0.835 ± 0.516
0.835TrpLys: 0.835 ± 0.322
1.044TrpLeu: 1.044 ± 0.645
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.626TrpPro: 0.626 ± 0.471
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.835TrpSer: 0.835 ± 0.516
0.626TrpThr: 0.626 ± 0.295
0.835TrpVal: 0.835 ± 0.42
0.0TrpTrp: 0.0 ± 0.0
0.417TrpTyr: 0.417 ± 0.488
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.67TyrAla: 1.67 ± 0.751
0.835TyrCys: 0.835 ± 0.439
1.252TyrAsp: 1.252 ± 0.855
1.461TyrGlu: 1.461 ± 0.475
1.252TyrPhe: 1.252 ± 0.431
2.087TyrGly: 2.087 ± 0.591
2.296TyrHis: 2.296 ± 0.637
3.757TyrIle: 3.757 ± 1.128
2.296TyrLys: 2.296 ± 0.536
4.383TyrLeu: 4.383 ± 1.434
1.252TyrMet: 1.252 ± 0.476
3.757TyrAsn: 3.757 ± 1.676
1.461TyrPro: 1.461 ± 0.617
0.209TyrGln: 0.209 ± 0.244
2.922TyrArg: 2.922 ± 1.076
1.252TyrSer: 1.252 ± 0.371
2.922TyrThr: 2.922 ± 0.591
1.879TyrVal: 1.879 ± 0.948
0.209TyrTrp: 0.209 ± 0.129
1.461TyrTyr: 1.461 ± 0.524
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (4792 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
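One way such an error estimate could be obtained is sketched below in Python. This is an assumption about the procedure, not the database's actual code: each of 100 rounds resamples whole proteins with replacement, and the error is taken as the standard deviation of the dipeptide frequency across rounds. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate mean and spread of one dipeptide frequency (per mille).

    Assumption: each round draws len(sequences) proteins with replacement,
    and the error is the standard deviation across rounds.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


# Toy sequences, not taken from this proteome.
toy = ["MKLLA", "LLKTT", "ATTSL"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so with only a few proteins a single unusual one can produce a large error.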

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski