Amino acid dipepetide frequency for Nkolbisson virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.381AlaAla: 3.381 ± 1.493
0.564AlaCys: 0.564 ± 0.284
3.663AlaAsp: 3.663 ± 0.723
3.381AlaGlu: 3.381 ± 0.595
0.845AlaPhe: 0.845 ± 0.367
3.663AlaGly: 3.663 ± 1.602
0.564AlaHis: 0.564 ± 0.284
2.818AlaIle: 2.818 ± 0.865
3.663AlaLys: 3.663 ± 1.919
3.099AlaLeu: 3.099 ± 0.77
1.127AlaMet: 1.127 ± 0.611
2.254AlaAsn: 2.254 ± 0.744
1.972AlaPro: 1.972 ± 0.432
2.254AlaGln: 2.254 ± 0.549
2.536AlaArg: 2.536 ± 0.587
3.663AlaSer: 3.663 ± 0.797
3.099AlaThr: 3.099 ± 0.601
2.254AlaVal: 2.254 ± 1.826
0.564AlaTrp: 0.564 ± 0.341
1.972AlaTyr: 1.972 ± 0.63
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.282CysCys: 0.282 ± 0.153
0.845CysAsp: 0.845 ± 0.726
2.254CysGlu: 2.254 ± 0.537
1.127CysPhe: 1.127 ± 0.412
0.845CysGly: 0.845 ± 0.296
0.845CysHis: 0.845 ± 0.367
0.564CysIle: 0.564 ± 0.391
1.691CysLys: 1.691 ± 0.541
2.818CysLeu: 2.818 ± 0.87
0.282CysMet: 0.282 ± 0.399
1.691CysAsn: 1.691 ± 0.623
1.409CysPro: 1.409 ± 0.913
1.127CysGln: 1.127 ± 0.567
0.564CysArg: 0.564 ± 0.341
1.409CysSer: 1.409 ± 0.763
1.972CysThr: 1.972 ± 0.771
0.564CysVal: 0.564 ± 0.284
0.564CysTrp: 0.564 ± 0.305
0.282CysTyr: 0.282 ± 0.347
0.0CysXaa: 0.0 ± 0.0
Asp
1.127AspAla: 1.127 ± 0.567
1.409AspCys: 1.409 ± 0.531
3.099AspAsp: 3.099 ± 0.43
5.917AspGlu: 5.917 ± 3.202
3.381AspPhe: 3.381 ± 0.848
4.508AspGly: 4.508 ± 0.926
2.254AspHis: 2.254 ± 0.427
2.818AspIle: 2.818 ± 0.835
3.099AspLys: 3.099 ± 1.009
5.635AspLeu: 5.635 ± 0.751
2.536AspMet: 2.536 ± 0.503
1.409AspAsn: 1.409 ± 0.755
3.663AspPro: 3.663 ± 0.578
1.127AspGln: 1.127 ± 0.75
2.818AspArg: 2.818 ± 0.556
3.663AspSer: 3.663 ± 0.316
4.227AspThr: 4.227 ± 1.207
5.354AspVal: 5.354 ± 0.671
1.409AspTrp: 1.409 ± 0.517
3.099AspTyr: 3.099 ± 0.895
0.0AspXaa: 0.0 ± 0.0
Glu
3.099GluAla: 3.099 ± 1.902
0.564GluCys: 0.564 ± 0.341
5.635GluAsp: 5.635 ± 1.596
5.072GluGlu: 5.072 ± 4.014
2.536GluPhe: 2.536 ± 1.0
5.072GluGly: 5.072 ± 1.463
1.972GluHis: 1.972 ± 0.715
5.354GluIle: 5.354 ± 1.11
4.79GluLys: 4.79 ± 1.553
7.608GluLeu: 7.608 ± 3.361
1.127GluMet: 1.127 ± 0.567
3.381GluAsn: 3.381 ± 1.201
1.691GluPro: 1.691 ± 0.598
1.972GluGln: 1.972 ± 0.789
1.972GluArg: 1.972 ± 0.508
3.945GluSer: 3.945 ± 0.5
2.818GluThr: 2.818 ± 1.592
7.326GluVal: 7.326 ± 1.021
1.409GluTrp: 1.409 ± 0.25
4.508GluTyr: 4.508 ± 0.877
0.0GluXaa: 0.0 ± 0.0
Phe
1.691PheAla: 1.691 ± 0.404
1.409PheCys: 1.409 ± 0.853
3.099PheAsp: 3.099 ± 1.352
2.536PheGlu: 2.536 ± 0.303
1.972PhePhe: 1.972 ± 0.775
2.536PheGly: 2.536 ± 0.554
0.564PheHis: 0.564 ± 0.798
1.691PheIle: 1.691 ± 0.31
3.099PheLys: 3.099 ± 0.982
3.381PheLeu: 3.381 ± 1.245
0.564PheMet: 0.564 ± 0.341
1.409PheAsn: 1.409 ± 0.517
1.691PhePro: 1.691 ± 0.541
2.254PheGln: 2.254 ± 0.607
2.818PheArg: 2.818 ± 0.718
3.381PheSer: 3.381 ± 0.916
1.127PheThr: 1.127 ± 0.517
2.818PheVal: 2.818 ± 0.5
0.282PheTrp: 0.282 ± 0.153
0.845PheTyr: 0.845 ± 0.346
0.0PheXaa: 0.0 ± 0.0
Gly
3.099GlyAla: 3.099 ± 0.9
1.409GlyCys: 1.409 ± 0.559
3.099GlyAsp: 3.099 ± 0.554
3.381GlyGlu: 3.381 ± 0.703
2.254GlyPhe: 2.254 ± 0.607
4.227GlyGly: 4.227 ± 1.141
0.845GlyHis: 0.845 ± 0.296
3.945GlyIle: 3.945 ± 0.962
3.381GlyLys: 3.381 ± 0.917
9.862GlyLeu: 9.862 ± 1.708
1.127GlyMet: 1.127 ± 0.413
1.691GlyAsn: 1.691 ± 0.789
3.381GlyPro: 3.381 ± 0.758
2.254GlyGln: 2.254 ± 0.551
4.227GlyArg: 4.227 ± 1.16
4.508GlySer: 4.508 ± 0.569
2.818GlyThr: 2.818 ± 1.05
1.972GlyVal: 1.972 ± 0.746
1.127GlyTrp: 1.127 ± 0.783
1.409GlyTyr: 1.409 ± 0.559
0.0GlyXaa: 0.0 ± 0.0
His
2.254HisAla: 2.254 ± 1.14
0.282HisCys: 0.282 ± 0.153
1.691HisAsp: 1.691 ± 0.851
1.691HisGlu: 1.691 ± 0.393
0.564HisPhe: 0.564 ± 0.391
0.564HisGly: 0.564 ± 0.284
1.409HisHis: 1.409 ± 0.342
2.818HisIle: 2.818 ± 0.539
1.972HisLys: 1.972 ± 1.061
1.972HisLeu: 1.972 ± 0.715
0.845HisMet: 0.845 ± 0.549
1.691HisAsn: 1.691 ± 0.623
1.972HisPro: 1.972 ± 0.408
1.127HisGln: 1.127 ± 0.426
1.127HisArg: 1.127 ± 0.375
1.691HisSer: 1.691 ± 0.31
0.282HisThr: 0.282 ± 0.153
1.127HisVal: 1.127 ± 0.355
0.282HisTrp: 0.282 ± 0.153
1.691HisTyr: 1.691 ± 0.637
0.0HisXaa: 0.0 ± 0.0
Ile
2.254IleAla: 2.254 ± 1.185
1.972IleCys: 1.972 ± 0.587
3.663IleAsp: 3.663 ± 0.948
3.099IleGlu: 3.099 ± 1.357
1.972IlePhe: 1.972 ± 0.25
3.663IleGly: 3.663 ± 0.985
1.127IleHis: 1.127 ± 0.426
5.072IleIle: 5.072 ± 0.951
4.508IleLys: 4.508 ± 0.732
6.199IleLeu: 6.199 ± 1.277
0.845IleMet: 0.845 ± 0.458
2.536IleAsn: 2.536 ± 1.0
3.945IlePro: 3.945 ± 0.757
1.972IleGln: 1.972 ± 0.408
5.635IleArg: 5.635 ± 1.452
4.79IleSer: 4.79 ± 0.907
3.381IleThr: 3.381 ± 0.457
3.945IleVal: 3.945 ± 0.385
0.845IleTrp: 0.845 ± 0.427
2.536IleTyr: 2.536 ± 0.473
0.0IleXaa: 0.0 ± 0.0
Lys
3.099LysAla: 3.099 ± 0.705
1.127LysCys: 1.127 ± 0.492
5.072LysAsp: 5.072 ± 0.639
4.508LysGlu: 4.508 ± 0.842
1.691LysPhe: 1.691 ± 0.841
4.79LysGly: 4.79 ± 0.908
2.254LysHis: 2.254 ± 0.34
2.818LysIle: 2.818 ± 0.53
3.945LysLys: 3.945 ± 0.841
5.917LysLeu: 5.917 ± 1.314
1.691LysMet: 1.691 ± 0.641
1.972LysAsn: 1.972 ± 0.432
2.536LysPro: 2.536 ± 0.503
1.691LysGln: 1.691 ± 0.773
4.508LysArg: 4.508 ± 0.695
4.79LysSer: 4.79 ± 0.374
3.099LysThr: 3.099 ± 0.359
4.79LysVal: 4.79 ± 1.353
2.536LysTrp: 2.536 ± 1.116
1.127LysTyr: 1.127 ± 0.776
0.0LysXaa: 0.0 ± 0.0
Leu
5.354LeuAla: 5.354 ± 0.557
2.254LeuCys: 2.254 ± 0.985
5.917LeuAsp: 5.917 ± 1.447
10.425LeuGlu: 10.425 ± 3.473
3.945LeuPhe: 3.945 ± 1.395
6.199LeuGly: 6.199 ± 1.263
3.381LeuHis: 3.381 ± 1.951
4.508LeuIle: 4.508 ± 1.291
5.072LeuLys: 5.072 ± 1.404
9.298LeuLeu: 9.298 ± 1.723
2.254LeuMet: 2.254 ± 0.607
6.199LeuAsn: 6.199 ± 0.877
3.945LeuPro: 3.945 ± 1.509
1.972LeuGln: 1.972 ± 0.421
7.89LeuArg: 7.89 ± 0.642
8.735LeuSer: 8.735 ± 1.47
4.79LeuThr: 4.79 ± 1.548
4.227LeuVal: 4.227 ± 1.011
0.282LeuTrp: 0.282 ± 0.399
3.099LeuTyr: 3.099 ± 0.804
0.0LeuXaa: 0.0 ± 0.0
Met
1.972MetAla: 1.972 ± 0.52
0.845MetCys: 0.845 ± 0.346
1.409MetAsp: 1.409 ± 0.763
2.536MetGlu: 2.536 ± 0.69
1.691MetPhe: 1.691 ± 0.682
1.409MetGly: 1.409 ± 0.763
0.282MetHis: 0.282 ± 0.153
1.972MetIle: 1.972 ± 0.725
0.564MetLys: 0.564 ± 0.284
0.845MetLeu: 0.845 ± 0.357
0.564MetMet: 0.564 ± 0.491
0.845MetAsn: 0.845 ± 0.824
0.564MetPro: 0.564 ± 0.64
0.564MetGln: 0.564 ± 0.284
1.691MetArg: 1.691 ± 0.514
1.691MetSer: 1.691 ± 0.598
1.409MetThr: 1.409 ± 0.763
1.691MetVal: 1.691 ± 0.773
1.409MetTrp: 1.409 ± 0.853
0.282MetTyr: 0.282 ± 0.399
0.0MetXaa: 0.0 ± 0.0
Asn
2.254AsnAla: 2.254 ± 0.855
0.282AsnCys: 0.282 ± 0.347
2.536AsnAsp: 2.536 ± 0.554
1.409AsnGlu: 1.409 ± 0.487
2.254AsnPhe: 2.254 ± 0.687
1.691AsnGly: 1.691 ± 0.623
2.536AsnHis: 2.536 ± 0.587
3.099AsnIle: 3.099 ± 0.806
2.818AsnLys: 2.818 ± 0.809
3.663AsnLeu: 3.663 ± 0.705
1.127AsnMet: 1.127 ± 0.611
1.972AsnAsn: 1.972 ± 0.421
2.254AsnPro: 2.254 ± 0.985
2.818AsnGln: 2.818 ± 0.422
1.409AsnArg: 1.409 ± 0.393
2.818AsnSer: 2.818 ± 0.87
2.818AsnThr: 2.818 ± 0.358
1.972AsnVal: 1.972 ± 0.52
1.127AsnTrp: 1.127 ± 0.375
1.127AsnTyr: 1.127 ± 0.426
0.0AsnXaa: 0.0 ± 0.0
Pro
1.691ProAla: 1.691 ± 0.418
0.564ProCys: 0.564 ± 0.305
5.072ProAsp: 5.072 ± 2.054
3.381ProGlu: 3.381 ± 0.891
0.845ProPhe: 0.845 ± 0.346
2.536ProGly: 2.536 ± 1.4
1.127ProHis: 1.127 ± 0.375
2.818ProIle: 2.818 ± 0.877
2.536ProLys: 2.536 ± 0.887
5.072ProLeu: 5.072 ± 1.05
1.691ProMet: 1.691 ± 1.785
0.845ProAsn: 0.845 ± 0.346
2.818ProPro: 2.818 ± 2.263
1.127ProGln: 1.127 ± 0.606
1.691ProArg: 1.691 ± 0.31
2.536ProSer: 2.536 ± 0.612
3.099ProThr: 3.099 ± 1.354
3.099ProVal: 3.099 ± 0.651
0.564ProTrp: 0.564 ± 0.305
2.536ProTyr: 2.536 ± 1.254
0.0ProXaa: 0.0 ± 0.0
Gln
1.691GlnAla: 1.691 ± 0.442
0.0GlnCys: 0.0 ± 0.0
2.536GlnAsp: 2.536 ± 0.75
1.972GlnGlu: 1.972 ± 0.746
0.564GlnPhe: 0.564 ± 0.284
1.972GlnGly: 1.972 ± 0.762
1.127GlnHis: 1.127 ± 0.627
3.663GlnIle: 3.663 ± 0.592
1.972GlnLys: 1.972 ± 0.408
3.099GlnLeu: 3.099 ± 0.595
0.564GlnMet: 0.564 ± 0.341
0.564GlnAsn: 0.564 ± 0.305
1.691GlnPro: 1.691 ± 0.89
1.127GlnGln: 1.127 ± 0.606
1.409GlnArg: 1.409 ± 0.482
2.536GlnSer: 2.536 ± 0.597
1.409GlnThr: 1.409 ± 0.491
1.691GlnVal: 1.691 ± 0.916
0.282GlnTrp: 0.282 ± 0.347
1.409GlnTyr: 1.409 ± 0.67
0.0GlnXaa: 0.0 ± 0.0
Arg
1.972ArgAla: 1.972 ± 0.52
2.254ArgCys: 2.254 ± 0.415
1.691ArgAsp: 1.691 ± 0.641
3.945ArgGlu: 3.945 ± 1.108
3.663ArgPhe: 3.663 ± 0.953
3.663ArgGly: 3.663 ± 0.642
1.127ArgHis: 1.127 ± 0.611
3.099ArgIle: 3.099 ± 0.713
4.79ArgLys: 4.79 ± 0.511
5.072ArgLeu: 5.072 ± 0.609
2.536ArgMet: 2.536 ± 0.804
2.818ArgAsn: 2.818 ± 0.877
2.818ArgPro: 2.818 ± 0.75
2.536ArgGln: 2.536 ± 1.292
3.099ArgArg: 3.099 ± 0.651
4.508ArgSer: 4.508 ± 1.103
2.536ArgThr: 2.536 ± 1.053
3.099ArgVal: 3.099 ± 1.155
0.282ArgTrp: 0.282 ± 0.153
1.972ArgTyr: 1.972 ± 0.959
0.0ArgXaa: 0.0 ± 0.0
Ser
3.381SerAla: 3.381 ± 1.05
1.972SerCys: 1.972 ± 0.658
3.945SerAsp: 3.945 ± 0.606
5.635SerGlu: 5.635 ± 0.874
3.099SerPhe: 3.099 ± 0.293
2.536SerGly: 2.536 ± 0.363
1.409SerHis: 1.409 ± 0.491
5.354SerIle: 5.354 ± 1.65
6.199SerLys: 6.199 ± 0.91
8.171SerLeu: 8.171 ± 2.224
0.845SerMet: 0.845 ± 0.408
1.972SerAsn: 1.972 ± 0.421
2.254SerPro: 2.254 ± 0.687
0.845SerGln: 0.845 ± 0.458
5.635SerArg: 5.635 ± 0.818
6.481SerSer: 6.481 ± 1.982
3.381SerThr: 3.381 ± 0.892
4.508SerVal: 4.508 ± 0.848
2.254SerTrp: 2.254 ± 0.906
1.972SerTyr: 1.972 ± 0.775
0.0SerXaa: 0.0 ± 0.0
Thr
1.691ThrAla: 1.691 ± 0.404
0.845ThrCys: 0.845 ± 0.876
2.254ThrAsp: 2.254 ± 0.581
3.663ThrGlu: 3.663 ± 0.537
1.409ThrPhe: 1.409 ± 0.618
3.381ThrGly: 3.381 ± 0.621
1.409ThrHis: 1.409 ± 0.491
4.227ThrIle: 4.227 ± 0.976
1.972ThrLys: 1.972 ± 1.064
5.917ThrLeu: 5.917 ± 1.663
1.409ThrMet: 1.409 ± 0.404
2.818ThrAsn: 2.818 ± 0.533
2.536ThrPro: 2.536 ± 0.441
1.409ThrGln: 1.409 ± 0.763
3.099ThrArg: 3.099 ± 0.809
4.227ThrSer: 4.227 ± 0.789
4.508ThrThr: 4.508 ± 1.031
3.663ThrVal: 3.663 ± 0.523
1.127ThrTrp: 1.127 ± 0.355
1.691ThrTyr: 1.691 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
4.508ValAla: 4.508 ± 1.849
1.409ValCys: 1.409 ± 0.491
3.945ValAsp: 3.945 ± 1.061
5.072ValGlu: 5.072 ± 1.099
1.972ValPhe: 1.972 ± 1.069
3.663ValGly: 3.663 ± 1.137
1.972ValHis: 1.972 ± 0.675
4.79ValIle: 4.79 ± 1.097
3.663ValLys: 3.663 ± 1.165
5.635ValLeu: 5.635 ± 1.121
1.972ValMet: 1.972 ± 0.421
3.099ValAsn: 3.099 ± 0.954
2.254ValPro: 2.254 ± 0.429
1.127ValGln: 1.127 ± 0.275
1.691ValArg: 1.691 ± 0.916
2.536ValSer: 2.536 ± 0.839
4.508ValThr: 4.508 ± 0.243
5.917ValVal: 5.917 ± 0.756
0.845ValTrp: 0.845 ± 0.427
2.536ValTyr: 2.536 ± 1.038
0.0ValXaa: 0.0 ± 0.0
Trp
0.845TrpAla: 0.845 ± 0.296
0.0TrpCys: 0.0 ± 0.0
1.127TrpAsp: 1.127 ± 0.375
0.845TrpGlu: 0.845 ± 0.458
1.691TrpPhe: 1.691 ± 0.734
1.409TrpGly: 1.409 ± 0.763
0.282TrpHis: 0.282 ± 0.153
0.845TrpIle: 0.845 ± 0.367
1.409TrpLys: 1.409 ± 0.25
1.409TrpLeu: 1.409 ± 0.45
0.564TrpMet: 0.564 ± 0.341
1.972TrpAsn: 1.972 ± 0.762
0.564TrpPro: 0.564 ± 0.305
0.282TrpGln: 0.282 ± 0.4
0.282TrpArg: 0.282 ± 0.153
1.691TrpSer: 1.691 ± 0.637
0.564TrpThr: 0.564 ± 0.284
1.691TrpVal: 1.691 ± 0.919
0.282TrpTrp: 0.282 ± 0.153
0.845TrpTyr: 0.845 ± 0.53
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.972TyrAla: 1.972 ± 1.061
1.691TyrCys: 1.691 ± 0.655
2.254TyrAsp: 2.254 ± 0.879
1.127TyrGlu: 1.127 ± 0.412
1.691TyrPhe: 1.691 ± 0.393
1.691TyrGly: 1.691 ± 0.692
0.564TyrHis: 0.564 ± 0.284
1.691TyrIle: 1.691 ± 0.56
2.818TyrLys: 2.818 ± 1.25
5.072TyrLeu: 5.072 ± 0.728
0.282TyrMet: 0.282 ± 0.399
0.845TyrAsn: 0.845 ± 0.458
1.691TyrPro: 1.691 ± 0.56
1.691TyrGln: 1.691 ± 0.418
3.381TyrArg: 3.381 ± 0.448
2.536TyrSer: 2.536 ± 0.305
1.409TyrThr: 1.409 ± 1.144
1.409TyrVal: 1.409 ± 0.404
1.127TyrTrp: 1.127 ± 0.567
0.845TyrTyr: 0.845 ± 0.458
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3550 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski