Amino acid dipepetide frequency for Umbre virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.175AlaAla: 4.175 ± 2.288
1.228AlaCys: 1.228 ± 0.526
2.947AlaAsp: 2.947 ± 2.225
2.701AlaGlu: 2.701 ± 1.784
1.719AlaPhe: 1.719 ± 1.549
3.193AlaGly: 3.193 ± 0.593
1.719AlaHis: 1.719 ± 0.635
3.193AlaIle: 3.193 ± 1.193
5.403AlaLys: 5.403 ± 5.545
5.157AlaLeu: 5.157 ± 1.703
1.719AlaMet: 1.719 ± 1.536
4.912AlaAsn: 4.912 ± 1.643
0.491AlaPro: 0.491 ± 0.324
1.965AlaGln: 1.965 ± 0.697
1.965AlaArg: 1.965 ± 0.776
3.929AlaSer: 3.929 ± 1.123
3.929AlaThr: 3.929 ± 1.558
3.929AlaVal: 3.929 ± 1.301
0.246AlaTrp: 0.246 ± 0.162
2.456AlaTyr: 2.456 ± 0.807
0.0AlaXaa: 0.0 ± 0.0
Cys
1.719CysAla: 1.719 ± 0.567
0.246CysCys: 0.246 ± 0.162
1.473CysAsp: 1.473 ± 0.729
1.719CysGlu: 1.719 ± 0.694
1.228CysPhe: 1.228 ± 0.526
1.965CysGly: 1.965 ± 1.773
0.491CysHis: 0.491 ± 0.174
2.456CysIle: 2.456 ± 0.871
2.701CysLys: 2.701 ± 2.438
2.21CysLeu: 2.21 ± 0.865
0.982CysMet: 0.982 ± 0.388
1.719CysAsn: 1.719 ± 0.567
1.719CysPro: 1.719 ± 0.567
1.473CysGln: 1.473 ± 0.523
0.491CysArg: 0.491 ± 0.443
0.982CysSer: 0.982 ± 0.577
1.965CysThr: 1.965 ± 1.456
1.965CysVal: 1.965 ± 0.888
0.0CysTrp: 0.0 ± 0.0
0.982CysTyr: 0.982 ± 0.388
0.0CysXaa: 0.0 ± 0.0
Asp
3.929AspAla: 3.929 ± 0.586
0.491AspCys: 0.491 ± 0.324
1.965AspAsp: 1.965 ± 0.531
2.947AspGlu: 2.947 ± 1.33
4.666AspPhe: 4.666 ± 1.642
1.965AspGly: 1.965 ± 0.651
0.491AspHis: 0.491 ± 0.443
5.157AspIle: 5.157 ± 1.308
3.438AspLys: 3.438 ± 1.089
4.175AspLeu: 4.175 ± 1.372
1.719AspMet: 1.719 ± 0.659
3.438AspAsn: 3.438 ± 1.476
1.719AspPro: 1.719 ± 0.852
3.438AspGln: 3.438 ± 1.153
1.473AspArg: 1.473 ± 1.509
3.438AspSer: 3.438 ± 0.406
3.684AspThr: 3.684 ± 1.999
3.193AspVal: 3.193 ± 0.593
0.491AspTrp: 0.491 ± 0.443
2.947AspTyr: 2.947 ± 1.164
0.0AspXaa: 0.0 ± 0.0
Glu
2.947GluAla: 2.947 ± 0.444
0.982GluCys: 0.982 ± 0.577
5.648GluAsp: 5.648 ± 1.047
4.42GluGlu: 4.42 ± 1.519
2.701GluPhe: 2.701 ± 0.48
0.982GluGly: 0.982 ± 0.348
1.965GluHis: 1.965 ± 1.286
5.648GluIle: 5.648 ± 1.617
3.929GluLys: 3.929 ± 1.772
6.631GluLeu: 6.631 ± 1.075
2.947GluMet: 2.947 ± 1.33
3.193GluAsn: 3.193 ± 1.152
1.965GluPro: 1.965 ± 1.011
2.701GluGln: 2.701 ± 1.784
3.438GluArg: 3.438 ± 0.554
3.929GluSer: 3.929 ± 1.167
4.42GluThr: 4.42 ± 0.864
4.175GluVal: 4.175 ± 1.914
0.491GluTrp: 0.491 ± 0.877
3.193GluTyr: 3.193 ± 0.393
0.0GluXaa: 0.0 ± 0.0
Phe
2.947PheAla: 2.947 ± 1.013
1.228PheCys: 1.228 ± 0.403
1.719PheAsp: 1.719 ± 0.635
3.684PheGlu: 3.684 ± 0.478
1.719PhePhe: 1.719 ± 0.689
1.965PheGly: 1.965 ± 0.669
0.982PheHis: 0.982 ± 0.348
4.175PheIle: 4.175 ± 0.914
5.403PheLys: 5.403 ± 0.927
4.912PheLeu: 4.912 ± 1.729
1.719PheMet: 1.719 ± 0.604
2.21PheAsn: 2.21 ± 0.924
0.737PhePro: 0.737 ± 0.831
1.719PheGln: 1.719 ± 0.595
1.473PheArg: 1.473 ± 0.694
3.929PheSer: 3.929 ± 1.141
4.42PheThr: 4.42 ± 1.656
1.965PheVal: 1.965 ± 0.531
0.246PheTrp: 0.246 ± 0.162
0.737PheTyr: 0.737 ± 0.486
0.0PheXaa: 0.0 ± 0.0
Gly
0.982GlyAla: 0.982 ± 0.803
2.456GlyCys: 2.456 ± 1.052
2.947GlyAsp: 2.947 ± 0.379
3.684GlyGlu: 3.684 ± 1.013
2.21GlyPhe: 2.21 ± 0.541
1.473GlyGly: 1.473 ± 0.694
0.491GlyHis: 0.491 ± 0.324
2.947GlyIle: 2.947 ± 0.379
3.193GlyLys: 3.193 ± 1.137
3.438GlyLeu: 3.438 ± 1.089
0.0GlyMet: 0.0 ± 0.0
1.965GlyAsn: 1.965 ± 0.651
1.228GlyPro: 1.228 ± 0.69
2.947GlyGln: 2.947 ± 1.014
1.965GlyArg: 1.965 ± 0.651
2.456GlySer: 2.456 ± 1.052
2.701GlyThr: 2.701 ± 2.063
1.473GlyVal: 1.473 ± 0.599
0.982GlyTrp: 0.982 ± 0.577
1.719GlyTyr: 1.719 ± 0.595
0.0GlyXaa: 0.0 ± 0.0
His
0.982HisAla: 0.982 ± 0.348
1.719HisCys: 1.719 ± 1.235
1.228HisAsp: 1.228 ± 0.795
1.228HisGlu: 1.228 ± 0.403
0.982HisPhe: 0.982 ± 0.388
0.491HisGly: 0.491 ± 0.324
0.737HisHis: 0.737 ± 0.364
1.228HisIle: 1.228 ± 0.811
2.21HisLys: 2.21 ± 0.76
2.21HisLeu: 2.21 ± 0.836
0.491HisMet: 0.491 ± 0.174
0.246HisAsn: 0.246 ± 0.222
0.982HisPro: 0.982 ± 0.388
1.228HisGln: 1.228 ± 0.795
1.473HisArg: 1.473 ± 1.623
1.965HisSer: 1.965 ± 1.4
1.473HisThr: 1.473 ± 0.523
1.473HisVal: 1.473 ± 0.506
0.246HisTrp: 0.246 ± 0.162
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.929IleAla: 3.929 ± 0.825
3.684IleCys: 3.684 ± 1.381
4.912IleAsp: 4.912 ± 0.771
3.684IleGlu: 3.684 ± 1.313
4.175IlePhe: 4.175 ± 0.716
2.947IleGly: 2.947 ± 0.575
1.473IleHis: 1.473 ± 0.523
3.684IleIle: 3.684 ± 1.821
6.631IleLys: 6.631 ± 1.524
7.613IleLeu: 7.613 ± 1.736
1.965IleMet: 1.965 ± 0.888
3.929IleAsn: 3.929 ± 1.123
2.947IlePro: 2.947 ± 0.969
2.701IleGln: 2.701 ± 1.036
3.193IleArg: 3.193 ± 1.562
5.894IleSer: 5.894 ± 1.048
6.385IleThr: 6.385 ± 4.148
3.438IleVal: 3.438 ± 0.554
0.491IleTrp: 0.491 ± 0.324
2.701IleTyr: 2.701 ± 0.48
0.0IleXaa: 0.0 ± 0.0
Lys
4.175LysAla: 4.175 ± 2.859
2.947LysCys: 2.947 ± 1.457
4.912LysAsp: 4.912 ± 0.774
5.403LysGlu: 5.403 ± 2.235
3.438LysPhe: 3.438 ± 1.27
3.684LysGly: 3.684 ± 0.478
2.456LysHis: 2.456 ± 0.885
6.385LysIle: 6.385 ± 1.199
7.367LysLys: 7.367 ± 0.892
8.35LysLeu: 8.35 ± 2.405
2.456LysMet: 2.456 ± 2.112
3.193LysAsn: 3.193 ± 1.137
1.965LysPro: 1.965 ± 0.669
2.701LysGln: 2.701 ± 1.021
2.21LysArg: 2.21 ± 0.568
4.666LysSer: 4.666 ± 0.685
5.403LysThr: 5.403 ± 0.75
5.157LysVal: 5.157 ± 1.095
0.737LysTrp: 0.737 ± 0.253
4.175LysTyr: 4.175 ± 1.205
0.0LysXaa: 0.0 ± 0.0
Leu
4.912LeuAla: 4.912 ± 1.325
2.456LeuCys: 2.456 ± 1.303
5.157LeuAsp: 5.157 ± 1.476
6.139LeuGlu: 6.139 ± 2.372
3.193LeuPhe: 3.193 ± 1.311
2.947LeuGly: 2.947 ± 0.701
2.947LeuHis: 2.947 ± 1.217
5.403LeuIle: 5.403 ± 4.279
8.841LeuLys: 8.841 ± 1.493
10.314LeuLeu: 10.314 ± 2.616
1.965LeuMet: 1.965 ± 0.651
6.385LeuAsn: 6.385 ± 2.185
3.193LeuPro: 3.193 ± 0.393
3.193LeuGln: 3.193 ± 2.199
4.666LeuArg: 4.666 ± 7.866
6.876LeuSer: 6.876 ± 1.134
7.367LeuThr: 7.367 ± 2.066
4.42LeuVal: 4.42 ± 2.384
0.737LeuTrp: 0.737 ± 1.532
4.175LeuTyr: 4.175 ± 0.973
0.0LeuXaa: 0.0 ± 0.0
Met
1.965MetAla: 1.965 ± 2.531
0.491MetCys: 0.491 ± 0.174
0.982MetAsp: 0.982 ± 0.649
1.228MetGlu: 1.228 ± 0.403
0.982MetPhe: 0.982 ± 1.758
1.473MetGly: 1.473 ± 0.665
0.246MetHis: 0.246 ± 0.162
2.701MetIle: 2.701 ± 0.901
1.719MetLys: 1.719 ± 0.635
3.929MetLeu: 3.929 ± 1.141
0.491MetMet: 0.491 ± 0.174
1.228MetAsn: 1.228 ± 1.427
0.982MetPro: 0.982 ± 0.348
1.965MetGln: 1.965 ± 0.697
1.965MetArg: 1.965 ± 0.749
1.473MetSer: 1.473 ± 1.399
1.965MetThr: 1.965 ± 0.494
1.719MetVal: 1.719 ± 0.567
0.0MetTrp: 0.0 ± 0.0
0.737MetTyr: 0.737 ± 0.253
0.0MetXaa: 0.0 ± 0.0
Asn
4.912AsnAla: 4.912 ± 1.435
1.228AsnCys: 1.228 ± 0.795
3.684AsnAsp: 3.684 ± 1.408
3.684AsnGlu: 3.684 ± 1.266
2.21AsnPhe: 2.21 ± 1.292
0.982AsnGly: 0.982 ± 0.388
1.965AsnHis: 1.965 ± 0.494
3.193AsnIle: 3.193 ± 0.696
2.947AsnLys: 2.947 ± 1.045
3.929AsnLeu: 3.929 ± 1.552
1.719AsnMet: 1.719 ± 0.635
2.701AsnAsn: 2.701 ± 0.827
3.929AsnPro: 3.929 ± 1.251
1.228AsnGln: 1.228 ± 0.526
2.456AsnArg: 2.456 ± 1.076
1.473AsnSer: 1.473 ± 1.371
3.193AsnThr: 3.193 ± 1.461
1.965AsnVal: 1.965 ± 0.651
1.228AsnTrp: 1.228 ± 0.403
1.965AsnTyr: 1.965 ± 0.651
0.0AsnXaa: 0.0 ± 0.0
Pro
2.701ProAla: 2.701 ± 0.749
0.246ProCys: 0.246 ± 0.222
1.473ProAsp: 1.473 ± 0.694
3.193ProGlu: 3.193 ± 0.593
1.228ProPhe: 1.228 ± 0.526
3.193ProGly: 3.193 ± 0.446
0.246ProHis: 0.246 ± 0.162
3.193ProIle: 3.193 ± 0.696
2.21ProLys: 2.21 ± 1.285
1.719ProLeu: 1.719 ± 0.689
0.982ProMet: 0.982 ± 0.388
0.737ProAsn: 0.737 ± 0.253
0.246ProPro: 0.246 ± 0.222
0.737ProGln: 0.737 ± 0.253
1.719ProArg: 1.719 ± 0.635
1.473ProSer: 1.473 ± 0.694
2.21ProThr: 2.21 ± 0.432
1.228ProVal: 1.228 ± 0.538
0.491ProTrp: 0.491 ± 0.324
1.228ProTyr: 1.228 ± 0.403
0.0ProXaa: 0.0 ± 0.0
Gln
2.947GlnAla: 2.947 ± 0.969
0.982GlnCys: 0.982 ± 0.388
2.21GlnAsp: 2.21 ± 0.432
1.965GlnGlu: 1.965 ± 0.494
1.473GlnPhe: 1.473 ± 0.506
1.228GlnGly: 1.228 ± 0.403
0.491GlnHis: 0.491 ± 0.324
2.947GlnIle: 2.947 ± 1.045
2.947GlnLys: 2.947 ± 0.871
3.193GlnLeu: 3.193 ± 3.111
0.982GlnMet: 0.982 ± 1.511
2.21GlnAsn: 2.21 ± 0.76
1.228GlnPro: 1.228 ± 0.811
1.965GlnGln: 1.965 ± 1.799
2.456GlnArg: 2.456 ± 1.651
2.456GlnSer: 2.456 ± 1.052
4.175GlnThr: 4.175 ± 0.809
1.719GlnVal: 1.719 ± 0.694
0.491GlnTrp: 0.491 ± 0.877
1.473GlnTyr: 1.473 ± 0.506
0.0GlnXaa: 0.0 ± 0.0
Arg
0.982ArgAla: 0.982 ± 0.388
0.737ArgCys: 0.737 ± 0.364
2.21ArgAsp: 2.21 ± 0.76
4.912ArgGlu: 4.912 ± 2.248
2.701ArgPhe: 2.701 ± 1.021
0.737ArgGly: 0.737 ± 0.811
0.737ArgHis: 0.737 ± 0.253
4.42ArgIle: 4.42 ± 0.947
2.456ArgLys: 2.456 ± 0.885
4.42ArgLeu: 4.42 ± 9.628
0.982ArgMet: 0.982 ± 0.388
1.719ArgAsn: 1.719 ± 0.852
1.228ArgPro: 1.228 ± 1.433
1.228ArgGln: 1.228 ± 2.012
1.965ArgArg: 1.965 ± 1.011
2.701ArgSer: 2.701 ± 0.48
2.701ArgThr: 2.701 ± 1.605
1.965ArgVal: 1.965 ± 0.531
0.491ArgTrp: 0.491 ± 1.585
1.719ArgTyr: 1.719 ± 0.635
0.0ArgXaa: 0.0 ± 0.0
Ser
4.912SerAla: 4.912 ± 0.854
2.456SerCys: 2.456 ± 1.303
2.21SerAsp: 2.21 ± 0.865
3.684SerGlu: 3.684 ± 0.478
3.438SerPhe: 3.438 ± 1.219
3.684SerGly: 3.684 ± 1.627
0.982SerHis: 0.982 ± 0.348
3.929SerIle: 3.929 ± 1.301
6.139SerLys: 6.139 ± 1.134
5.403SerLeu: 5.403 ± 2.461
2.701SerMet: 2.701 ± 1.22
2.21SerAsn: 2.21 ± 0.736
1.719SerPro: 1.719 ± 0.852
1.965SerGln: 1.965 ± 1.286
2.456SerArg: 2.456 ± 1.903
7.122SerSer: 7.122 ± 8.994
5.157SerThr: 5.157 ± 3.328
4.42SerVal: 4.42 ± 1.272
0.246SerTrp: 0.246 ± 1.644
1.473SerTyr: 1.473 ± 0.729
0.0SerXaa: 0.0 ± 0.0
Thr
4.912ThrAla: 4.912 ± 0.771
2.456ThrCys: 2.456 ± 1.59
4.175ThrAsp: 4.175 ± 1.405
3.193ThrGlu: 3.193 ± 2.254
4.42ThrPhe: 4.42 ± 2.803
3.929ThrGly: 3.929 ± 1.508
1.965ThrHis: 1.965 ± 0.888
6.876ThrIle: 6.876 ± 3.172
5.648ThrLys: 5.648 ± 0.791
7.613ThrLeu: 7.613 ± 5.042
1.228ThrMet: 1.228 ± 1.502
3.193ThrAsn: 3.193 ± 1.079
1.965ThrPro: 1.965 ± 0.888
2.947ThrGln: 2.947 ± 2.303
2.21ThrArg: 2.21 ± 1.171
4.42ThrSer: 4.42 ± 1.472
4.175ThrThr: 4.175 ± 1.088
3.193ThrVal: 3.193 ± 2.129
0.491ThrTrp: 0.491 ± 0.174
3.684ThrTyr: 3.684 ± 0.479
0.0ThrXaa: 0.0 ± 0.0
Val
1.473ValAla: 1.473 ± 0.665
1.228ValCys: 1.228 ± 0.403
3.438ValAsp: 3.438 ± 1.135
4.912ValGlu: 4.912 ± 2.588
2.456ValPhe: 2.456 ± 0.807
2.456ValGly: 2.456 ± 1.514
1.719ValHis: 1.719 ± 0.635
3.929ValIle: 3.929 ± 1.303
3.684ValLys: 3.684 ± 1.999
5.648ValLeu: 5.648 ± 1.257
0.982ValMet: 0.982 ± 0.348
2.456ValAsn: 2.456 ± 1.476
0.982ValPro: 0.982 ± 0.649
1.719ValGln: 1.719 ± 0.635
1.473ValArg: 1.473 ± 1.371
4.42ValSer: 4.42 ± 1.272
4.42ValThr: 4.42 ± 0.856
2.21ValVal: 2.21 ± 0.799
0.491ValTrp: 0.491 ± 0.443
1.719ValTyr: 1.719 ± 0.542
0.0ValXaa: 0.0 ± 0.0
Trp
0.491TrpAla: 0.491 ± 0.174
0.246TrpCys: 0.246 ± 0.162
0.491TrpAsp: 0.491 ± 0.324
0.737TrpGlu: 0.737 ± 0.253
0.491TrpPhe: 0.491 ± 0.174
0.246TrpGly: 0.246 ± 0.222
0.0TrpHis: 0.0 ± 0.0
0.491TrpIle: 0.491 ± 0.443
0.246TrpLys: 0.246 ± 0.162
1.473TrpLeu: 1.473 ± 3.579
0.491TrpMet: 0.491 ± 0.174
0.491TrpAsn: 0.491 ± 0.324
0.0TrpPro: 0.0 ± 0.0
0.737TrpGln: 0.737 ± 0.253
0.246TrpArg: 0.246 ± 0.222
1.228TrpSer: 1.228 ± 0.83
0.491TrpThr: 0.491 ± 0.443
0.246TrpVal: 0.246 ± 1.644
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.982TyrAla: 0.982 ± 0.348
0.982TyrCys: 0.982 ± 0.577
0.982TyrAsp: 0.982 ± 0.731
3.193TyrGlu: 3.193 ± 1.545
2.456TyrPhe: 2.456 ± 1.076
1.719TyrGly: 1.719 ± 0.542
0.491TyrHis: 0.491 ± 0.443
4.42TyrIle: 4.42 ± 1.298
4.912TyrLys: 4.912 ± 0.912
2.947TyrLeu: 2.947 ± 0.871
1.473TyrMet: 1.473 ± 0.694
2.456TyrAsn: 2.456 ± 0.427
1.228TyrPro: 1.228 ± 0.403
0.982TyrGln: 0.982 ± 0.388
1.719TyrArg: 1.719 ± 0.689
1.473TyrSer: 1.473 ± 0.523
2.456TyrThr: 2.456 ± 0.427
1.965TyrVal: 1.965 ± 0.697
0.246TyrTrp: 0.246 ± 0.162
1.719TyrTyr: 1.719 ± 1.235
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4073 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski