Amino acid dipepetide frequency for Curionopolis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.967AlaAla: 0.967 ± 1.046
0.483AlaCys: 0.483 ± 0.362
1.209AlaAsp: 1.209 ± 0.585
2.175AlaGlu: 2.175 ± 0.415
1.934AlaPhe: 1.934 ± 0.867
1.934AlaGly: 1.934 ± 0.486
0.725AlaHis: 0.725 ± 0.326
1.45AlaIle: 1.45 ± 0.913
2.417AlaLys: 2.417 ± 1.064
4.351AlaLeu: 4.351 ± 0.921
0.725AlaMet: 0.725 ± 0.409
1.209AlaAsn: 1.209 ± 0.603
0.725AlaPro: 0.725 ± 0.283
1.45AlaGln: 1.45 ± 0.595
2.417AlaArg: 2.417 ± 0.617
2.901AlaSer: 2.901 ± 0.953
0.967AlaThr: 0.967 ± 0.289
2.175AlaVal: 2.175 ± 0.763
0.0AlaTrp: 0.0 ± 0.0
1.692AlaTyr: 1.692 ± 0.384
0.0AlaXaa: 0.0 ± 0.0
Cys
0.483CysAla: 0.483 ± 0.314
0.0CysCys: 0.0 ± 0.0
0.483CysAsp: 0.483 ± 0.215
0.725CysGlu: 0.725 ± 0.596
1.209CysPhe: 1.209 ± 0.681
1.209CysGly: 1.209 ± 0.578
1.209CysHis: 1.209 ± 0.407
0.483CysIle: 0.483 ± 0.215
2.417CysLys: 2.417 ± 0.646
2.901CysLeu: 2.901 ± 1.035
0.0CysMet: 0.0 ± 0.0
0.967CysAsn: 0.967 ± 0.62
1.209CysPro: 1.209 ± 0.554
0.242CysGln: 0.242 ± 0.217
1.934CysArg: 1.934 ± 0.46
1.692CysSer: 1.692 ± 0.543
0.483CysThr: 0.483 ± 0.314
0.725CysVal: 0.725 ± 0.288
0.483CysTrp: 0.483 ± 0.215
0.725CysTyr: 0.725 ± 0.288
0.0CysXaa: 0.0 ± 0.0
Asp
1.692AspAla: 1.692 ± 1.116
0.725AspCys: 0.725 ± 0.536
4.593AspAsp: 4.593 ± 1.517
6.526AspGlu: 6.526 ± 2.287
3.626AspPhe: 3.626 ± 0.964
3.384AspGly: 3.384 ± 0.927
0.725AspHis: 0.725 ± 0.527
3.384AspIle: 3.384 ± 0.483
2.417AspLys: 2.417 ± 0.716
7.493AspLeu: 7.493 ± 1.531
1.45AspMet: 1.45 ± 1.101
1.209AspAsn: 1.209 ± 0.529
4.109AspPro: 4.109 ± 0.596
2.659AspGln: 2.659 ± 1.498
1.934AspArg: 1.934 ± 0.457
3.142AspSer: 3.142 ± 0.756
2.175AspThr: 2.175 ± 0.634
1.209AspVal: 1.209 ± 0.395
2.659AspTrp: 2.659 ± 1.272
3.384AspTyr: 3.384 ± 0.868
0.0AspXaa: 0.0 ± 0.0
Glu
2.659GluAla: 2.659 ± 0.84
1.45GluCys: 1.45 ± 0.458
5.318GluAsp: 5.318 ± 0.647
11.603GluGlu: 11.603 ± 1.651
5.076GluPhe: 5.076 ± 0.995
5.56GluGly: 5.56 ± 0.538
0.967GluHis: 0.967 ± 0.395
5.801GluIle: 5.801 ± 1.239
8.944GluLys: 8.944 ± 1.821
7.01GluLeu: 7.01 ± 0.667
2.175GluMet: 2.175 ± 0.43
3.142GluAsn: 3.142 ± 0.868
2.175GluPro: 2.175 ± 0.559
1.45GluGln: 1.45 ± 1.052
3.384GluArg: 3.384 ± 0.752
7.735GluSer: 7.735 ± 1.381
2.659GluThr: 2.659 ± 1.128
4.109GluVal: 4.109 ± 1.65
1.45GluTrp: 1.45 ± 0.436
2.175GluTyr: 2.175 ± 0.792
0.0GluXaa: 0.0 ± 0.0
Phe
0.967PheAla: 0.967 ± 0.667
2.175PheCys: 2.175 ± 0.697
2.417PheAsp: 2.417 ± 0.922
3.384PheGlu: 3.384 ± 0.806
1.692PhePhe: 1.692 ± 0.655
2.659PheGly: 2.659 ± 0.751
0.967PheHis: 0.967 ± 0.422
2.659PheIle: 2.659 ± 0.966
3.384PheLys: 3.384 ± 0.342
7.493PheLeu: 7.493 ± 1.994
0.725PheMet: 0.725 ± 0.288
1.45PheAsn: 1.45 ± 0.644
2.417PhePro: 2.417 ± 0.682
1.209PheGln: 1.209 ± 0.423
2.175PheArg: 2.175 ± 0.536
4.351PheSer: 4.351 ± 1.1
1.45PheThr: 1.45 ± 0.646
1.934PheVal: 1.934 ± 0.652
1.209PheTrp: 1.209 ± 0.419
0.967PheTyr: 0.967 ± 1.0
0.0PheXaa: 0.0 ± 0.0
Gly
0.967GlyAla: 0.967 ± 0.445
1.209GlyCys: 1.209 ± 0.681
3.384GlyAsp: 3.384 ± 0.984
7.01GlyGlu: 7.01 ± 1.104
1.934GlyPhe: 1.934 ± 0.415
5.076GlyGly: 5.076 ± 1.721
1.45GlyHis: 1.45 ± 0.457
3.384GlyIle: 3.384 ± 0.89
7.493GlyLys: 7.493 ± 2.343
5.076GlyLeu: 5.076 ± 1.009
2.901GlyMet: 2.901 ± 1.276
1.209GlyAsn: 1.209 ± 0.49
1.45GlyPro: 1.45 ± 0.477
1.692GlyGln: 1.692 ± 0.44
3.384GlyArg: 3.384 ± 0.923
6.043GlySer: 6.043 ± 1.582
4.109GlyThr: 4.109 ± 1.081
4.351GlyVal: 4.351 ± 1.445
0.483GlyTrp: 0.483 ± 0.314
2.417GlyTyr: 2.417 ± 0.678
0.0GlyXaa: 0.0 ± 0.0
His
0.725HisAla: 0.725 ± 0.309
0.483HisCys: 0.483 ± 0.215
1.209HisAsp: 1.209 ± 0.49
0.967HisGlu: 0.967 ± 0.398
0.967HisPhe: 0.967 ± 0.359
0.725HisGly: 0.725 ± 0.413
1.209HisHis: 1.209 ± 0.634
1.45HisIle: 1.45 ± 0.526
0.725HisLys: 0.725 ± 0.41
2.417HisLeu: 2.417 ± 0.918
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.725HisPro: 0.725 ± 0.288
0.967HisGln: 0.967 ± 0.395
0.967HisArg: 0.967 ± 0.387
1.692HisSer: 1.692 ± 0.545
0.242HisThr: 0.242 ± 0.136
1.45HisVal: 1.45 ± 0.707
0.483HisTrp: 0.483 ± 0.314
1.45HisTyr: 1.45 ± 0.402
0.0HisXaa: 0.0 ± 0.0
Ile
2.659IleAla: 2.659 ± 0.553
1.209IleCys: 1.209 ± 0.49
3.384IleAsp: 3.384 ± 0.557
5.318IleGlu: 5.318 ± 0.871
3.384IlePhe: 3.384 ± 0.866
5.56IleGly: 5.56 ± 2.011
1.45IleHis: 1.45 ± 0.78
4.834IleIle: 4.834 ± 0.794
8.944IleLys: 8.944 ± 1.212
5.801IleLeu: 5.801 ± 1.61
1.692IleMet: 1.692 ± 0.636
2.417IleAsn: 2.417 ± 0.678
3.626IlePro: 3.626 ± 0.711
1.692IleGln: 1.692 ± 0.366
4.351IleArg: 4.351 ± 1.233
3.868IleSer: 3.868 ± 0.725
1.934IleThr: 1.934 ± 0.377
4.834IleVal: 4.834 ± 1.357
1.209IleTrp: 1.209 ± 0.557
3.142IleTyr: 3.142 ± 0.711
0.0IleXaa: 0.0 ± 0.0
Lys
3.142LysAla: 3.142 ± 0.835
1.45LysCys: 1.45 ± 0.702
4.351LysAsp: 4.351 ± 1.283
5.56LysGlu: 5.56 ± 1.125
3.142LysPhe: 3.142 ± 0.679
7.01LysGly: 7.01 ± 1.004
0.483LysHis: 0.483 ± 0.272
7.493LysIle: 7.493 ± 1.256
7.493LysLys: 7.493 ± 2.094
6.768LysLeu: 6.768 ± 0.931
2.901LysMet: 2.901 ± 0.687
4.109LysAsn: 4.109 ± 1.312
3.626LysPro: 3.626 ± 1.098
1.209LysGln: 1.209 ± 0.529
5.801LysArg: 5.801 ± 1.11
6.768LysSer: 6.768 ± 1.564
6.285LysThr: 6.285 ± 1.604
3.384LysVal: 3.384 ± 0.729
1.934LysTrp: 1.934 ± 0.733
1.692LysTyr: 1.692 ± 0.545
0.0LysXaa: 0.0 ± 0.0
Leu
3.868LeuAla: 3.868 ± 1.07
1.45LeuCys: 1.45 ± 0.517
7.493LeuAsp: 7.493 ± 2.331
10.394LeuGlu: 10.394 ± 1.425
4.834LeuPhe: 4.834 ± 1.088
6.285LeuGly: 6.285 ± 1.423
0.967LeuHis: 0.967 ± 0.395
8.219LeuIle: 8.219 ± 1.09
9.185LeuLys: 9.185 ± 2.992
11.119LeuLeu: 11.119 ± 1.618
1.45LeuMet: 1.45 ± 0.517
4.834LeuAsn: 4.834 ± 0.652
2.175LeuPro: 2.175 ± 0.601
1.692LeuGln: 1.692 ± 0.358
4.834LeuArg: 4.834 ± 1.716
8.46LeuSer: 8.46 ± 0.931
3.868LeuThr: 3.868 ± 1.146
4.109LeuVal: 4.109 ± 0.862
0.483LeuTrp: 0.483 ± 0.303
3.142LeuTyr: 3.142 ± 0.643
0.0LeuXaa: 0.0 ± 0.0
Met
0.967MetAla: 0.967 ± 0.359
0.483MetCys: 0.483 ± 0.314
1.45MetAsp: 1.45 ± 0.434
1.209MetGlu: 1.209 ± 0.71
1.209MetPhe: 1.209 ± 0.423
1.45MetGly: 1.45 ± 0.462
0.0MetHis: 0.0 ± 0.0
2.175MetIle: 2.175 ± 0.84
3.142MetLys: 3.142 ± 0.987
1.934MetLeu: 1.934 ± 0.725
1.209MetMet: 1.209 ± 0.649
0.967MetAsn: 0.967 ± 0.606
0.725MetPro: 0.725 ± 0.751
0.967MetGln: 0.967 ± 0.456
1.45MetArg: 1.45 ± 0.517
1.209MetSer: 1.209 ± 0.903
1.934MetThr: 1.934 ± 0.481
0.242MetVal: 0.242 ± 0.333
0.483MetTrp: 0.483 ± 0.344
0.967MetTyr: 0.967 ± 0.359
0.0MetXaa: 0.0 ± 0.0
Asn
1.692AsnAla: 1.692 ± 0.565
0.967AsnCys: 0.967 ± 0.395
1.934AsnAsp: 1.934 ± 0.377
2.659AsnGlu: 2.659 ± 0.777
0.967AsnPhe: 0.967 ± 0.359
2.901AsnGly: 2.901 ± 0.803
0.725AsnHis: 0.725 ± 0.409
2.659AsnIle: 2.659 ± 0.966
3.384AsnLys: 3.384 ± 0.712
4.351AsnLeu: 4.351 ± 1.233
0.725AsnMet: 0.725 ± 0.33
2.417AsnAsn: 2.417 ± 0.518
2.901AsnPro: 2.901 ± 0.328
1.45AsnGln: 1.45 ± 0.643
2.901AsnArg: 2.901 ± 1.028
3.142AsnSer: 3.142 ± 1.146
1.934AsnThr: 1.934 ± 0.605
1.209AsnVal: 1.209 ± 0.313
1.45AsnTrp: 1.45 ± 0.646
0.967AsnTyr: 0.967 ± 0.975
0.0AsnXaa: 0.0 ± 0.0
Pro
0.725ProAla: 0.725 ± 0.332
0.967ProCys: 0.967 ± 0.49
3.142ProAsp: 3.142 ± 1.168
3.384ProGlu: 3.384 ± 1.306
1.45ProPhe: 1.45 ± 0.51
2.901ProGly: 2.901 ± 0.972
0.725ProHis: 0.725 ± 0.332
2.417ProIle: 2.417 ± 0.622
2.901ProLys: 2.901 ± 0.927
3.142ProLeu: 3.142 ± 0.82
0.483ProMet: 0.483 ± 0.303
2.659ProAsn: 2.659 ± 0.708
2.901ProPro: 2.901 ± 2.529
1.209ProGln: 1.209 ± 0.313
1.692ProArg: 1.692 ± 0.857
5.076ProSer: 5.076 ± 0.498
1.692ProThr: 1.692 ± 0.816
1.692ProVal: 1.692 ± 0.433
0.725ProTrp: 0.725 ± 0.41
1.45ProTyr: 1.45 ± 0.653
0.0ProXaa: 0.0 ± 0.0
Gln
1.45GlnAla: 1.45 ± 0.434
0.242GlnCys: 0.242 ± 0.136
1.692GlnAsp: 1.692 ± 0.678
2.175GlnGlu: 2.175 ± 0.439
0.967GlnPhe: 0.967 ± 0.445
2.175GlnGly: 2.175 ± 0.863
0.242GlnHis: 0.242 ± 0.217
2.901GlnIle: 2.901 ± 0.696
1.934GlnLys: 1.934 ± 0.845
0.725GlnLeu: 0.725 ± 0.309
1.209GlnMet: 1.209 ± 0.488
2.417GlnAsn: 2.417 ± 1.309
0.242GlnPro: 0.242 ± 0.497
0.483GlnGln: 0.483 ± 0.272
1.209GlnArg: 1.209 ± 0.681
3.142GlnSer: 3.142 ± 0.772
0.483GlnThr: 0.483 ± 0.362
2.175GlnVal: 2.175 ± 0.559
0.242GlnTrp: 0.242 ± 0.217
0.483GlnTyr: 0.483 ± 0.537
0.0GlnXaa: 0.0 ± 0.0
Arg
1.692ArgAla: 1.692 ± 0.797
1.692ArgCys: 1.692 ± 0.699
3.384ArgAsp: 3.384 ± 0.392
6.285ArgGlu: 6.285 ± 1.087
2.417ArgPhe: 2.417 ± 0.389
3.384ArgGly: 3.384 ± 1.243
1.209ArgHis: 1.209 ± 0.447
3.626ArgIle: 3.626 ± 0.951
3.384ArgLys: 3.384 ± 0.78
4.593ArgLeu: 4.593 ± 0.986
1.692ArgMet: 1.692 ± 0.801
2.175ArgAsn: 2.175 ± 0.845
1.692ArgPro: 1.692 ± 0.654
1.209ArgGln: 1.209 ± 0.47
4.351ArgArg: 4.351 ± 1.735
7.252ArgSer: 7.252 ± 1.913
3.384ArgThr: 3.384 ± 0.86
1.934ArgVal: 1.934 ± 0.602
0.725ArgTrp: 0.725 ± 0.409
1.45ArgTyr: 1.45 ± 0.402
0.0ArgXaa: 0.0 ± 0.0
Ser
3.384SerAla: 3.384 ± 0.873
1.934SerCys: 1.934 ± 0.784
6.526SerAsp: 6.526 ± 1.408
7.493SerGlu: 7.493 ± 1.502
2.901SerPhe: 2.901 ± 0.676
3.868SerGly: 3.868 ± 0.564
2.417SerHis: 2.417 ± 0.84
7.493SerIle: 7.493 ± 1.768
6.043SerLys: 6.043 ± 1.075
10.152SerLeu: 10.152 ± 2.06
0.725SerMet: 0.725 ± 0.409
2.659SerAsn: 2.659 ± 0.591
3.384SerPro: 3.384 ± 1.393
2.659SerGln: 2.659 ± 0.927
5.801SerArg: 5.801 ± 0.832
6.526SerSer: 6.526 ± 1.332
3.384SerThr: 3.384 ± 0.992
4.834SerVal: 4.834 ± 0.98
1.934SerTrp: 1.934 ± 0.87
2.175SerTyr: 2.175 ± 0.462
0.0SerXaa: 0.0 ± 0.0
Thr
1.692ThrAla: 1.692 ± 0.732
0.967ThrCys: 0.967 ± 0.394
2.417ThrAsp: 2.417 ± 0.771
1.934ThrGlu: 1.934 ± 0.647
1.934ThrPhe: 1.934 ± 0.878
3.384ThrGly: 3.384 ± 0.864
1.209ThrHis: 1.209 ± 0.62
2.417ThrIle: 2.417 ± 0.598
3.384ThrLys: 3.384 ± 1.044
3.626ThrLeu: 3.626 ± 0.827
1.209ThrMet: 1.209 ± 0.395
2.901ThrAsn: 2.901 ± 0.746
2.417ThrPro: 2.417 ± 0.943
0.967ThrGln: 0.967 ± 0.456
3.384ThrArg: 3.384 ± 0.73
5.076ThrSer: 5.076 ± 0.731
1.692ThrThr: 1.692 ± 0.565
2.659ThrVal: 2.659 ± 0.612
1.209ThrTrp: 1.209 ± 0.472
0.725ThrTyr: 0.725 ± 0.552
0.0ThrXaa: 0.0 ± 0.0
Val
0.967ValAla: 0.967 ± 0.519
0.967ValCys: 0.967 ± 0.456
1.934ValAsp: 1.934 ± 0.724
2.659ValGlu: 2.659 ± 0.716
2.659ValPhe: 2.659 ± 1.029
3.142ValGly: 3.142 ± 0.559
0.967ValHis: 0.967 ± 0.551
3.142ValIle: 3.142 ± 1.378
3.626ValLys: 3.626 ± 0.636
5.801ValLeu: 5.801 ± 1.132
1.209ValMet: 1.209 ± 0.681
2.901ValAsn: 2.901 ± 0.852
2.175ValPro: 2.175 ± 0.626
1.209ValGln: 1.209 ± 0.606
3.384ValArg: 3.384 ± 0.891
3.868ValSer: 3.868 ± 1.017
4.109ValThr: 4.109 ± 1.268
2.417ValVal: 2.417 ± 0.617
0.242ValTrp: 0.242 ± 0.217
0.967ValTyr: 0.967 ± 0.281
0.0ValXaa: 0.0 ± 0.0
Trp
0.725TrpAla: 0.725 ± 0.414
0.242TrpCys: 0.242 ± 0.136
1.209TrpAsp: 1.209 ± 0.339
1.45TrpGlu: 1.45 ± 0.575
0.725TrpPhe: 0.725 ± 0.288
0.725TrpGly: 0.725 ± 0.409
0.725TrpHis: 0.725 ± 0.288
2.417TrpIle: 2.417 ± 0.421
0.483TrpLys: 0.483 ± 0.447
1.45TrpLeu: 1.45 ± 1.114
0.242TrpMet: 0.242 ± 0.136
0.725TrpAsn: 0.725 ± 0.288
0.483TrpPro: 0.483 ± 0.416
0.242TrpGln: 0.242 ± 0.381
0.725TrpArg: 0.725 ± 0.409
0.967TrpSer: 0.967 ± 0.431
1.934TrpThr: 1.934 ± 0.814
1.209TrpVal: 1.209 ± 1.23
0.725TrpTrp: 0.725 ± 0.65
0.725TrpTyr: 0.725 ± 0.326
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.483TyrAla: 0.483 ± 0.272
0.483TyrCys: 0.483 ± 0.314
0.967TyrAsp: 0.967 ± 0.359
1.934TyrGlu: 1.934 ± 1.127
2.659TyrPhe: 2.659 ± 0.591
1.934TyrGly: 1.934 ± 0.627
0.483TyrHis: 0.483 ± 0.504
2.901TyrIle: 2.901 ± 1.079
2.901TyrLys: 2.901 ± 1.442
2.659TyrLeu: 2.659 ± 0.691
1.209TyrMet: 1.209 ± 1.374
0.967TyrAsn: 0.967 ± 0.387
2.175TyrPro: 2.175 ± 0.751
1.934TyrGln: 1.934 ± 0.538
1.45TyrArg: 1.45 ± 1.072
3.384TyrSer: 3.384 ± 1.798
0.483TyrThr: 0.483 ± 0.666
1.692TyrVal: 1.692 ± 0.551
0.0TyrTrp: 0.0 ± 0.0
1.209TyrTyr: 1.209 ± 0.377
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4138 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski