Amino acid dipeptide frequency for Caligus rogercresseyi rhabdovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
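The comparison against the uniform expectation can be illustrated with a minimal Python sketch. It is not the database's own code: the function names, the overlapping-window counting, the normalization by the total number of counted pairs, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_UNIFORM = 1000.0 / 400     # 2.5 per mille, i.e. 0.25 %

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions of this sketch: overlapping adjacent pairs are counted,
    pairs never span two proteins, and counts are normalized by the
    total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to 'X' (Xaa).
        seq = "".join(r if r in STANDARD else "X" for r in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

def classify(freq_permille, expected=EXPECTED_UNIFORM):
    """Label a frequency as over- or underrepresented versus the expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKLA", "AKKA"]
freqs = dipeptide_permille(toy)
```

With the values from the table, `classify(8.298)` (AspLeu) falls into the overrepresented group and `classify(0.803)` (AlaCys) into the underrepresented one.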

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
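As a small worked example of this unit conversion, the sketch below converts a per mille value into a plain fraction and into percent. The helper names are hypothetical; the LeuLeu value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

leuleu = 10.439  # LeuLeu value from the table, in per mille
print(f"{permille_to_fraction(leuleu):.6f}")   # as a plain fraction
print(f"{permille_to_percent(leuleu):.4f} %")  # as percent
```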

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.212 ± 1.247
AlaCys: 0.803 ± 0.862
AlaAsp: 3.48 ± 0.645
AlaGlu: 3.212 ± 1.061
AlaPhe: 2.141 ± 0.438
AlaGly: 2.141 ± 0.49
AlaHis: 0.268 ± 0.159
AlaIle: 4.015 ± 1.158
AlaLys: 1.606 ± 0.336
AlaLeu: 5.621 ± 1.281
AlaMet: 1.071 ± 0.352
AlaAsn: 2.409 ± 0.614
AlaPro: 1.071 ± 0.445
AlaGln: 2.677 ± 0.721
AlaArg: 5.353 ± 0.461
AlaSer: 2.944 ± 1.619
AlaThr: 5.353 ± 0.936
AlaVal: 2.409 ± 1.217
AlaTrp: 1.338 ± 0.612
AlaTyr: 1.606 ± 0.54
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.071 ± 0.791
CysCys: 0.0 ± 0.0
CysAsp: 0.803 ± 0.636
CysGlu: 0.0 ± 0.0
CysPhe: 0.268 ± 0.159
CysGly: 0.535 ± 0.509
CysHis: 0.0 ± 0.0
CysIle: 0.535 ± 0.352
CysLys: 1.874 ± 1.694
CysLeu: 1.606 ± 1.017
CysMet: 0.0 ± 0.0
CysAsn: 0.535 ± 0.41
CysPro: 0.803 ± 0.269
CysGln: 0.535 ± 0.317
CysArg: 1.606 ± 0.561
CysSer: 2.141 ± 1.546
CysThr: 1.071 ± 0.906
CysVal: 1.338 ± 0.447
CysTrp: 0.268 ± 0.159
CysTyr: 0.268 ± 0.159
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.015 ± 1.269
AspCys: 0.803 ± 0.397
AspAsp: 3.747 ± 1.372
AspGlu: 3.747 ± 0.958
AspPhe: 2.141 ± 0.526
AspGly: 2.944 ± 1.049
AspHis: 1.071 ± 0.414
AspIle: 3.48 ± 0.695
AspLys: 2.677 ± 0.775
AspLeu: 8.298 ± 1.058
AspMet: 1.071 ± 0.42
AspAsn: 1.338 ± 0.325
AspPro: 3.212 ± 0.687
AspGln: 2.677 ± 0.721
AspArg: 3.212 ± 1.944
AspSer: 4.015 ± 0.507
AspThr: 2.944 ± 1.095
AspVal: 1.874 ± 0.799
AspTrp: 1.071 ± 0.437
AspTyr: 2.944 ± 0.661
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.141 ± 1.124
GluCys: 1.071 ± 1.0
GluAsp: 4.55 ± 1.175
GluGlu: 6.424 ± 0.683
GluPhe: 2.141 ± 1.009
GluGly: 3.48 ± 0.559
GluHis: 2.141 ± 1.58
GluIle: 3.747 ± 0.851
GluLys: 4.283 ± 1.286
GluLeu: 4.283 ± 1.031
GluMet: 1.874 ± 0.507
GluAsn: 4.818 ± 0.777
GluPro: 3.48 ± 0.659
GluGln: 0.535 ± 0.41
GluArg: 2.944 ± 1.17
GluSer: 5.353 ± 0.398
GluThr: 3.747 ± 1.392
GluVal: 4.015 ± 0.896
GluTrp: 0.535 ± 0.317
GluTyr: 1.874 ± 0.254
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.071 ± 0.634
PheCys: 1.071 ± 0.451
PheAsp: 1.338 ± 0.568
PheGlu: 1.606 ± 0.414
PhePhe: 2.409 ± 1.078
PheGly: 3.48 ± 1.425
PheHis: 0.803 ± 0.476
PheIle: 2.141 ± 0.619
PheLys: 2.409 ± 0.631
PheLeu: 4.015 ± 0.739
PheMet: 0.803 ± 0.331
PheAsn: 2.141 ± 0.296
PhePro: 2.141 ± 0.637
PheGln: 1.338 ± 0.594
PheArg: 2.409 ± 0.637
PheSer: 3.48 ± 1.103
PheThr: 1.071 ± 0.394
PheVal: 2.141 ± 0.593
PheTrp: 0.535 ± 0.317
PheTyr: 1.071 ± 0.562
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.944 ± 0.705
GlyCys: 1.338 ± 0.459
GlyAsp: 3.212 ± 1.247
GlyGlu: 4.55 ± 1.091
GlyPhe: 1.606 ± 0.394
GlyGly: 3.48 ± 0.752
GlyHis: 1.338 ± 0.452
GlyIle: 3.48 ± 0.734
GlyLys: 4.283 ± 0.473
GlyLeu: 9.101 ± 1.449
GlyMet: 0.803 ± 0.35
GlyAsn: 2.141 ± 0.45
GlyPro: 3.48 ± 0.83
GlyGln: 2.677 ± 0.982
GlyArg: 4.283 ± 0.712
GlySer: 6.692 ± 0.898
GlyThr: 2.944 ± 0.885
GlyVal: 2.944 ± 0.903
GlyTrp: 0.535 ± 0.317
GlyTyr: 2.141 ± 0.652
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.803 ± 0.916
HisCys: 0.268 ± 0.368
HisAsp: 0.803 ± 0.488
HisGlu: 1.606 ± 0.414
HisPhe: 1.338 ± 0.325
HisGly: 0.268 ± 0.159
HisHis: 0.803 ± 0.636
HisIle: 0.803 ± 0.331
HisLys: 1.606 ± 0.394
HisLeu: 3.48 ± 1.128
HisMet: 0.535 ± 0.272
HisAsn: 0.268 ± 0.368
HisPro: 1.071 ± 0.353
HisGln: 0.535 ± 0.281
HisArg: 0.803 ± 0.331
HisSer: 2.409 ± 1.162
HisThr: 1.338 ± 0.423
HisVal: 1.071 ± 0.634
HisTrp: 0.803 ± 0.331
HisTyr: 2.141 ± 0.933
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.212 ± 0.865
IleCys: 0.268 ± 0.368
IleAsp: 3.48 ± 1.214
IleGlu: 4.015 ± 1.03
IlePhe: 2.141 ± 0.865
IleGly: 2.677 ± 0.523
IleHis: 1.606 ± 0.663
IleIle: 2.944 ± 0.661
IleLys: 3.48 ± 1.761
IleLeu: 6.692 ± 1.386
IleMet: 0.535 ± 0.281
IleAsn: 1.338 ± 0.793
IlePro: 3.48 ± 0.625
IleGln: 2.677 ± 1.133
IleArg: 6.424 ± 1.207
IleSer: 6.959 ± 1.704
IleThr: 3.212 ± 1.037
IleVal: 2.944 ± 1.543
IleTrp: 0.0 ± 0.0
IleTyr: 1.338 ± 0.677
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.606 ± 0.54
LysCys: 1.338 ± 0.612
LysAsp: 2.944 ± 1.03
LysGlu: 4.283 ± 0.625
LysPhe: 1.874 ± 1.144
LysGly: 4.283 ± 1.032
LysHis: 1.338 ± 0.52
LysIle: 4.015 ± 1.38
LysLys: 5.353 ± 2.251
LysLeu: 6.156 ± 1.173
LysMet: 0.803 ± 0.651
LysAsn: 1.071 ± 0.394
LysPro: 1.338 ± 0.594
LysGln: 0.268 ± 0.159
LysArg: 2.944 ± 1.154
LysSer: 3.48 ± 1.19
LysThr: 5.086 ± 1.29
LysVal: 4.015 ± 0.955
LysTrp: 1.874 ± 0.787
LysTyr: 1.338 ± 0.842
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.424 ± 1.065
LeuCys: 1.071 ± 1.231
LeuAsp: 6.692 ± 0.849
LeuGlu: 5.889 ± 1.108
LeuPhe: 4.818 ± 1.175
LeuGly: 7.227 ± 0.957
LeuHis: 3.212 ± 0.631
LeuIle: 7.495 ± 1.262
LeuLys: 4.55 ± 1.165
LeuLeu: 10.439 ± 2.458
LeuMet: 3.48 ± 1.221
LeuAsn: 3.747 ± 0.372
LeuPro: 5.621 ± 0.911
LeuGln: 5.086 ± 0.771
LeuArg: 8.298 ± 2.082
LeuSer: 7.762 ± 0.834
LeuThr: 6.692 ± 0.696
LeuVal: 5.353 ± 0.837
LeuTrp: 1.606 ± 1.017
LeuTyr: 3.48 ± 0.463
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.874 ± 0.279
MetCys: 0.268 ± 0.159
MetAsp: 0.803 ± 0.761
MetGlu: 1.071 ± 1.038
MetPhe: 0.803 ± 0.476
MetGly: 1.874 ± 0.759
MetHis: 0.0 ± 0.0
MetIle: 1.071 ± 0.341
MetLys: 1.338 ± 0.423
MetLeu: 1.338 ± 0.793
MetMet: 0.268 ± 0.314
MetAsn: 0.803 ± 0.433
MetPro: 0.535 ± 0.281
MetGln: 0.803 ± 0.574
MetArg: 2.141 ± 1.415
MetSer: 1.338 ± 0.614
MetThr: 1.606 ± 0.584
MetVal: 1.071 ± 0.437
MetTrp: 0.268 ± 0.159
MetTyr: 1.071 ± 0.562
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.606 ± 0.618
AsnCys: 0.803 ± 0.66
AsnAsp: 1.606 ± 0.805
AsnGlu: 0.803 ± 0.476
AsnPhe: 1.338 ± 0.608
AsnGly: 2.409 ± 0.407
AsnHis: 1.071 ± 0.712
AsnIle: 1.606 ± 0.687
AsnLys: 1.874 ± 0.509
AsnLeu: 6.959 ± 1.705
AsnMet: 0.535 ± 0.36
AsnAsn: 1.874 ± 1.341
AsnPro: 2.677 ± 0.925
AsnGln: 1.071 ± 0.394
AsnArg: 1.874 ± 0.63
AsnSer: 2.944 ± 0.982
AsnThr: 2.944 ± 0.659
AsnVal: 1.874 ± 0.845
AsnTrp: 0.803 ± 0.476
AsnTyr: 0.268 ± 0.159
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.48 ± 0.855
ProCys: 0.535 ± 0.281
ProAsp: 2.677 ± 0.533
ProGlu: 5.889 ± 0.877
ProPhe: 1.606 ± 0.545
ProGly: 4.283 ± 2.285
ProHis: 0.535 ± 0.281
ProIle: 2.409 ± 0.588
ProLys: 1.606 ± 0.641
ProLeu: 5.889 ± 1.659
ProMet: 0.268 ± 0.159
ProAsn: 1.071 ± 0.712
ProPro: 3.212 ± 0.426
ProGln: 1.606 ± 1.612
ProArg: 2.409 ± 0.33
ProSer: 3.212 ± 0.87
ProThr: 2.944 ± 1.145
ProVal: 1.874 ± 0.523
ProTrp: 1.874 ± 0.279
ProTyr: 2.409 ± 0.797
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.409 ± 0.322
GlnCys: 0.803 ± 0.454
GlnAsp: 1.338 ± 0.52
GlnGlu: 1.338 ± 1.225
GlnPhe: 2.141 ± 0.646
GlnGly: 2.944 ± 1.086
GlnHis: 0.535 ± 0.317
GlnIle: 1.606 ± 0.843
GlnLys: 1.338 ± 0.817
GlnLeu: 2.944 ± 0.395
GlnMet: 0.803 ± 0.69
GlnAsn: 1.338 ± 0.349
GlnPro: 0.535 ± 0.352
GlnGln: 1.071 ± 0.562
GlnArg: 2.141 ± 0.57
GlnSer: 2.944 ± 0.911
GlnThr: 2.409 ± 0.798
GlnVal: 2.944 ± 0.523
GlnTrp: 0.535 ± 0.737
GlnTyr: 1.606 ± 0.829
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.747 ± 1.589
ArgCys: 1.071 ± 0.326
ArgAsp: 6.156 ± 1.489
ArgGlu: 6.156 ± 1.173
ArgPhe: 2.677 ± 1.258
ArgGly: 5.086 ± 1.132
ArgHis: 0.803 ± 0.476
ArgIle: 4.015 ± 0.896
ArgLys: 4.818 ± 0.919
ArgLeu: 6.959 ± 0.527
ArgMet: 1.071 ± 0.326
ArgAsn: 0.803 ± 0.331
ArgPro: 2.677 ± 1.428
ArgGln: 1.071 ± 0.634
ArgArg: 2.677 ± 0.841
ArgSer: 6.692 ± 0.827
ArgThr: 4.55 ± 0.855
ArgVal: 3.747 ± 1.716
ArgTrp: 1.338 ± 0.49
ArgTyr: 2.944 ± 1.072
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.889 ± 1.686
SerCys: 0.268 ± 0.159
SerAsp: 3.747 ± 0.628
SerGlu: 4.55 ± 0.988
SerPhe: 3.212 ± 0.952
SerGly: 6.692 ± 1.17
SerHis: 2.677 ± 0.539
SerIle: 6.156 ± 1.307
SerLys: 4.015 ± 1.675
SerLeu: 8.565 ± 1.021
SerMet: 1.606 ± 0.843
SerAsn: 2.141 ± 0.536
SerPro: 5.621 ± 2.112
SerGln: 2.944 ± 0.543
SerArg: 5.353 ± 1.063
SerSer: 4.55 ± 0.735
SerThr: 5.353 ± 0.281
SerVal: 4.818 ± 0.245
SerTrp: 2.141 ± 0.637
SerTyr: 1.606 ± 0.321
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.409 ± 0.964
ThrCys: 0.803 ± 0.916
ThrAsp: 3.747 ± 1.555
ThrGlu: 2.944 ± 0.574
ThrPhe: 0.803 ± 0.454
ThrGly: 2.944 ± 1.076
ThrHis: 1.338 ± 0.594
ThrIle: 2.409 ± 1.689
ThrLys: 3.48 ± 0.296
ThrLeu: 6.959 ± 0.955
ThrMet: 1.338 ± 0.952
ThrAsn: 5.086 ± 0.88
ThrPro: 2.409 ± 1.397
ThrGln: 2.141 ± 1.049
ThrArg: 6.692 ± 0.811
ThrSer: 6.424 ± 1.418
ThrThr: 5.621 ± 1.811
ThrVal: 4.55 ± 0.406
ThrTrp: 1.606 ± 0.663
ThrTyr: 1.606 ± 0.336
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.606 ± 1.446
ValCys: 2.141 ± 0.729
ValAsp: 2.677 ± 0.523
ValGlu: 2.677 ± 0.493
ValPhe: 2.141 ± 0.593
ValGly: 3.747 ± 0.999
ValHis: 1.874 ± 0.427
ValIle: 4.015 ± 0.66
ValLys: 2.677 ± 1.181
ValLeu: 5.889 ± 1.538
ValMet: 2.141 ± 0.897
ValAsn: 1.874 ± 0.858
ValPro: 2.944 ± 0.388
ValGln: 2.141 ± 0.797
ValArg: 3.747 ± 1.01
ValSer: 4.55 ± 0.788
ValThr: 4.283 ± 1.098
ValVal: 2.944 ± 0.459
ValTrp: 0.535 ± 0.281
ValTyr: 1.338 ± 0.349
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.338 ± 0.594
TrpCys: 0.0 ± 0.0
TrpAsp: 1.338 ± 0.527
TrpGlu: 1.606 ± 0.538
TrpPhe: 0.803 ± 0.399
TrpGly: 1.071 ± 0.634
TrpHis: 0.0 ± 0.0
TrpIle: 2.141 ± 0.593
TrpLys: 0.535 ± 0.317
TrpLeu: 0.535 ± 0.317
TrpMet: 0.268 ± 0.368
TrpAsn: 1.338 ± 0.568
TrpPro: 1.071 ± 0.562
TrpGln: 0.0 ± 0.0
TrpArg: 1.338 ± 0.614
TrpSer: 1.338 ± 0.459
TrpThr: 1.606 ± 0.945
TrpVal: 1.606 ± 0.71
TrpTrp: 0.268 ± 0.159
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.874 ± 0.528
TyrCys: 0.535 ± 0.281
TyrAsp: 2.141 ± 0.514
TyrGlu: 1.071 ± 0.353
TyrPhe: 1.071 ± 0.562
TyrGly: 2.409 ± 0.881
TyrHis: 1.338 ± 0.612
TyrIle: 1.338 ± 0.793
TyrLys: 1.606 ± 0.71
TyrLeu: 3.212 ± 0.978
TyrMet: 0.803 ± 0.574
TyrAsn: 1.071 ± 0.437
TyrPro: 2.409 ± 0.614
TyrGln: 1.874 ± 0.872
TyrArg: 2.409 ± 0.631
TyrSer: 2.677 ± 0.528
TyrThr: 0.535 ± 0.581
TyrVal: 2.409 ± 0.742
TyrTrp: 0.268 ± 0.467
TyrTyr: 1.071 ± 0.634
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3737 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
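The page does not describe the bootstrap procedure in detail. The following is a minimal Python sketch of one plausible reading: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of the replicates is reported as the error. The function names, the use of the standard deviation, the fixed seed, and the toy sequences are assumptions of this sketch, not the method confirmed by the source.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins):
    """Frequencies (per mille) of overlapping adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by protein-level bootstrap.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency; the sample standard deviation over all
    rounds is returned (an assumption of this sketch).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKLA", "AKKA", "MLLKA"]
err = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein intact, so with only a few proteins the error mainly reflects how much the proteins differ from one another.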

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski