Skip to content

7.1 Bulk Connect FASTQs

The bulk import feature links multiple samples to their associated FASTQ files simultaneously.

Warning

Sample names must match FASTQ prefix names exactly. A mismatch will cause the INDUCE-seq Analysis to fail.

Steps

  1. Select Data Registry from the left sidebar.
  2. Navigate to your sample manifest.
  3. Select Import, then Bulk Import Sequencing Data.
  4. Navigate to the folder containing your uploaded FASTQs.
  5. Select the FASTQ folder to import all FASTQs at once, or hold Shift and select individual files, then select Select.

LatchBio automatically infers potential sample names from FASTQ file name components. Components are divided by separators including underscores (_), hyphens (-), and periods (.).

!!! note For example, EPplus_24h_rep3.R1.fastq.gz contains 11 name components. It is strongly recommended to modify detected names to ensure they match your sample manifest.

  1. Select Modify Name Parsing, then Specify Manually. Choose one of the following approaches based on your file naming convention:
  1. In Specify Manually, select the number of FASTQ file name components to include. LatchBio shortens all file names to the same number of components.
  2. Review the new sample ID.
  3. Select Confirm.
  1. In Specify Manually, enable the Use Regex option.
  2. Enter a regular expression matching your naming convention. For example, ^[^.]+ captures all characters up to but not including the first period: sample1-4hr_rep1.fastq.gzsample1-4hr_rep1.
  3. Select Confirm.

!!! note Contact BSB Technical Support for assistance with complex regex patterns.

  1. Verify that each File is correctly associated with its Sample Id.

Import dropdown showing Bulk Import Sequencing Data option

Import Rules interface: detected Sample IDs matched against FASTQ files

  1. Confirm the resulting Sample Id values match the sample column in your sample manifest registry table.

Side-by-side view confirming Sample IDs from FASTQ files match sample manifest entries

  1. Select Next.

  2. Enable the Left Join option. This ensures only FASTQs with sample names matching entries in the sample manifest are returned.

  3. Select fastq_1 from the Target Column for Read 1 dropdown.

  4. For paired-end sequencing, select fastq_2 from the Target Column for Read 2 dropdown. For single-end sequencing, disable Target Column for Read 2.

  5. Select Import.

To edit an individual FASTQ file path after bulk import, see Manual Connect.