[Docker Series Part 2] Connecting Images and Containers with Dockerfiles


Part 1 showed how Docker bridges developer workflows and infrastructure. The Dockerfile is the bridge itself. Once you know how to craft an image and how that image becomes a running container, phrases like “capture the environment as code” stop sounding abstract. This post walks through a first Dockerfile, highlights the difference between images and containers, and keeps the entire lesson hands-on.

How this post flows

  1. What a Dockerfile is responsible for
  2. Syntax basics and frequently used instructions
  3. First lab: build a personal web server image
  4. Understanding layers and the build cache
  5. Practice the image → container → data flow
  6. What to learn next

Terms introduced here

  1. Layer: A filesystem snapshot stored during an image build. Instructions such as RUN, COPY, and ADD create layers that Docker can cache and reuse.
  2. Base image: The existing image you start from when creating a new one.
  3. Build context: The folder snapshot Docker can read files from while building an image.
  4. ENTRYPOINT: The main executable that always runs when the container starts.
  5. .dockerignore: A file that excludes unneeded paths from the build context so they never reach the image.

Reading card

  • Estimated time: 18 minutes
  • Prereqs: command-line Linux basics plus a way to clone projects with Git
  • After reading: you can explain, step by step, how one Dockerfile ties images and containers together.

What a Dockerfile is responsible for

Think of a Dockerfile as a recipe that spells out which environment to build and in what order. One file captures the base image, package installs, file copies, and startup command. Anyone who builds from that file gets the same image, and that image behaves the same whenever it becomes a container.

That promise becomes much stronger when you pin exact image tags. FROM ubuntu:24.04 points to a specific Ubuntu release, while FROM ubuntu:latest can change over time. If you want reproducible builds, use stable tags on purpose instead of floating ones.

That single file delivers real advantages:

  1. Reproducible environments: The same Dockerfile always yields the same image.
  2. Version control: Managing Dockerfiles in Git records every environment change.
  3. Faster learning: Beginners can read the file and understand the build flow instruction by instruction.

Keep this mental model in mind throughout the post:

  1. Dockerfile: the text recipe
  2. docker build: the step that turns the recipe into an image
  3. Image: the reusable package stored on disk
  4. docker run: the step that starts a container from that image
  5. Container: the running process created from the image
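The five steps above map onto a short command sequence. A minimal sketch, assuming a Dockerfile sits in the current folder and using a hypothetical tag demo:v1:

```shell
# Steps 1-2: turn the Dockerfile recipe into an image tagged demo:v1
docker build -t demo:v1 .

# Step 3: the image now sits on disk, ready to be reused
docker image ls demo

# Steps 4-5: start a container from the image; --rm removes it once it exits
docker run --rm demo:v1
```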

Syntax basics and frequently used instructions

The simplest Dockerfile looks like this:

FROM ubuntu:24.04
RUN apt-get update && apt-get install -y curl
COPY . /app
CMD ["/app/run.sh"]
  • FROM: chooses the starting point. Official images such as ubuntu, alpine, node, and nginx are the usual picks.
  • RUN: executes shell commands and stores the result as the next layer.
  • COPY: copies local files from the build context into the image.
  • ADD: can also unpack local tar archives or fetch remote URLs, but COPY is the clearer default when you only need file copies.
  • CMD: sets the default command.
  • ENTRYPOINT: sets the main executable. If you define both, CMD becomes the default arguments passed to ENTRYPOINT.
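The interplay between CMD and ENTRYPOINT is easiest to see in a tiny sketch. This is an illustrative fragment, separate from the lab project:

```dockerfile
FROM alpine:3.20
# The executable that always runs
ENTRYPOINT ["ping"]
# Default arguments, replaceable at run time
CMD ["-c", "3", "localhost"]
```

Running the image with no arguments executes ping -c 3 localhost, while appending arguments to docker run keeps the ENTRYPOINT and replaces only the CMD portion, for example ping -c 1 example.com.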

You will also see these instructions constantly:

  • WORKDIR /app: sets the working directory for all following instructions.
  • ENV NODE_ENV=production: defines environment variables that are available during later build steps and in the container at runtime.
  • EXPOSE 8080: annotates which port the container uses. Remember it is only metadata—you still have to publish a port with docker run -p or a Compose ports entry.

That last point trips up many beginners. Writing EXPOSE alone does not make a service reachable from a browser, and it does not secure or restrict the port either. It simply documents the intended listening port for people and tools that use the image.
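Publishing is a runtime decision, not an image property. A hedged example using the official nginx:alpine image, which declares EXPOSE 80 in its own Dockerfile:

```shell
# Explicitly map host port 8080 to the container's port 80
docker run -d -p 8080:80 nginx:alpine

# -P maps every EXPOSEd port to a random free host port instead
docker run -d -P nginx:alpine
```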

Dockerfile instructions do not all run at the same time, so separate build-time behavior from runtime behavior:

  • Build-time (while docker build creates the image): FROM, RUN, COPY, ADD
  • Runtime (when docker run starts a container): CMD, ENTRYPOINT

The build context also matters here. When you run docker build ., Docker uses the current folder as the build context and sends that snapshot to the builder. Large folders slow builds, which is why .dockerignore should exclude paths such as node_modules/, .git/, and *.log.
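A minimal .dockerignore covering those paths can be created like this (a sketch; adjust the patterns to your own project):

```shell
# Write a minimal .dockerignore next to the Dockerfile
cat <<'EOF' > .dockerignore
node_modules/
.git/
*.log
EOF

# Show what the builder will now skip
cat .dockerignore
```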

First lab: build your own web server image

Use this mini project to build Dockerfile instincts.

Before starting, make sure docker --version prints normally and port 8080 is free on your machine.

  1. Create a folder structure.
mkdir -p docker-lab/app && cd docker-lab
cat <<'EOF' > app/index.html
<!doctype html>
<h1>Hello Docker!</h1>
<p>This is my first container.</p>
EOF
  2. Write the Dockerfile.
# Dockerfile
FROM nginx:alpine
COPY ./app /usr/share/nginx/html
  3. Build and run it.
docker build -t my-nginx-lab .
docker run -d -p 8080:80 --name lab my-nginx-lab
curl http://localhost:8080
docker stop lab && docker rm lab

A short Dockerfile is enough to produce your own image, and running that image starts a web server right away. The COPY instruction literally bakes the HTML files into the image, so the container can serve them without needing your local editor or web server.

If docker run fails with a message saying port 8080 is already in use, change the command to -p 8081:80 and then open http://localhost:8081 instead.

Remember that Docker sends the entire current folder as the build context. In real projects you add .dockerignore entries such as node_modules, .git, and log files so large or sensitive paths never reach the image in the first place.

Understanding layers and the build cache

Now that you have built an image once, the cache has a job to do. Docker stores filesystem-changing steps such as RUN, COPY, and ADD as reusable layers. If nothing affecting one of those steps changes, Docker can reuse the cached result instead of rebuilding it.

Follow these cache-friendly guidelines:

  1. Place rarely changing steps near the top. Package installs and OS updates benefit most from reuse.
  2. Copy dependency files before application code. That way a source-code edit does not force a full dependency reinstall.
  3. Remember cache invalidation flows downward. If one layer changes, Docker rebuilds that layer and every layer after it.

A common pattern is copying package.json first, running npm install, and copying the rest of the source later. Changing application code no longer invalidates the dependency install cache.
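That pattern looks like this in a Node.js Dockerfile (an illustrative sketch; the file names assume a standard npm project with a hypothetical server.js entry point):

```dockerfile
FROM node:22-alpine
WORKDIR /app

# Dependency manifests first: this layer is reused until package*.json changes
COPY package*.json ./
RUN npm ci

# Source code last: editing it invalidates only the layers from here down
COPY . .
CMD ["node", "server.js"]
```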

Quick cache experiment

docker build -t my-nginx-lab:v1 .
printf '<p>Updated content</p>\n' >> app/index.html
docker build -t my-nginx-lab:v2 --progress=plain .

On the second build, Docker should reuse the unchanged base-image step and rebuild only the layers affected by the modified file. Look for CACHED lines in the output. Later in the series, multi-stage builds will build on this same idea to keep runtime images smaller.

Practice the image → container → data flow

If images and containers still feel blurry, try these two experiments.

1. Launch multiple containers from a single image

docker run -d --name web-a -p 8081:80 my-nginx-lab
docker run -d --name web-b -p 8082:80 my-nginx-lab

The same image spawns independent containers. Deleting one does not affect the other.

2. Attach a data volume

docker volume create html-data
docker run -d --name web-volume \
  -v html-data:/usr/share/nginx/html \
  -p 8083:80 nginx:alpine

docker exec -it web-volume sh -c "echo 'Volume Test' > /usr/share/nginx/html/index.html"
curl http://localhost:8083

docker rm -f web-volume
docker run -d --name web-volume \
  -v html-data:/usr/share/nginx/html \
  -p 8083:80 nginx:alpine
curl http://localhost:8083

Even after deleting the container, the index.html stored inside the named volume remains. That is the key distinction: containers are disposable, but volumes persist data outside the container lifecycle. Without a volume, files written only inside the container disappear when the container is removed.

What to learn next

With Dockerfile fundamentals in place, the next question is how to choose a base image. Should you stick with a familiar Ubuntu image, or switch to a lightweight Alpine image? That choice affects both learning difficulty and deployment strategy. Part 3 compares those two bases, shares a checklist you can run through on your own, and offers a Compose example to experiment with.

One last reminder: never embed API keys or passwords directly in a Dockerfile or pass them as casual build arguments. Secrets can leak into image history. Inject them later through environment variables, Docker secrets, or another dedicated secret manager.
